[Devel] [PATCH] spfs: Main process wakes and kills its children on exit

Kirill Tkhai ktkhai at virtuozzo.com
Tue Jan 23 13:17:23 MSK 2018


On 23.01.2018 13:14, Stanislav Kinsburskiy wrote:
> 
> 
> 23.01.2018 11:09, Kirill Tkhai пишет:
>> On 23.01.2018 13:07, Stanislav Kinsburskiy wrote:
>>>
>>>
>>> 23.01.2018 10:51, Kirill Tkhai пишет:
>>>> On 23.01.2018 12:48, Stanislav Kinsburskiy wrote:
>>>>> Please, see a couple of nits below
>>>>>
>>>>> 23.01.2018 10:41, Kirill Tkhai пишет:
>>>>>> Stanislav Kinsburskiy says:
>>>>>>
>>>>>> "SPFS manager has a special "--exit-with-spfs" options, which is used by CRIU.
>>>>>>  The idea of the option is simple: force SPFS manager to exit, when it has some
>>>>>>  SPFS processes among its children (i.e. spfs was mounted at least once),
>>>>>>  but all these processes have exited for whatever reason (which usually happens
>>>>>>  when restore has failed and spfs mounts where unmounted).
>>>>>>  Although it works in overall (main SPFS manager process exits), its children
>>>>>>  (responsible to SPFS replacement) may wait on FUTEX for "release" command
>>>>>>  for corresponding SPFS mount and thus never stop until they are killed".
>>>>>>
>>>>>> 1 spfs-manager
>>>>>> 2   \_ spfs
>>>>>> 3   \_ spfs-manager
>>>>>> 4   \_ spfs
>>>>>> 5   \_ spfs-manager
>>>>>>
>>>>>> 2 and 3 are pair of a mount, and 4 and 5 are pair of another mount.
>>>>>> The patch makes spfs-manager 1 kill 3 in case of 2 exited.
>>>>>>
>>>>>> https://jira.sw.ru/browse/PSBM-80055
>>>>>>
>>>>>> Signed-off-by: Kirill Tkhai <ktkhai at virtuozzo.com>
>>>>>> ---
>>>>>>  manager/context.c |    4 ++++
>>>>>>  manager/spfs.c    |    1 +
>>>>>>  2 files changed, 5 insertions(+)
>>>>>>
>>>>>> diff --git a/manager/context.c b/manager/context.c
>>>>>> index 1eb37c9..4464a23 100644
>>>>>> --- a/manager/context.c
>>>>>> +++ b/manager/context.c
>>>>>> @@ -53,6 +53,9 @@ static void cleanup_spfs_mount(struct spfs_manager_context_s *ctx,
>>>>>>  		/* SPFS master was killed. We need to release the reference */
>>>>>>  		spfs_release_mnt(info);
>>>>>>  
>>>>>> +	if (killed || WEXITSTATUS(status))
>>>>>> +		kill(info->replacer, SIGKILL);
>>>>>> +
>>>>>
>>>>> There is "if (killed)" check above.
>>>>> Could you please move this hunk there?
>>>>
>>>> There is logical OR (||) in the hunk. How should I move it to unconditional check?
>>>>
>>>
>>> Ah, ok. Then let the check be as it is.
>>> Could you please add warning message for this kill with the process pid being killed and the reason why (spfs was killed of exited with error)?
>>
>> Maybe we should call spfs_release_mnt() in case of exit status != 0 too?
>>
> 
> Well, this makes sense.
> If spfs exited with non-zero result (either it was killed or exited due to some error), then there is no need in replacer. And there is no need in the reference.
> So, probably this check you proposed should be used for both spfs_release_mnt(info) and kill(info->replacer, SIGKILL).

Are you OK with this change sent in the only patch, or should we use one more patch?


More information about the Devel mailing list