[Devel] Re: [RFC v6][PATCH 0/9] Kernel based checkpoint/restart
Ingo Molnar
mingo at elte.hu
Thu Oct 9 06:44:15 PDT 2008
* Dave Hansen <dave at linux.vnet.ibm.com> wrote:
> On Thu, 2008-10-09 at 15:17 +0200, Ingo Molnar wrote:
> > * Dave Hansen <dave at linux.vnet.ibm.com> wrote
> > > On Thu, 2008-10-09 at 14:46 +0200, Ingo Molnar wrote:
> > > > i'm wondering about the following productization aspect: it would be
> > > > very useful to applications and users if they knew whether it is safe to
> > > > checkpoint a given app. I.e. whether that app has any state that cannot
> > > > be stored/restored yet.
> > >
> > > Absolutely!
> > >
> > > My first inclination was to do this at checkpoint time: detect and
> > > tell users why an app or container can't actually be checkpointed.
> > > But, if I get you right, you're talking about something that happens
> > > more during the runtime of the app than during the checkpoint. This
> > > sounds like a wonderful approach to me, and much better than what I
> > > was thinking of.
> > >
> > > What kind of mechanism do you have in mind?
> > >
> > > int sys_remap_file_pages(...)
> > > {
> > > ...
> > > oh_crap_we_dont_support_this_yet(current);
> > > }
> > >
> > > Then the oh_crap..() function sets a task flag or something?
> >
> > yeah, something like that. A key aspect of it is that is has to be very
> > low-key on the source code level - we dont want to sprinkle the kernel
> > with anything ugly. Perhaps something pretty explicit:
> >
> > current->flags |= PF_NOCR;
>
> Am I miscounting, or are we out of these suckers on 32-bit platforms?
We've still got a few holes: you can pick 0x00000020, 0x00000080,
0x00004000, 0x08000000.
> > as we do the same thing today for certain facilities:
> >
> > current->flags |= PF_NOFREEZE;
> >
> > you probably want to hide it behind:
> >
> > set_current_nocr();
>
> Yeah, that all looks reasonable. Letting this be a dynamic thing
> where you can move back and forth between the two states would make a
> lot of sense too. But, for now, I guess it can be a one-way trip.
there might be races as well, especially with proxy state - and
current->flags updates are not serialized.
So maybe it should be a completely separate flag after all? Stick it
into the end of task_struct perhaps.
Ingo
_______________________________________________
Containers mailing list
Containers at lists.linux-foundation.org
https://lists.linux-foundation.org/mailman/listinfo/containers
More information about the Devel
mailing list