[Devel] Re: [PATCH 1/1] c/r: define s390-specific checkpoint-restart code (v3)

Serge E. Hallyn serue at us.ibm.com
Wed Feb 4 10:27:13 PST 2009


Quoting Christian Borntraeger (borntraeger at de.ibm.com):
> Am Tuesday 03 February 2009 17:12:23 schrieb Serge E. Hallyn:
> [...]
> > +/* Nothing to do for mm context state */
> > +int cr_write_mm_context(struct cr_ctx *ctx, struct mm_struct *mm, int
> > parent) +{
> > +	struct cr_hdr h;
> > +	struct cr_hdr_mm_context *hh = cr_hbuf_get(ctx, sizeof(*hh));
> > +	int ret;
> > +
> > +	h.type = CR_HDR_MM_CONTEXT;
> > +	h.len = sizeof(*hh);
> > +	h.parent = parent;
> > +
> > +#if 0
> > +	/* Oren's v13 is on an older kernel which has no vdso_base */
> > +	/* on newer kernel, we'll have to enable this */
> > +	hh->vdso_base = mm->context.vdso_base;
> > +	printk(KERN_NOTICE "checkpointing vdso_base %lx\n", hh->vdso_base);
> > +#else
> > +	hh->vdso_base = 0;
> > +#endif
> > +
> > +	ret = cr_write_obj(ctx, &h, hh);
> > +	cr_hbuf_put(ctx, sizeof(*hh));
> > +
> > +	return ret;
> > +}

Hi,  thanks for taking a look.  (I can definately use some help)

> Hmm, maybe you should also save/restore other elements of mm_context_t.

Oren pointed out (on Jan 15, see
https://lists.linux-foundation.org/pipermail/containers/2009-January/015304.html)
that the arch-independent restart code will re-create the checkpointed
memory mappings using do_mmap_pgoff().  So the other mm_context_t
contents should be automatically handled, right?

> At least noexec, has_pgste and alloc_pgste have an impact on the page table 
> layout and special features like no execute or the ability to run kvm guests.

I went ahead and added those three - but they're always all 0 on my
testcases so far, and restoring them doesn't fix my restart segfault :(

How and why are they supposed to be set though?  Looking through the
s390 arch code, I don't really see anything justifying hand-tweaking
them.

> > +int cr_read_mm_context(struct cr_ctx *ctx, struct mm_struct *mm, int
> > rparent) +{
> > +	struct cr_hdr_mm_context *hh = cr_hbuf_get(ctx, sizeof(*hh));
> > +	int parent, ret = -EINVAL;
> > +
> > +	s390_enable_sie();
> 
> Hmm, why do you call s390_enable_sie()? It will fail on multi-threaded apps 
> and will create enhanced page tables for running kvm guest otherwise. It is 
> not needed for non-kvm processes. See the has_pgste/alloc_pgste topic above.

Short answer: bc martin told me to :)  But I take it I misunderstood,
and should only do that if running in kvm?
(That email at
https://lists.linux-foundation.org/pipermail/containers/2009-January/015305.html)

But thinking through this:  if some task 798 is running inside kvm,
it will have already done s390_enable_sie(), right?  So if it now
does sys_restart(), we shouldn't run that again, right?

So I've removed it again.  (Which still doesn't solve my segfault on
restart.  At the moment I'm busily learning about the VM debugger :)

thanks,
-serge
_______________________________________________
Containers mailing list
Containers at lists.linux-foundation.org
https://lists.linux-foundation.org/mailman/listinfo/containers




More information about the Devel mailing list