[CRIU] Fake mount points in dump

Stanislav Kinsburskiy skinsbursky at odin.com
Tue Jan 19 09:09:47 PST 2016



19.01.2016 16:44, Pavel Emelyanov пишет:
> On 01/19/2016 05:54 PM, Stanislav Kinsburskiy wrote:
>>
>> 19.01.2016 13:28, Pavel Emelyanov пишет:
>>> On 01/19/2016 01:46 PM, Stanislav Kinsburskiy wrote:
>>>> 18.01.2016 17:26, Pavel Emelyanov пишет:
>>>>> On 01/18/2016 03:35 PM, Stanislav Kinsburskiy wrote:
>>>>>> Hi,
>>>>>>
>>>>>> I'm trying to suspend a container with the following mount list in it:
>>>>>>
>>>>>> [root at centos-7-x86_64 ~]# cat /proc/mounts
>>>>>> rootfs / rootfs rw 0 0
>>>>>> /dev/ploop43992p1 / ext4 rw,relatime,data=ordered,balloon_ino=12 0 0
>>>>>> none /sys sysfs rw,relatime,ve=102 0 0
>>>>>> none /sys/fs/cgroup tmpfs rw,relatime,size=1931780k,nr_inodes=482945 0 0
>>>>>> cgroup /sys/fs/cgroup/cpuset cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,cpuset 0 0
>>>>>> cgroup /sys/fs/cgroup/cpu,cpuacct cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,cpuacct,cpu 0 0
>>>>>> cgroup /sys/fs/cgroup/memory cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,memory 0 0
>>>>>> cgroup /sys/fs/cgroup/devices cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,devices 0 0
>>>>>> cgroup /sys/fs/cgroup/freezer cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,freezer 0 0
>>>>>> cgroup /sys/fs/cgroup/net_cls cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,net_cls 0 0
>>>>>> cgroup /sys/fs/cgroup/blkio cgroup rw,nosuid,nodev,noexec,relatime,blkio 0 0
>>>>>> cgroup /sys/fs/cgroup/perf_event cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,perf_event 0 0
>>>>>> cgroup /sys/fs/cgroup/hugetlb cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,hugetlb 0 0
>>>>>> cgroup /sys/fs/cgroup/systemd cgroup
>>>>>> rw,nosuid,nodev,noexec,relatime,xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd
>>>>>> 0 0
>>>>>> proc /proc proc rw,relatime 0 0
>>>>>> devtmpfs /dev devtmpfs rw,nosuid,size=1931780k,nr_inodes=482945 0 0
>>>>>> tmpfs /dev/shm tmpfs rw,nosuid,nodev 0 0
>>>>>> devpts /dev/pts devpts
>>>>>> rw,nosuid,noexec,relatime,gid=5,mode=620,ptmxmode=000 0 0
>>>>>> tmpfs /run tmpfs rw,nosuid,nodev,mode=755 0 0
>>>>>> mqueue /dev/mqueue mqueue rw,relatime 0 0
>>>>>> sunrpc /var/lib/nfs/rpc_pipefs rpc_pipefs rw,relatime 0 0
>>>>>>
>>>>>> and get the following error (persistent):
>>>>>>
>>>>>> (00.221054) mnt: <--
>>>>>> (00.221489)     type proc source proc mnt_id 20 s_dev 0x3 / @ ./proc
>>>>>> flags 0x30000e options
>>>>>> (00.221651)     type sysfs source sysfs mnt_id 21 s_dev 0x13 / @ ./sys
>>>>>> flags 0x30000e options
>>>>>> (00.221754)     type devtmpfs source devtmpfs mnt_id 22 s_dev 0x5 / @
>>>>>> ./dev flags 0x1100000 options size=1922508k,nr_inodes=480627,mode=755
>>>>>> (00.221842)     type securityfs source securityfs mnt_id 23 s_dev 0x14 /
>>>>>> @ ./sys/kernel/security flags 0x30000e options
>>>>>> (00.221927)     type tmpfs source tmpfs mnt_id 24 s_dev 0x15 / @
>>>>>> ./dev/shm flags 0x1100000 options
>>>>>> (00.222015)     type devpts source devpts mnt_id 25 s_dev 0xb / @
>>>>>> ./dev/pts flags 0x30000a options gid=5,mode=620,ptmxmode=000
>>>>>> (00.222163)     type tmpfs source tmpfs mnt_id 26 s_dev 0x16 / @ ./run
>>>>>> flags 0x1100000 options mode=755
>>>>>> (00.222624)     type tmpfs source tmpfs mnt_id 27 s_dev 0x17 / @
>>>>>> ./sys/fs/cgroup flags 0x1100000 options mode=755
>>>>>> (00.222716)     type cgroup source cgroup mnt_id 28 s_dev 0x18 / @
>>>>>> ./sys/fs/cgroup/systemd flags 0x30000e options
>>>>>> xattr,release_agent=/usr/lib/systemd/systemd-cgroups-agent,name=systemd
>>>>>> (00.222803)     type pstore source pstore mnt_id 29 s_dev 0x19 / @
>>>>>> ./sys/fs/pstore flags 0x30000e options
>>>>>> (00.222917)     type cgroup source cgroup mnt_id 30 s_dev 0x12 / @
>>>>>> ./sys/fs/cgroup/cpuset flags 0x30000e options cpuset
>>>>>> (00.223022)     type cgroup source cgroup mnt_id 31 s_dev 0x11 / @
>>>>>> ./sys/fs/cgroup/cpu,cpuacct flags 0x30000e options cpuacct,cpu
>>>>>> (00.223147)     type cgroup source cgroup mnt_id 32 s_dev 0xf / @
>>>>>> ./sys/fs/cgroup/memory flags 0x30000e options memory
>>>>>> (00.223303)     type cgroup source cgroup mnt_id 33 s_dev 0x1a / @
>>>>>> ./sys/fs/cgroup/devices flags 0x30000e options devices
>>>>>> (00.223425)     type cgroup source cgroup mnt_id 34 s_dev 0x1b / @
>>>>>> ./sys/fs/cgroup/freezer flags 0x30000e options freezer
>>>>>> (00.223514)     type cgroup source cgroup mnt_id 35 s_dev 0x1c / @
>>>>>> ./sys/fs/cgroup/net_cls flags 0x30000e options net_cls
>>>>>> (00.223602)     type cgroup source cgroup mnt_id 36 s_dev 0xe / @
>>>>>> ./sys/fs/cgroup/blkio flags 0x30000e options blkio
>>>>>> (00.223688)     type cgroup source cgroup mnt_id 37 s_dev 0x1d / @
>>>>>> ./sys/fs/cgroup/perf_event flags 0x30000e options perf_event
>>>>>> (00.223773)     type cgroup source cgroup mnt_id 38 s_dev 0x1e / @
>>>>>> ./sys/fs/cgroup/hugetlb flags 0x30000e options hugetlb
>>>>>> (00.223858)     type cgroup source cgroup mnt_id 39 s_dev 0x1f / @
>>>>>> ./sys/fs/cgroup/ve flags 0x30000e options ve
>>>>>> (00.224002)     type cgroup source cgroup mnt_id 40 s_dev 0x10 / @
>>>>>> ./sys/fs/cgroup/beancounter flags 0x30000e options beancounter
>>>>>> (00.224116)     type configfs source configfs mnt_id 43 s_dev 0x20 / @
>>>>>> ./sys/kernel/config flags 0x300000 options
>>>>>> (00.224274)     type ext4 source /dev/mapper/vz_skinsbursky--vz7-root
>>>>>> mnt_id 44 s_dev 0xfd00001 / @ ./ flags 0x300000 options
>>>>>> quota,usrquota,grpquota,data=ordered
>>>>>> (00.224387)     type autofs source systemd-1 mnt_id 45 s_dev 0x21 / @
>>>>>> ./proc/sys/fs/binfmt_misc flags 0x300000 options
>>>>>> fd=32,pgrp=1,timeout=300,minproto=5,maxproto=5,direct
>>>>>> (00.224485) Error (autofs.c:220): Failed to find pipe_ino option (old
>>>>>> kernel?)
>>>>>> (00.224565) Error (proc_parse.c:1385): Failed to parse FS specific data
>>>>>> on ./proc/sys/fs/binfmt_misc
>>>>>> (00.224686) Error (mount.c:1908): mnt: Can't parse 83507's mountinfo
>>>>>> (00.224770) Error (mount.c:824): mnt: Failed to find criu pid's mount ns
>>>>> This error only appear if you use --ext-mount-map auto, do you?
>>>> Yes, above is correct.
>>>> What does it mean?
>>> It means than any external bind-mounts are tried to get auto-resolved.
>>> Your error means that either a) we don't have such, but they are erroneously
>>> detected or b) we have such, they are auto-detected by parsing host's
>>> mount points and latter contain unsupported entries. In the latter case
>>> we can fix it by relaxing the requirement for host's mountpoints e.g.
>>> we can ignore options, since we don't need them.
>> This is probably the former case.
>> If I shutdown autofs service on host, dump succeeds.
>> One more question: what makes you think, that it's the former case?
> I didn't tell I thought this was the former case.
>
>> Who or what can bind-mount service mount to a container?
> This can be done by vzctl or by kernel propagation, but I don't
> know whether this is the case.
>
> Looking at the resolve_external_mounts() I see that we parse host's
> mountpoints regardless of whether we have ext mounts or not. So I
> withdraw my previous comment and make new statement: host's mount
> points contain unsupported entries, but we "fail in advance" w/o
> actually checking whether we need them at all.

This message:

(00.122918)     type autofs source systemd-1 mnt_id 86 s_dev 0x2e / @ 
./proc/sys/fs/binfmt_misc flags 0x300000 options 
fd=43,pgrp=1,timeout=300,minproto=5,maxproto=5,direct

is printed in parse_mountinfo(). Thus this information comes from kernel.
It looks strange to me, that mount_info of criu process contains 
container's mount points + autofs from host.
Isn't it?


> -- Pavel



More information about the CRIU mailing list