[CRIU] Docker checkpoint: real-world workloads

Alice Frosi alice at linux.vnet.ibm.com
Fri Sep 22 17:28:03 MSK 2017


If the attachment does not work, please let me know and I will upload somewhere

On 22.09.2017 15:53, Ross Boucher wrote:

> Can you post the full dump and restore logs somewhere and link to them?
>
> On Fri, Sep 22, 2017 at 9:43 AM Alice Frosi <alice at linux.vnet.ibm.com 
> <mailto:alice at linux.vnet.ibm.com>> wrote:
>
>     Docker version:
>
>     Client:
>       Version:      17.10.0-dev
>       API version:  1.32
>       Go version:   go1.9
>       Git commit:
>       Built:        Thu Sep 21 10:13:54 2017
>       OS/Arch:      linux/s390x
>
>     Server:
>       Version:      17.06.0-dev
>       API version:  1.32 (minimum version 1.12)
>       Go version:   go1.8.3
>       Git commit:   e9cb25e967
>       Built:        Thu Sep 21 09:31:03 2017
>       OS/Arch:      linux/s390x
>       Experimental: true
>
>     Docker checkpoint commands:
>
>     docker checkpoint create CONTAINER_NAME checkpoint
>
>     docker start CONTAINER_NAME --checkpoint=checkpoint
>
>     Thanks for the help
>
>     Alice
>
>>     Could you share the version of Docker you were running, along
>>     with the commands you used to invoke the checkpoint and restore?
>>
>>     On Fri, Sep 22, 2017 at 8:05 AM Alice Frosi
>>     <alice at linux.vnet.ibm.com <mailto:alice at linux.vnet.ibm.com>> wrote:
>>
>>         Hi,
>>
>>         We have started to test docker checkpoint with a real-world
>>         workloads.
>>
>>         Until now we tried with acmeair-nodejs [1] with the
>>         microservice usecase.
>>
>>         This application creates 10 containers and they interact
>>         between each other with volumes and network
>>
>>         connection. The checkpoint war not successful. Here, the
>>         error message in restore:
>>
>>         (00.195135)      1: No iptables-9.img image
>>
>>         (00.195136)      1: Error (criu/include/image.h:144): BUG at
>>         criu/include/image.h:144
>>
>>         (00.195171) Error (criu/cr-restore.c:1284): 15638 killed by
>>         signal 127: Unknown signal 127
>>
>>         (00.315156) Error (criu/mount.c:2980): mnt: Can't remove the
>>         directory /tmp/.criu.mntns.C12otH: No such file or directory
>>
>>         (00.315160) Error (criu/cr-restore.c:2152): Restoring FAILED.
>>
>>         We tested with criu version:
>>
>>         Version: 3.4
>>
>>         GitID: v3.4-224-g3706af7b
>>
>>         1) Any ideas for what could have caused the error?
>>
>>         Additionally, we have two more general questions.
>>
>>         2) How should we perform checkpoint for applications with
>>         multiple container interaction?
>>
>>         3) For which use cases have you already tried the C/R?
>>
>>         We saw this overview in your wiki [2], but it seems to be not
>>         recently updated.
>>
>>         Regards
>>
>>         Alice Frosi
>>
>>         [1]
>>         https://github.com/blueperf/acmeair-nodejs/blob/master/README_Docker.md
>>         [2] https://criu.org/What_software_is_supported
>>
>>
>>
>>         _______________________________________________
>>         CRIU mailing list
>>         CRIU at openvz.org <mailto:CRIU at openvz.org>
>>         https://lists.openvz.org/mailman/listinfo/criu
>>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openvz.org/pipermail/criu/attachments/20170922/54b76c9f/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dump.log
Type: text/x-log
Size: 163762 bytes
Desc: not available
URL: <http://lists.openvz.org/pipermail/criu/attachments/20170922/54b76c9f/attachment-0002.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: restore.log
Type: text/x-log
Size: 22095 bytes
Desc: not available
URL: <http://lists.openvz.org/pipermail/criu/attachments/20170922/54b76c9f/attachment-0003.bin>


More information about the CRIU mailing list