[CRIU] p.haul: mtouch test "Pre-dump failed"

Andrew Vagin avagin at parallels.com
Fri Mar 28 01:41:31 PDT 2014


On Thu, Mar 27, 2014 at 05:47:32PM +0100, Oscar Fernández wrote:
> Hello and congratulations for your great work.
> 
> I'm interested in p.haul and wanted to see how it works. I found the mtouch
> test as an example of live process migration
> ( https://github.com/xemul/p.haul/blob/master/test/mtouch/HOWTO ). I followed
> all the steps, but I get an error when executing the command
> 
> # ./p.haul pid $test_pid $destination_node_ip

Which version of CRIU do you use?

Could you provide more info?

$ lsof -p $test_pid > mem-touch.lsof
$ strace -fo criu.predump.log ./criu pre-dump -v4 -o pd.log -D tmp -t 945

and send criu.predump.log, mem-touch.lsof, and tmp/pd.log to us.
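
Before sending, it may help to skim tmp/pd.log for the first failure yourself.
The following is only a rough sketch: it assumes that CRIU marks failing log
lines with the word "Error" (and warnings with "Warn") after the timestamp, and
find_problems is a hypothetical helper name, not part of CRIU or p.haul.

```python
import re

# Assumption: CRIU log lines that report problems contain the word
# "Error" or "Warn" after the "(seconds.micros)" timestamp prefix.
PROBLEM = re.compile(r"\b(Error|Warn)\b")


def find_problems(lines):
    """Return (line_number, text) pairs for lines that look like problems.

    `lines` is any iterable of strings, e.g. an open log file.
    Line numbers are 1-based so they match what an editor shows.
    """
    hits = []
    for number, line in enumerate(lines, 1):
        if PROBLEM.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits


# Example use (the path comes from the pre-dump command above):
#   with open("tmp/pd.log", errors="replace") as log:
#       for number, text in find_problems(log):
#           print(number, text)
```

The first reported line is usually the most useful one to quote in a reply;
the full logs are still needed to see the surrounding context.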

Thanks.

> 
> at the source node.
> 
> I'm trying to do the migration between two Ubuntu Server 13.10 VMs. Should I
> try another distro?
> 
> The output at the SRC node is:
> 
> Connecting to target host
> Initialized shared FS hauler
> Connecting to CRIU service
> Preliminary FS migration
> Starting iterations
> * Iteration 0
> making directory /var/local/p.haul-fs/14.03.27-15.35-bvrTFM/img/1
> Issuing pre-dump command to service
> Traceback (most recent call last):
>   File "p.haul/p.haul", line 39, in <module>
>     worker.start_migration()
>   File "/home/test/p.haul/p_haul_iters.py", line 100, in start_migration
>     raise Exception("Pre-dump failed")
> Exception: Pre-dump failed
> 
> The output at the DST node is:
> 
> Connected
> Selecting htype to ('pid', '1403')
> Making directory /var/local/p.haul-fs/14.03.27-15.35-lI4_j3/img/1
> Starting page server for iter 1
> Sending criu rpc req
> Page server started at 1613
> Disconnected
> Stopping page server 1613
> 
> The criu service log file at SRC node is:
> 
> (00.017306) The service socket is bound to /var/run/criu_service.socket
> (00.018764) Waiting for connection...
> (3032.36109 Connected.
> (3032.36131 Waiting for connection...
> (3032.37836 Restrict C/R with 0:0 uid
> (3032.39072 Worker(pid 1505) exited with 1
> 
> The criu service log file at DST node is:
> 
> (00.012570) The service socket is bound to /var/run/criu_service.socket
> (00.014230) Waiting for connection...
> (2995.80278 Connected.
> (2995.80293 Waiting for connection...
> (2995.80455 Restrict C/R with 0:0 uid
> (2995.80500 Worker(pid 1556) exited with 0
> 
> I checked the source code and found that at p_haul_iters.py line 99 the value
> of resp.success is False, so the exception is raised. I tried to find the
> reason for that failure, without success. I'm running all commands in the test
> as root, and the NFS share grants full permissions to everybody (777).
> 
> Any idea what the reason might be? Which distro did you use to make the test
> work?
> 
> Thank you
> 
> --
> Óscar Fernández
> 

> _______________________________________________
> CRIU mailing list
> CRIU at openvz.org
> https://lists.openvz.org/mailman/listinfo/criu


