[CRIU] p.haul: mtouch test "Pre-dump failed"

Oscar Fernández oscarfernandz at gmail.com
Thu Mar 27 09:47:32 PDT 2014


Hello and congratulations for your great work.

I'm interested in p.haul and I wanted to see how it works. I found the
mtouch test as a process live migration example (
https://github.com/xemul/p.haul/blob/master/test/mtouch/HOWTO ), I follow
all the steps but I get an error when executing the command

# ./p.haul pid $test_pid $destination_node_ip

at the source node.

I'm trying to do the migration between two Ubuntu server 13.10 VMs, should
I try with other distro?

The output at the SRC node is:

Connecting to target host
Initialized shared FS hauler
Connecting to CRIU service
Preliminary FS migration
Starting iterations
* Iteration 0
making directory /var/local/p.haul-fs/14.03.27-15.35-bvrTFM/img/1
Issuing pre-dump command to service
Traceback (most recent call last):
  File "p.haul/p.haul", line 39, in <module>
    worker.start_migration()
  File "/home/test/p.haul/p_haul_iters.py", line 100, in start_migration
    raise Exception("Pre-dump failed")
Exception: Pre-dump failed

The output at the DST node is:

Connected
Selecting htype to ('pid', '1403')
Making directory /var/local/p.haul-fs/14.03.27-15.35-lI4_j3/img/1
Starting page server for iter 1
Sending criu rpc req
Page server started at 1613
Disconnected
Stopping page server 1613

The criu service log file at SRC node is:

(00.017306) The service socket is bound to /var/run/criu_service.socket
(00.018764) Waiting for connection...
(3032.36109 Connected.
(3032.36131 Waiting for connection...
(3032.37836 Restrict C/R with 0:0 uid
(3032.39072 Worker(pid 1505) exited with 1

The criu service log file at DST node is:

(00.012570) The service socket is bound to /var/run/criu_service.socket
(00.014230) Waiting for connection...
(2995.80278 Connected.
(2995.80293 Waiting for connection...
(2995.80455 Restrict C/R with 0:0 uid
(2995.80500 Worker(pid 1556) exited with 0

I checked the source code and found that in p_haul_iters.py line 99, the
value of resp.success is False, so an Exception is raised. I tried to find
the reason of that failure without success. I'm running all comands in test
as root and the NFS has all permissions to everybody (777).

Any idea of what can be the reason? What distro did you use to make the
test work?

Thank you

-- 
Óscar Fernández
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openvz.org/pipermail/criu/attachments/20140327/1060dd03/attachment.html>


More information about the CRIU mailing list