[CRIU] [PATCH v5 2/3] tests: fix builds on alpine and centos
Adrian Reber
adrian at lisas.de
Thu Jun 28 15:43:28 MSK 2018
From: Adrian Reber <areber at redhat.com>
Install sudo, create test user with ID 1000, install bash,
fix pidfile creation and pidfile chmod.
v2:
* use sleep to give the criu daemon some time to start up
v3:
* Andrei is of course right and sleep is not a good solution.
Now that criu service supports --status-fd, this
is how we detect that criu is ready.
v4:
* This was much more complicated than expected because the
different travis test targets ship different versions of the
tools. There seems to be a bug in bash on Ubuntu
https://lists.gnu.org/archive/html/bug-bash/2017-07/msg00039.html
which prevents using 'read -n1' there. As a workaround
the result from CRIU's status FD is now read via python.
Another problem was discovered on alpine with the loop restore test.
CRIU says to use setsid even though the process is already started
with setsid. As a workaround the process, still started with setsid,
is now checkpointed and restored with shell-job set to true.
Parts of v2 have already been committed, so some of the changes from
this commit are already present in another commit.
Signed-off-by: Adrian Reber <areber at redhat.com>
---
scripts/build/Dockerfile.centos | 4 ++++
test/others/rpc/Makefile | 17 +++++++++++++----
test/others/rpc/read.py | 18 ++++++++++++++++++
test/others/rpc/restore-loop.py | 5 +++++
test/others/rpc/run.sh | 4 +++-
5 files changed, 43 insertions(+), 5 deletions(-)
create mode 100644 test/others/rpc/read.py
diff --git a/scripts/build/Dockerfile.centos b/scripts/build/Dockerfile.centos
index 0160b75..d8e70ac 100644
--- a/scripts/build/Dockerfile.centos
+++ b/scripts/build/Dockerfile.centos
@@ -40,4 +40,8 @@ WORKDIR /criu
ENV CCACHE_DIR=/tmp/.ccache CCACHE_NOCOMPRESS=1 $ENV1=yes
RUN mv .ccache /tmp && make mrproper && ccache -sz && \
date && make -j $(nproc) CC="$CC" && date && ccache -s
+
+# The rpc test cases are running as user #1000, let's add the user
+RUN adduser -u 1000 test
+
RUN make -C test/zdtm -j $(nproc)
diff --git a/test/others/rpc/Makefile b/test/others/rpc/Makefile
index 2b15873..50cd063 100644
--- a/test/others/rpc/Makefile
+++ b/test/others/rpc/Makefile
@@ -4,13 +4,22 @@ all: test-c rpc_pb2.py criu
CFLAGS += -g -Werror -Wall -I.
LDLIBS += -lprotobuf-c
+PYTHON ?= python
+
run: all
mkdir -p build
chmod a+rwx build
- @# need to start the criu daemon here to access the pidfile
- sudo -g '#1000' -u '#1000' ./criu service -v4 -W build -o service.log --address criu_service.socket -d --pidfile pidfile
- # Give the criu daemon some time to start up
- sleep 0.5
+ rm -f build/status
+ sudo -g '#1000' -u '#1000' mkfifo build/status
+ @# Need to start the criu daemon here to access the pidfile.
+ @# The script read.py is used to wait until 'criu service'
+ @# is ready. 'read -n 1' in some bash releases has a bug and does
+ @# not correctly read a \0, so python is used as a workaround.
+ sudo -g '#1000' -u '#1000' -- bash -c "exec 200<>build/status; \
+ ./criu service -v4 -W build --address criu_service.socket \
+ -d --pidfile pidfile -o service.log --status-fd 200; \
+ $(PYTHON) read.py build/status"
+ rm -f build/status
chmod a+rw build/pidfile
sudo -g '#1000' -u '#1000' ./run.sh
sudo -g '#1000' -u '#1000' ./version.py
diff --git a/test/others/rpc/read.py b/test/others/rpc/read.py
new file mode 100644
index 0000000..680069b
--- /dev/null
+++ b/test/others/rpc/read.py
@@ -0,0 +1,18 @@
+# This script is used to read a single character from CRIU's status FD.
+# That way we know when the CRIU service is ready. CRIU writes a \0 to
+# the status FD.
+# In theory this could be easily done using 'read -n 1' from bash,
+# but the bash version on Ubuntu probably has the following bug:
+# https://lists.gnu.org/archive/html/bug-bash/2017-07/msg00039.html
+
+import os
+import sys
+
+
+f = open(sys.argv[1])
+r = f.read(1)
+
+if r == '\0':
+ sys.exit(0)
+
+sys.exit(-1)
diff --git a/test/others/rpc/restore-loop.py b/test/others/rpc/restore-loop.py
index 21e93b9..1ea9bf3 100755
--- a/test/others/rpc/restore-loop.py
+++ b/test/others/rpc/restore-loop.py
@@ -19,6 +19,11 @@ s.connect(args['socket'])
req = rpc.criu_req()
req.type = rpc.RESTORE
req.opts.images_dir_fd = os.open(args['dir'], os.O_DIRECTORY)
+# As the dumped process is running with setsid this should not
+# be necessary. There seems to be a problem for this testcase
+# in combination with alpine's setsid.
+# The dump is now done with -j and the restore also.
+req.opts.shell_job = True
# Send request
s.send(req.SerializeToString())
diff --git a/test/others/rpc/run.sh b/test/others/rpc/run.sh
index aaf48f4..d1facd8 100755
--- a/test/others/rpc/run.sh
+++ b/test/others/rpc/run.sh
@@ -50,7 +50,9 @@ function test_restore_loop {
echo "pid ${P}"
title_print "Dump loop.sh"
- ${CRIU} dump -v4 -o dump-loop.log -D build/imgs_loop -t ${P}
+ # So theoretically '-j' (--shell-job) should not be necessary, but on alpine
+ # this test fails without it.
+ ${CRIU} dump -j -v4 -o dump-loop.log -D build/imgs_loop -t ${P}
title_print "Run restore-loop"
./restore-loop.py build/criu_service.socket build/imgs_loop
--
1.8.3.1
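
The readiness handshake described above (the service writes a single \0 to its status FD, and the test waits for that byte instead of sleeping) can be tried in isolation. The following is only a minimal sketch and not part of the patch: wait_for_ready, fake_service and demo are hypothetical names, and a thread stands in for 'criu service --status-fd'. It assumes Linux, where opening a FIFO with O_RDWR does not block (the same trick as 'exec 200<>build/status' in the Makefile).

```python
import os
import tempfile
import threading


def wait_for_ready(fifo_path):
    """Block until one byte arrives on the FIFO; True if it is a NUL byte."""
    # Assumption: Linux semantics. O_RDWR lets open() return even if no
    # writer has opened the FIFO yet, like 'exec 200<>build/status'.
    fd = os.open(fifo_path, os.O_RDWR)
    try:
        # Read raw bytes so the NUL byte is compared without text decoding.
        return os.read(fd, 1) == b"\0"
    finally:
        os.close(fd)


def demo():
    """Run the handshake against a stand-in writer and return the result."""
    workdir = tempfile.mkdtemp()
    fifo = os.path.join(workdir, "status")
    os.mkfifo(fifo)

    def fake_service():
        # Hypothetical stand-in for the daemon: signal readiness with '\0'.
        fd = os.open(fifo, os.O_WRONLY)
        os.write(fd, b"\0")
        os.close(fd)

    writer = threading.Thread(target=fake_service)
    writer.start()
    ready = wait_for_ready(fifo)
    writer.join()

    # Clean up the temporary FIFO and directory.
    os.unlink(fifo)
    os.rmdir(workdir)
    return ready


if __name__ == "__main__":
    print("service ready:", demo())
```

Unlike the fixed 'sleep 0.5' from v2, the reader here blocks exactly until the byte is written, so it neither races a slow start-up nor waits longer than needed.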