nixos/tests/installer: prevent race between parted and udev #47155

xeji · 2018-09-21T23:57:09Z

Motivation for this change

Our installer tests still suffer from non-deterministic failure on Hydra (18.09 and master), which often delays channels.

One typical symptom of non-deterministic failure seems to be this:

Successful example:

machine: must succeed: parted --script /dev/vda -- mkpart primary linux-swap 1M 1024M
machine# [    9.146921]  vda:
machine: exit status 0
machine: must succeed: parted --script /dev/vda -- mkpart primary ext2 1024M -1s
machine: exit status 0

Failed example:

machine: must succeed: parted --script /dev/vda -- mkpart primary linux-swap 1M 1024M
machine# [   10.560431]  vda: vda1
machine# Error: Partition(s) 1 on /dev/vda have been written, but we have been unable to inform the kernel of the change, probably because it/they are in use.  As a result, the old partition(s) will remain in use.  You should reboot now before making further changes.
machine# [   10.572443]  vda: vda1
machine: exit status 1
machine: output: 
error: command `parted --script /dev/vda -- mkpart primary linux-swap 1M 1024M' did not succeed (exit code 1)
command `parted --script /dev/vda -- mkpart primary linux-swap 1M 1024M' did not succeed (exit code 1)

The failed example looks like udev recognizes the new partition before parted tries to tell the kernel about it. I found some reports that confirm a race between udev and parted, like this one. This can happen between separate parted calls, so it is recommended to combine multiple parted commands into a single parted call to create the correct partition layout in one pass without udev interfering.

And that's what this PR does. The change itself is a series of trivial rewrites.
It should eliminate this particular cause of non-deterministic failure (but probably there are others, so don't expect zero failures from now on 😄).

Relevant for ZHF #45960, please backport to 18.09

~~WIP because I still like to run all the tests locally once, which will take some time.~~ done

Things done

run all installer tests locally

by combining all parted commands into a single parted call. This eliminates one cause of non-deterministic failure.

dezgeg · 2018-09-22T00:27:28Z

I've noticed the races myself at some point as well:
#40230 (comment)

I have a fear that even after this a similar race condition exists. Namely, udev can still be holding the partition devices open during the time mkfs programs or mdadm are run and want to open the partition for exclusive access, leading to Device or resource busy errors. See e.g. this thread: https://groups.google.com/forum/#!topic/scylladb-dev/u87yHgo3ylU

xeji · 2018-09-22T10:11:07Z

It will be difficult to eliminate all causes of such races. Just looking at how they tried to fix this in parted by "sleep-and-retry" doesn't look promising.

Two things we can do to further reduce the probability of failure:

Wrap each parted (and maybe also mkfs etc.) call with flock /dev/vda parted ... to acquire an exclusive lock on the device. That's what util-linux recommends for sfdisk. Doesn't work with device-mapper devices like md though.
For the swraid test which is the only one using mdadm, disable udev queue execution with udevadm control --stop-exec-queue like mentioned in the google groups thread above.

to further reduce risk of race with udev, like util-linux recommends for sfdisk: https://github.com/karelzak/util-linux/blob/v2.32/disk-utils/sfdisk.8#L71

In the swraid test, temporarily stop udev queue execution while creating mdraid devices to prevent a race with udev, see https://groups.google.com/forum/#!topic/scylladb-dev/u87yHgo3ylU

xeji · 2018-09-22T10:42:23Z

We could also protect all calls to mkfs/mkswap in a similar way but I don't recall seeing these fail in our tests yet, so let's fix those failures when they happen.

xeji · 2018-09-22T11:04:03Z

Example data point: In the latest 18.09 Hydra eval there are 5 installer test failures blocking the channel, and all of them show the kind of error addressed by this PR. So these changes can really improve things.

xeji · 2018-09-24T17:02:20Z

backported: 35271fd..570ec19

nixos/tests/installer: prevent race between parted and udev

a518376

by combining all parted commands into a single parted call. This eliminates one cause of non-deterministic failure.

xeji added the 2.status: work-in-progress label Sep 21, 2018

GrahamcOfBorg added 6.topic: nixos 10.rebuild-darwin: 0 10.rebuild-linux: 0 labels Sep 22, 2018

xeji added 2 commits September 22, 2018 12:22

nixos/tests/installer: use flock for all parted calls

c46677f

to further reduce risk of race with udev, like util-linux recommends for sfdisk: https://github.com/karelzak/util-linux/blob/v2.32/disk-utils/sfdisk.8#L71

nixos/tests/installer: stop udev queue before calling mdadm

7dd6a51

In the swraid test, temporarily stop udev queue execution while creating mdraid devices to prevent a race with udev, see https://groups.google.com/forum/#!topic/scylladb-dev/u87yHgo3ylU

xeji removed the 2.status: work-in-progress label Sep 22, 2018

xeji merged commit 9163c05 into NixOS:master Sep 24, 2018

xeji deleted the p/installer-tests branch September 24, 2018 17:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nixos/tests/installer: prevent race between parted and udev #47155

nixos/tests/installer: prevent race between parted and udev #47155

xeji commented Sep 21, 2018 •

edited

dezgeg commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 24, 2018

nixos/tests/installer: prevent race between parted and udev #47155

nixos/tests/installer: prevent race between parted and udev #47155

Conversation

xeji commented Sep 21, 2018 • edited

Motivation for this change

Things done

dezgeg commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 22, 2018

xeji commented Sep 24, 2018

xeji commented Sep 21, 2018 •

edited