nixos-containers: fix atomic restart #80169

danbst · 2020-02-15T08:51:25Z

nixos containers: remove --keep-unit (revert 810680b)

Fixes #43652
Fixes #16753
Alternative fix for #39717
The fix in #76719 was not enough

`--keep-unit` ties systemd-nspawn container to systemd unit. When
unit is restarted (ie atomic operation "restart", not two separate
"stop" and "start") systemd assumes that unit didn't disappear.
Hence machine didn't disappear. Though it may as well be systemd bug
that machine is left in `closing` state forever.

If unit is stopped (or stopped due to failure), associated machine is also stopped,
so service can start fresh.

When --keep-unit is removed, resource slice isn't changed though - it is machine.slice by default for
nspawn containers.

cc @erikarvstedt @peterhoeg @flokli @edolstra

Fixes NixOS#43652 Fixes NixOS#16753 Alternative fix for NixOS#39717 The fix in NixOS#76719 was not enough `--keep-unit` ties systemd-nspawn container to systemd unit. When unit is restarted (ie atomic operation "restart", not two separate "stop" and "start") systemd assumes that unit didn't disappear. Hence machine won't disappear. If unit is stopped (or stopped due to failure), associated machine is also stopped, so service can start fresh. Resource slice isn't changed though - it is machine.slice by default for nspawn containers.

danbst · 2020-02-15T08:54:56Z

if this will be merged, we can go on and cleanup the artifacts from old solutions - WINCH signal, mixed killmode, PID mgmt in imperative containers, restart as stop/start...

flokli · 2020-02-15T20:49:56Z

cc @arianvp here.

Previously, we were storing the leader pid in a runtime file and signalled SIGRTMIN+4 manually. In systemd 219, the `machinectl poweroff` command was introduced, which does that for us.

d-xo · 2020-02-19T10:54:25Z

I tested this locally and can confirm that it fixes the regression of #6212 introduced in 90a3908.

arianvp · 2020-02-19T11:27:08Z

This seems fine. though im a bit confused by why it was originally introduced.

I also don't understand why we need machinectl poweroff to turn off a container; but maybe this is what you're referring to with:

WINCH signal, mixed killmode, PID mgmt in imperative containers, restart as stop/start...

From my understanding, systemd-nspawn should be able to just run in the unit file, and then when the unit is stopped, systemd-nspawn should stop automatically... E.g. make this unit look more like the upstream systemd-nspawn@.service unit

#  SPDX-License-Identifier: LGPL-2.1+
#
#  This file is part of systemd.
#
#  systemd is free software; you can redistribute it and/or modify it
#  under the terms of the GNU Lesser General Public License as published by
#  the Free Software Foundation; either version 2.1 of the License, or
#  (at your option) any later version.

[Unit]
Description=Container %i
Documentation=man:systemd-nspawn(1)
Wants=modprobe@tun.service modprobe@loop.service modprobe@dm-mod.service
PartOf=machines.target
Before=machines.target
After=network.target systemd-resolved.service modprobe@tun.service modprobe@loop.service modprobe@dm-mod.service
RequiresMountsFor=/var/lib/machines

[Service]
# Make sure the DeviceAllow= lines below can properly resolve the 'block-loop' expression (and others)
ExecStart=systemd-nspawn --quiet --keep-unit --boot --link-journal=try-guest --network-veth -U --settings=override --machine=%i
KillMode=mixed
Type=notify
RestartForceExitStatus=133
SuccessExitStatus=133
Slice=machine.slice
Delegate=yes
TasksMax=16384
@SERVICE_WATCHDOG@

# Enforce a strict device policy, similar to the one nspawn configures when it
# allocates its own scope unit. Make sure to keep these policies in sync if you
# change them!
DevicePolicy=closed
DeviceAllow=/dev/net/tun rwm
DeviceAllow=char-pts rw

# nspawn itself needs access to /dev/loop-control and /dev/loop, to implement
# the --image= option. Add these here, too.
DeviceAllow=/dev/loop-control rw
DeviceAllow=block-loop rw
DeviceAllow=block-blkext rw

# nspawn can set up LUKS encrypted loopback files, in which case it needs
# access to /dev/mapper/control and the block devices /dev/mapper/*.
DeviceAllow=/dev/mapper/control rw
DeviceAllow=block-device-mapper rw

[Install]
WantedBy=machines.target

arianvp · 2020-02-19T11:32:02Z

So I think, by merging this change, we can completely get rid of the machinectl stop command, as doing systemctl stop container@blah should be enough to shut down the container.

This is the same issue we're fixing as documented here: systemd/systemd#2770

flokli · 2020-02-19T12:15:05Z

@danbst this looks good - can you do the above mentioned cleanups in this PR, too?

danbst · 2020-02-19T12:43:57Z

@arianvp omg, there was an upstream issue for this! Interesting, they mentioned RemainAfterExit, I think we can't get rid of it in NixOS, as it will break NixOS restart logic.

@flokli ok, I'll try cleanup that

arianvp · 2020-02-19T13:02:30Z

We'll have to keep KillMode=mixed by the way. it's also used upstream. I asked for the reason and this was the answer:

michich, when the kernel wants to stop properly the system (for instance, because you pressed the power button) it sends
SIGTERM to init, wait for some time, then powers down
to emulate that for a container, you send SIGTERM to the main process (the init of the container)
then you wait a bit
then you SIGKILL everything, which is the equivalent for a container of shutting down

flokli · 2020-02-22T00:18:32Z

@danbst poke ;-)

danbst · 2020-02-22T15:18:33Z

@flokli I'm debugging tests, they fail on nixos-container destroy command

* machinectl terminate during container start was a hack, probably wrong one * use "systemctl restart" consistently for declarative and imperative containers * fix "nixos-container terminate" command. We have to notify systemd we want unit to be stopped, because otherwise it will be started back due to restart on failure policy. * add a bit docs to stop/terminate/destroy, to clarify which one does what * add missing "nixos-container restart" docs

danbst · 2020-02-22T18:32:25Z

Cleanups (from commit message):

* machinectl terminate during container start was a hack, probably wrong one
* use "systemctl restart" consistently for declarative and imperative containers
* fix "nixos-container terminate" command. We have to notify systemd we want
  unit to be stopped, because otherwise it will be started back due to
  restart on failure policy.
* add a bit docs to stop/terminate/destroy, to clarify which one does what
* add missing "nixos-container restart" docs

@arianvp I was not able to remove machinectl poweroff, for some reason it doesn't poweroff machine, but just kills it.

Also, with this PR machinectl terminate behavior had changed. I think I'll add release notes, to clarify this bit. nixos-container terminate/destroy works though.

danbst · 2020-02-24T09:10:05Z

@GrahamcOfBorg test containers-imperative

flokli · 2020-02-26T21:06:49Z

pkgs/tools/virtualization/nixos-container/nixos-container.pl

@@ -25,7 +25,7 @@ sub showHelp {
       nixos-container create <container-name>
         [--nixos-path <path>]
         [--system-path <path>]
-         [--config <string>]
+         [--config <string>]  -- config without outer braces, for example 'services.nginx.enable = true;'


This is already documented in nixos/doc/manual/administration/imperative-containers.xml.

it's fine to duplicate it here. The hands-on --help is too scarce.

I agree with @danbst here. This is a common misunderstanding (which usually seems like a bug to an end-user), so mentioning this here as well is fine IMHO.

flokli · 2020-02-26T21:07:46Z

pkgs/tools/virtualization/nixos-container/nixos-container.pl

+       nixos-container restart <container-name>
+       nixos-container stop <container-name>  -- shutdown container cleanly, wait until stopped
+       nixos-container terminate <container-name> -- halt container, like hard poweroff
+       nixos-container destroy <container-name> -- terminate (halt) container and remove state data


This should move into nixos/doc/manual/administration/imperative-containers.xml (if not already there). Why were things reordered?

well, technically yes, but it's fine duplicating it here. No need to consult manual when stopping container.

Things were reordered to group stop/terminate/destroy commands. They do same, but with different nuances.

flokli · 2020-02-26T21:09:44Z

pkgs/tools/virtualization/nixos-container/nixos-container.pl

+    system("systemctl", "stop", "--no-block", "container\@$containerName");
    system("machinectl", "terminate", $containerName) == 0


Urgh, this feels like a hack. Doesn't machinectl terminate by itself make the systemd unit become stopped?

this is consequence of removing --keep-unit, I haven't found a better way to stop unit with machine.

machinectl stop does stop unit fine. It is because machine exits with normal result. But when it runs terminate, it exits with failure result, and systemd restarts unit due to Restart=on-failure policy.

arianvp · 2020-02-26T22:36:03Z

I found a more proper fix (that cleanly shuts down the container with just systemctl stop). We have a slight difference between the way upstream calls systemd-nspawn. I have a patch ready and I'll test if it works tomorrow. Wil also get rid of the WINCH hacks so that's good too. This serves as a reminder to send that tomorrow when I have internet again :)

…

On Wed, Feb 26, 2020, 23:06 Danylo Hlynskyi ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In pkgs/tools/virtualization/nixos-container/nixos-container.pl <#80169 (comment)>: > nixos-container status <container-name> nixos-container update <container-name> + nixos-container restart <container-name> + nixos-container stop <container-name> -- shutdown container cleanly, wait until stopped + nixos-container terminate <container-name> -- halt container, like hard poweroff + nixos-container destroy <container-name> -- terminate (halt) container and remove state data well, technically yes, but it's fine duplicating it here. No need to consult manual when stopping container. Things were reordered to group stop/terminate/destroy commands. They do same, but with different nuances. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#80169?email_source=notifications&email_token=AAEZNI23LM57XO22RH7DDILRE3OR5A5CNFSM4KVWU6W2YY3PNVWWK3TUL52HS4DFWFIHK3DMKJSXC5LFON2FEZLWNFSXPKTDN5WW2ZLOORPWSZGOCXCXNEQ#discussion_r384782585>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAEZNIY7R2OBVCY2SP6GNNDRE3OR5ANCNFSM4KVWU6WQ> .

danbst · 2020-03-01T18:24:00Z

@arianvp kind ping :)

arianvp · 2020-03-01T19:43:40Z

My 'fix' didn't really help unfortunately. I hit what I think is a bug in systemd systemd/systemd#14961 (The fix was setting KillSignal=SIGRTMIN+3 with KeepUnit, such that systemd-nspawn gracefully kills the underlying container on SIGTERM. This is the default for systemd-nspawn in --boot mode, but because we can't use boot mode in NixOS because the systemd binary in the container isn't in the expected place, you have to pass an additional --kill-signal). Maybe it's something obvious that I'm doing wrong. I'll give it one more look Tuesday during Berlin NixOS meetup

…

On Sun, Mar 1, 2020, 19:24 Danylo Hlynskyi ***@***.***> wrote: @arianvp <https://github.com/arianvp> kind ping :) — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#80169?email_source=notifications&email_token=AAEZNI43WFCPY22HCPMCFL3RFKR4DA5CNFSM4KVWU6W2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENNGVNY#issuecomment-593128119>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAEZNIYHWP5UM3WDTRGKOUDRFKR4DANCNFSM4KVWU6WQ> .

Ma27 · 2020-04-04T20:57:55Z

@danbst @arianvp may I ask if there are any updates here? :)

I started working on a draft for improved nixos-containers (using `networkd` and `.nspawn` units) after the networkd hackathon[1] which isn't published yet. Quite recently I realized that when changing a `.nspawn`-unit, the `switch-to-configuration.pl` doesn't activate those changes. This patch takes care of it with the following changes: * It's possible to declare whether to restart or reload such a unit. The restart option is the default. In that case the `systemd-nspawn@<machine-name>.service`[2]-unit will be restarted or reloaded. * By default, all `.nspawn`-units are part of the `machines.target`. * A VM-test covers all those cases including a custom reload-script to activate a new configuration in the machine. * I had to remove the `--keep-unit` flag on startup to fix the restart of the unit. This is a known issue[3]. It's also possible to use a reload to activate a new configuration inside a nspawn-machine with a config like this: ``` nix { pkgs, ... }: { systemd.nspawn.test-container.reloadOnChange = true; systemd.nspawn.test-container.restartOnChange = false; systemd.services."systemd-nspawn@test-container".serviceConfig.ExecReload = "${pkgs.writeScriptBin "activate" '' #! ${pkgs.runtimeShell} -xe systemd-run --machine test-container --pty --quiet -- /bin/sh --login -c \ '${containerCfg}/bin/switch-to-configuration test' ''}/bin/activate"; } ``` [1] https://discourse.nixos.org/t/networkd-sprint-2019-11-23-24-in-munich/4578 [2] https://github.com/systemd/systemd/blob/v243/units/systemd-nspawn@.service.in [3] NixOS#80169

arianvp · 2020-04-07T14:07:40Z

I'm okay with this approach for now if we are adamant on getting it out for 20.03

If not, I would like to wait until we complete the 245 release, and see if the issues I encountered in the linked systemd issue goes away. If we keep using --keep-unit with the inclusion of --kill-signal=SIGTRM3 then systemctl stop container@foo should be sufficient to kill the container, without the machinectl terminate call

flokli · 2020-04-07T14:27:59Z

I'm not sure which approach is referred to by "this", whether it's this PR, or the one referencing it (#84608). I don't think any of these two should be backported to 20.03. It's too late for that, and I don't want to cause breakages that late in the release cycle.

tadfisher · 2020-07-13T21:11:39Z

Ping @danbst; this is still an issue in master, and it would be nice to have this fixed there at least.

maralorn · 2021-02-07T10:48:37Z

Hello @arianvp and @danbst, are there any news on this one?

lIt would be so great to have some kind of fix. Today was another day where I woke up to all my containers being down after the nightly system update.

stale · 2021-08-06T17:49:20Z

I marked this as stale due to inactivity. → More info

danbst added 2 commits February 15, 2020 10:44

tests/imperative containers: add restart test

f940087

ofborg bot added 6.topic: nixos 8.has: module (update) 10.rebuild-darwin: 0 10.rebuild-linux: 1-10 labels Feb 15, 2020

veprbl added the 6.topic: nixos-container Imperative and declarative systemd-nspawn containers label Feb 15, 2020

flokli requested a review from arianvp February 15, 2020 20:50

flokli referenced this pull request Feb 18, 2020

nixos/containers: use machinectl poweroff

90a3908

Previously, we were storing the leader pid in a runtime file and signalled SIGRTMIN+4 manually. In systemd 219, the `machinectl poweroff` command was introduced, which does that for us.

arianvp approved these changes Feb 19, 2020

View reviewed changes

flokli requested changes Feb 26, 2020

View reviewed changes

Ma27 mentioned this pull request Apr 7, 2020

nixos/systemd-nspawn: reload or restart machines on config change #84608

Closed

10 tasks

ryantm added 2.status: merge conflict and removed 2.status: merge conflict labels Oct 3, 2020

stale bot added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Aug 6, 2021

Artturin added the 12.approvals: 1 label Apr 13, 2022

stale bot removed the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Apr 13, 2022

stale bot added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Nov 2, 2022

wegank removed the 12.approvals: 1 label Sep 7, 2023

stale bot removed the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Sep 7, 2023

wegank added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Mar 19, 2024

wegank marked this pull request as draft March 20, 2024 14:55

stale bot removed the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nixos-containers: fix atomic restart #80169

nixos-containers: fix atomic restart #80169

danbst commented Feb 15, 2020

danbst commented Feb 15, 2020

flokli commented Feb 15, 2020

d-xo commented Feb 19, 2020

arianvp commented Feb 19, 2020

arianvp commented Feb 19, 2020 •

edited

flokli commented Feb 19, 2020

danbst commented Feb 19, 2020

arianvp commented Feb 19, 2020

flokli commented Feb 22, 2020

danbst commented Feb 22, 2020

danbst commented Feb 22, 2020

danbst commented Feb 24, 2020

flokli Feb 26, 2020

danbst Feb 26, 2020

Ma27 Apr 4, 2020

flokli Feb 26, 2020

danbst Feb 26, 2020

flokli Feb 26, 2020

danbst Feb 26, 2020

arianvp commented Feb 26, 2020 via email

danbst commented Mar 1, 2020

arianvp commented Mar 1, 2020 via email •

edited

Ma27 commented Apr 4, 2020

arianvp commented Apr 7, 2020

flokli commented Apr 7, 2020 •

edited

tadfisher commented Jul 13, 2020

maralorn commented Feb 7, 2021

stale bot commented Aug 6, 2021

		system("systemctl", "stop", "--no-block", "container\@$containerName");
		system("machinectl", "terminate", $containerName) == 0

nixos-containers: fix atomic restart #80169

Are you sure you want to change the base?

nixos-containers: fix atomic restart #80169

Conversation

danbst commented Feb 15, 2020

danbst commented Feb 15, 2020

flokli commented Feb 15, 2020

d-xo commented Feb 19, 2020

arianvp commented Feb 19, 2020

arianvp commented Feb 19, 2020 • edited

flokli commented Feb 19, 2020

danbst commented Feb 19, 2020

arianvp commented Feb 19, 2020

flokli commented Feb 22, 2020

danbst commented Feb 22, 2020

danbst commented Feb 22, 2020

danbst commented Feb 24, 2020

flokli Feb 26, 2020

Choose a reason for hiding this comment

danbst Feb 26, 2020

Choose a reason for hiding this comment

Ma27 Apr 4, 2020

Choose a reason for hiding this comment

flokli Feb 26, 2020

Choose a reason for hiding this comment

danbst Feb 26, 2020

Choose a reason for hiding this comment

flokli Feb 26, 2020

Choose a reason for hiding this comment

danbst Feb 26, 2020

Choose a reason for hiding this comment

arianvp commented Feb 26, 2020 via email

danbst commented Mar 1, 2020

arianvp commented Mar 1, 2020 via email • edited

Ma27 commented Apr 4, 2020

arianvp commented Apr 7, 2020

flokli commented Apr 7, 2020 • edited

tadfisher commented Jul 13, 2020

maralorn commented Feb 7, 2021

stale bot commented Aug 6, 2021

arianvp commented Feb 19, 2020 •

edited

arianvp commented Mar 1, 2020 via email •

edited

flokli commented Apr 7, 2020 •

edited