dockerTools: Eliminate layer creation copypasta #51587

grahamc · 2018-12-05T20:04:29Z

Motivation for this change

See the second commit, as the first commit is in #51528.

Essentially, there were several places which ran the same exact commands to build a docker layer, and now there is only one copy of that code :)

I experimented with deduplicating the image creation code, but boy howdy that is for another day.

Things done

Docker images used to be, essentially, a linked list of layers. Each layer would have a tarball and a json document pointing to its parent, and the image pointed to the top layer: imageA ----> layerA | v layerB | v layerC The current image spec changed this format to where the Image defined the order and set of layers: imageA ---> layerA |--> layerB `--> layerC For backwards compatibility, docker produces images which follow both specs: layers point to parents, and images also point to the entire list: imageA ---> layerA | | | v |--> layerB | | | v `--> layerC This is nice for tooling which supported the older version and never updated to support the newer format. Our `buildImage` code only supported the old version, so in order for `buildImage` to properly generate an image based on another image with `fromImage`, the parent image's layers must fully support the old mechanism. This is not a problem in general, but is a problem with `buildLayeredImage`. `buildLayeredImage` creates images with newer image spec, because individual store paths don't have a guaranteed parent layer. Including a specific parent ID in the layer's json makes the output less likely to cache hit when published or pulled. This means until now, `buildLayeredImage` could not be the input to `buildImage`. The changes in this PR change `buildImage` to only use the layer's manifest when locating parent IDs. This does break buildImage on extremely old Docker images, though I do wonder how many of these exist. This work has been sponsored by Target.

Extract the layer creation code in to make-layer.sh and update each layer creation function to use it.

grahamc · 2018-12-05T20:05:57Z

@GrahamcOfBorg build nixosTests.docker-tools

Mic92 · 2018-12-07T08:37:50Z

pkgs/build-support/docker/default.nix

-        tar -C image/$parentID/layer -xpf image/$parentID/layer.tar
-        rm image/$parentID/layer.tar
+      extractionID=0
+      for layerTar in $(cat layer-list); do


while IFS= read layerTar; do # ... done <layer-list

Mic92 · 2018-12-07T08:39:21Z

pkgs/build-support/docker/default.nix

-        while [[ -n "$currentID" ]]; do
-          layerChecksum=$(sha256sum image/$currentID/layer.tar | cut -d ' ' -f1)
+
+        for layerTar in $(cat ./layer-list); do


Mic92 · 2018-12-07T08:52:53Z

pkgs/build-support/docker/default.nix

+          # would fail if layer-list was completely empty.
+          echo "$layerID/layer.tar"
+          cat layer-list
+        ) | ${pkgs.moreutils}/bin/sponge layer-list


That would break in the cross-compiling case.
Better to use:

( echo "$layerID/layer.tar" cat layer-list ) > layer-list-new mv layer-list-new layer-list

This also removes the dependency on moreutils while having the same amount of external commands.

Speaking of cross-compiling: All buildInputs above should become nativeBuildInputs

Mic92 · 2018-12-07T09:00:04Z

pkgs/build-support/docker/default.nix

@@ -673,6 +646,9 @@ rec {
        if [[ -n "$fromImage" ]]; then
          echo "Unpacking base image..."
          tar -C image -xpf "$fromImage"
+
+          cat ./image/manifest.json  | jq -r '.[0].Layers | .[]' > layer-list


There are several cases of cat file | over the whole file, that could become:

Suggested change

cat ./image/manifest.json | jq -r '.[0].Layers | .[]' > layer-list

jq < ./image/manifest.json -r '.[0].Layers | .[]' > layer-list

I know that cat is useless, but I find it much more readable.

and slower.

There is also jq -r '.[0].Layers | .[]' ./image/manifest.json > layer-list :)

This is an improvement, but if performance is a concern, this should probably be rewritten in a 'real' language that doesn't need to start processes to perform non-trivial operations.

grahamc added 2 commits December 5, 2018 14:25

dockerTools: Eliminate layer creation copypasta

0a378ff

Extract the layer creation code in to make-layer.sh and update each layer creation function to use it.

grahamc requested review from nlewo, globin and samueldr and removed request for nlewo and globin December 5, 2018 20:04

GrahamcOfBorg added 6.topic: nixos 8.has: documentation 10.rebuild-darwin: 0 10.rebuild-linux: 0 labels Dec 5, 2018

puffnfresh approved these changes Dec 7, 2018

View reviewed changes

Mic92 reviewed Dec 7, 2018

View reviewed changes

GrahamcOfBorg requested review from disassembler and removed request for globin and disassembler January 3, 2019 15:54

grahamc mentioned this pull request Jan 3, 2019

Support Pull Review Requests softprops/hubcaps#190

Merged

GrahamcOfBorg requested a review from disassembler January 3, 2019 16:25

grahamc closed this Jul 3, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dockerTools: Eliminate layer creation copypasta #51587

dockerTools: Eliminate layer creation copypasta #51587

grahamc commented Dec 5, 2018

grahamc commented Dec 5, 2018

Mic92 Dec 7, 2018

Mic92 Dec 7, 2018

Mic92 Dec 7, 2018

Mic92 Dec 7, 2018

Mic92 Dec 7, 2018

grahamc Dec 7, 2018

Mic92 Dec 7, 2018

nlewo Dec 7, 2018

roberth Dec 9, 2018

	cat ./image/manifest.json \| jq -r '.[0].Layers \| .[]' > layer-list
	jq < ./image/manifest.json -r '.[0].Layers \| .[]' > layer-list

dockerTools: Eliminate layer creation copypasta #51587

dockerTools: Eliminate layer creation copypasta #51587

Conversation

grahamc commented Dec 5, 2018

Motivation for this change

Things done

grahamc commented Dec 5, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment