A Peek into Docker Images
A look into docker images using snakes
Let’s talk about docker images. A docker image is made up of one or more layers and some metadata. When you do a docker pull these are retrieved from dockerhub or your repository of choice.
That is simple enough. However understanding how docker images work under the covers, where the various digests and hashes you see returned by the docker client come from, is a little more challenging. Let’s walk through the structure of an example image and see what lessons we learn.
First, let’s pull a fun docker image. How about a terminal game like snake?
After a docker run we will get a fun little snake game.
> docker run -ti dyego/snake-game
You may notice the following output was generated before the game started:
Unable to find image 'dyego/snake-game:latest' locally
latest: Pulling from dyego/snake-game
1160f4abea84: Pull complete
4d49542c61a4: Pull complete
3ee103c86f60: Pull complete
9a56d2eb1eed: Pull complete
67e0ebed9a3b: Pull complete
194910951a14: Pull complete
20ccf4425819: Pull complete
4b3db85b3b19: Pull complete
b612933e98de: Pull complete
ab455ad83399: Pull complete
Digest: sha256:b2b4751952d24fa810a91620aee5f49a1cdf7d05b472a209920f3310f1a84bc1
Status: Downloaded newer image for dyego/snake-game:latest
We can see that docker image was made up of 10 layers. Each layer was pulled down by the docker client. Then a digest was emitted, the docker pull was complete and then the game started.
Pulling it apart
Where do the digest and the hashes returned by each layer come from? Let’s use a handy tool called skopeo to pull the raw docker image into a folder and take a look at it.
> skopeo copy docker://dyego/snake-game:latest dir:./1160f4abea84cbe2f316db6306839d2704f09a04af763ee493dd92cb066c0865
194910951a14ae1c018dbb76537380240c31a7573411abbe1abc8c0c78f410f6
20ccf4425819fbf3045577b0954f38b0b1fe636447dcfb2e7ab51b56119d720a
3ee103c86f6069f2ce62589cd1227005febfb70eb986a90a1b39af090769f2cf
4b3db85b3b19a9e472b818bcdb61efbbc376b308630df69544e97c09ee6ef366
4d49542c61a48835965105e24434036ff74a67dde0d7560a6f55219b5980e1c1
67e0ebed9a3b9232682631f4cfe09da2ee26e0761e935f723b81e9e3bcadf138
97b9447a34eca52d4283759df0f47f42cb9629b3ab6058fca5a993cfacb1e7a8
9a56d2eb1eedd2ba6239a29cf279531ebf0b0bf0f1d29736f7b11ab0d98ab431
ab455ad83399d5f68cc1b402ad4231854b30c408b27ca277544fb2c0d24e7c15
b612933e98de5544206641c6319e1fbe1a7269aafbeca9fbf07bf8b6f0055198
manifest.json
We get one file for each layer in our image, a manifest file, and one extra file, which we will get to later. We can see that the first twelve characters of each file correspond to the hash returned by docker pull for that layer.
We can also see that the file name of each layer is actually the sha256 of the contents of the layer:
> cat 1160f4abea84cbe2f316db6306839d2704f09a04af763ee493dd92cb066c0865 | shasum -a 256
1160f4abea84cbe2f316db6306839d2704f09a04af763ee493dd92cb066c0865 -
Finding #1:
Docker layers are stored using a Content Addressable Storage scheme. That means that the hash of the contents of the layer is how that layer is referred to and stored in the file system.
Layers on top of layers
Next let’s look at the manifest.json file. The layers section of the document looks like this:
{
...
"layers": [
{
"mediaType": "vnd.docker.image.rootfs.diff.tar.gzip",
"size": 1991501,
"digest": "sha256:1160f4abea84.."
},{
"mediaType": "vnd.docker.image.rootfs.diff.tar.gzip",
"size": 80230150,
"digest": "sha256:9a56d2eb1eed..."
}, ...
]
}
The manifest file lists each of layer digests for its ten layers, as well as the size and the format of the file. The ordering here is important as docker images use a union file system.
Each immutable layer is an overlay on top of the previous layers. In this case, since this image is based on golang:alpine, the first five layers are shared with many other images that build on golang:alpine.
Image Config
The manifest also has a config section:
{
"config": {
"mediaType": "vnd.docker.container.image.v1+json",
"size": 5542,
"digest": "sha256:97b9447a34ec..."
},
...
}
The config is the extra file I mentioned we would get back to. The config is a json document and contains metadata about image creation.
> cat 97b9447a34eca52d4283759df0f47f42cb9629b3ab6058fca5a993cfacb1e7a8
{"architecture":"amd64","config":{"Hostname":
...
}
And again its filename is its sha256 hash:
> cat 97b9447a34eca52d4283759df0f47f42cb9629b3ab6058fca5a993cfacb1e7a8 | shasum -a 256
97b9447a34eca52d4283759df0f47f42cb9629b3ab6058fca5a993cfacb1e7a8 -
We will cover the config file in more detail in a future article
Image Digest
Now we are only missing one piece of the puzzle. Where does the image digest that was returned in our original docker run come from? The answer is the sha256 of the manifest:
> cat manifest.json| shasum -a 256
b2b4751952d24fa810a91620aee5f49a1cdf7d05b472a209920f3310f1a84bc1
This structure is known as a merkle tree.
An image can be referred to by the hash of its manifest and the manifest contains a list of the child dependencies of the image. Its dependencies are its layers and the config. Any change to any layer will cause its digest to change, which will cause the manifest to change, which will cause the entire image to have a different digest.
In fact, you can always refer to an image using its digest rather than a tag. Referring to its by its digest is more verbose, but the digest, unlike the tag, can’t be updated to point to a different image without changing the digest itself. This makes it a great choice in many situations. Here is how we would do this:
> docker pull dyego/snake-game@sha256:b2b4751952d24fa810a91620aee5f49a1cdf7d05b472a209920f3310f1a84bc1
Finding #2:
A docker image is an immutable merkle tree of its layers and config. The digest is the sha256 of the manifest and can be used to definitively refer to an image.
This is also how git works internally. Each commit can be referred to by its hash. In git, branches are just pointers to hashes of commits. This is way docker image tags work as well. If you are familiar with how git commits form a tree, this intuition can guide you to understanding the docker image format. They are very similar.
By looking at bit deeper at the docker image format we now have a better and more hands-on understanding of how the format works.