Docker
# Intro
# What is Docker?
- a platform that automates the deployment, scaling, and management of applications by isolating them into lightweight, portable containers
# What is container?
- software environment
# Why container?
- bare metal not flexible
- virtual machines too heavy
# Docker and OCI?
- OCI = Open Container Initiation
- Docker played a pivatol role in shaping the standards for container format and runtime
- OCI specification: runtime-spec + image-spec
- Docker: Docker Engine + Docker Image
# Installation
Install Docker Engine on Ubuntu | Docker Docs
Install Docker Desktop on Windows | Docker Docs
Important To run Windows containers, you need Windows 10 or Windows 11 Professional or Enterprise edition. Windows Home or Education editions only allow you to run Linux containers.
Win 需要先安装和更新 wsl
|
|
# Underlying Mechanisms
# cgroup/control group
-
a linux kernel feature that allow you to allocate or manage computer resources(CPU, memory, bandwidth, I/O …)
-
pros
- Resources isolation: avoid negative affects (on resources) from other container
- can mannually limit resources usage
- monitering
- can give priority to certain container
# Namespaces
- Linux kernel features allows the isolation of system resources
- types of namespaces:
- PID
- NET(network)
- MNT(mount)
- UTS(UNIX Tims Sharing System)
- USER
- IPC(Inter-Process Communication)
- docker creates a set of namspaces when a container is started, so that
- the container is portable
- without conflicts or interferences from other container
# Union Filesystems
- creates virtual, layered file structure
- enables simultaneous mounting of multiple directories on a single mount point, keeping the containers seperate
- essential features:
- Layered structure multiple read-only layer + a top writable layer
- Copy-on-Write the system creates a copy of the file in writable layer when the file needs changes
- Resources Sharing containers share common base layer while running separately
- Fast Container Initialization Union filesystems make it possible to create new containers instantly by merely creating a new writable layer on existing read-only layers.
Popular UnionFS in Docker:
- AUFS (Advanced Multi-Layered Unification Filesystem), widely used as Docker storage driver
- OverlayFS (Overlay Filesystem), simplified creation and management of overlayed directories compare to AUFS.
- Btrfs (B-Tree Filesystem) advanced storage features like snapshots and checksumming
- ZFS (Z File System), provides union filesystem features along with data protection, compression, and deduplication
# Basics
# Docker Component
- Dockerfile: A text file containing instructions (commands) to build a Docker image.
- Docker Image: A snapshot of a container, created from a Dockerfile. Images are stored in a registry, like Docker Hub, and can be pulled or pushed to the registry.
- Docker Container: A running instance of a Docker image.
# Docker Command
-
docker pull <image>
: Download an image from a registry, like Docker Hub. -
docker build -t <image_name> <path>
: Build an image from a Dockerfile, where<path>
is the directory containing the Dockerfile. -
docker run -d -p <host_port>:<container_port> --name <container_name> <image>
: Run a container from an image, mapping host ports to container ports. -
docker image ls
: List all images available on your local machine. -
docker image rm <image>
: Remove an image from your local machine. -
docker container ls
: List all running containers. -
docker container stop <container>
: Stop a running container. -
docker container rm <container>
: Remove a stopped container.
PS: docker-compose commands are also important
# Data Persistence in Docker
- Containers are ephemeral by default, which means any data stored in the container will be lost once it is terminated
- to store data permanently, there are 3 ways:
# Docker Volumn
- a volumn is a directory on the host machine. Docker can use it to store files or directories
To create a volume, use the following command:
|
|
To use a volume, add a --volume
(or -v
) flag to your docker run
command:
|
|
or Using --mount
|
|
to inspect the details of volume
|
|
remove
|
|
- a volume can be mounted in different containers at the same time
# Bind Mount
- Bind mounts allow you to map any directory on the host machine to a directory within the container
To create a bind mount, use the --mount
flag with type=bind
in your docker run
command:
|
|
- Bind Mount 在主机的绝对目录, Volumn 在 Docker 的管理下并在 Docker 的存储空间
# Docker tmpfs mounts
- Docker tmpfs mounts allow you to create a temporary file storage directly in the container’s memory. Data stored in tmpfs mounts is fast and secure but will be lost once the container is terminated.
- useful for cases where just the persistence of data within the life-cycle of the container is required.
To use a tmpfs mount, add a --tmpfs
flag to your docker run
command:
|
|
# Image
# Using Third-Party Images
# How to find?
- search in Docker Hub
# Use third-party image in docker file
FROM image-name
e.g.
|
|
# Example: using database in docker
MySQL
|
|
--name
container name-e
specify environmental parameter-p
host:vm vm端口映射到主机-d
from which image
PostgreSQL
|
|
MongoDB
|
|
# Interactive Test Environment
- Goal: much easier to work with third party software, test different dependencies or versions, and quickly experiment without the risk of damaging your local setup
create
|
|
-it
interative mode with tty--rm
remove the container once it is stopped- then you can pip, print or test any python
more examples
|
|
|
|
|
|
# Command Line Utilities
# BusyBox
- provides a large amount of common unix utilities, such as
awk
,grep
,vi
, etc - just like you would on a regular command line
To run BusyBox inside a Docker container, you simply need to pull the image and run it with Docker:
|
|
# cURL
- transfer data using various network protocols
- often used for testing APIs or downloading files from the internet
|
|
# other
wget
: A free utility for non-interactive download of files from the Web.imagemagick
: A powerful software suite for image manipulation and conversion.jq
: A lightweight and flexible command-line JSON processor.
# Building Container Images
Container image is a executable package including all things to run an application. Including:
- code
- runtime
- system tools
- libraries
- settings
# Dockerfile
It is essentially a script containing instructions on how to assemble a Docker image.
# Dockerfile Example
|
|
# Common Dockerfile Instructions
FROM
set the base imageWORKDIR
set working dir (RUN, CMD, ENTRYPOINT, COPY, ADD). If dir do not exists, create.COPY
copy from host dir, to container’s file system.ADD
COPY + handle remote URLs and automatically unpack archives.CMD
command when running a containerENTRYPOINT
CMD but the container as excutable with paramsEXPOSE
the container will listen on which port(s)ENV
set env var for the container
Each instruction in the Dockerfile creates a new layer in the image.
# Build from Image
Build from image
|
|
inspect the created image
|
|
inspect individual layers of an image
|
|
remove
|
|
Pushing Images to a Registry
- login, tag, push
|
|
# Efficient layer caching
- Docker creates a new layer for each instruction
- The existing cached layers will be reused when:
- the instruction hasn’t changed since the last build
- or none of the layers is affected by changed
Tips/Best practices
- Minimize changes in the Dockerfile
- minimize frequency of changes in the Dockerfile
- most frequently changed lines appear at the bottom
- Build context optimization
- use
.dockerignore
- use
- Use smaller base images
- smaller image = less number of layers need to be cached
--cache-from
- specify which image to use as a cache src.
- Combine multiple instructions
# Image Size and Security
Smaller image size ->
- faster build
- reduced network overhead when downloading
# Reducing image size
- use an appropriate base image
- consider alpine variant
|
|
- Run multiple commands in a single
RUN
statement
|
|
- Remove unnecessary file in the same layer
- remove temporary or unused files in the same layer to reduce the final image size
|
|
- Use multi-stage builds
- use multiple
FROM
statements, eachFROM
statement creates a new stage - copy files from one stage to another using the
COPY --from
statement
- use multiple
|
|
- Use
.dockerignore
- to exclude files
|
|
# Enhancing Security
-
Keep base images updated
-
Always use a non-root user when running containers
|
|
- Limit scope of
COPY
andADD
|
|