Kir Kolyshkin d57f5bb286 cgroupv1: don't ignore MemorySwap if Memory==-1
Commit 18ebc51b3cc3 "Reset Swap when memory is set to unlimited (-1)"
added handling of the case when a user updates the container limits
to set memory to unlimited (-1) but does not set any other limits.
Apparently, in this case, if a swap limit was previously set, the kernel fails
to set memory.limit_in_bytes to -1 if memory.memsw.limit_in_bytes is
not set to -1.

What the above commit fails to handle correctly is the request when
Memory is set to -1 and MemorySwap is set to some specific limit N
(where N > 0). In this case, the value of N is silently discarded
and MemorySwap is set to -1 instead.

This is the wrong thing to do, as the limit set, even if incorrectly,
should not be ignored.

Fix this by only assigning MemorySwap == -1 in case it was not
explicitly set.

Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>
2020-05-20 17:23:40 -07:00
contrib contrib: recvtty: add --no-stdin flag 2020-05-13 17:38:45 +10:00
docs docs: terminals: mention subreaper requirement 2020-04-28 22:53:59 +10:00
libcontainer cgroupv1: don't ignore MemorySwap if Memory==-1 2020-05-20 17:23:40 -07:00
man Use "command -v" shell builtin instead of "which" 2020-03-03 16:28:54 -08:00
script Modify check-config.sh in accordance with Moby Project updates 2018-11-30 16:38:19 +09:00
tests cgroups/systemd: allow to set -1 as pids.limit 2020-05-20 13:20:04 -07:00
types Merge pull request #2292 from Creatone/creatone/extend-intelrdt 2020-05-13 13:33:55 -07:00
vendor Merge pull request #2409 from adrianreber/go-criu-4-0-0 2020-05-21 01:39:09 +09:00
.gitignore travis: test cross compilation 2018-06-16 09:32:39 +09:00
.pullapprove.yml Disallow self-LGTMs 2016-06-01 09:31:21 +08:00
.travis.yml cgroup v2: support rootless systemd 2020-05-08 12:39:20 +09:00
CONTRIBUTING.md *: add information about security mailing list 2016-12-03 18:54:53 +11:00
Dockerfile Dockerfile: bump bats to 1.2.0 2020-05-12 11:54:17 -07:00
LICENSE Initial commit of runc binary 2015-06-21 19:34:13 -07:00
MAINTAINERS MAINTAINERS: add Kir Kolyshkin 2020-04-19 09:19:20 +09:00
MAINTAINERS_GUIDE.md trailing punctuation in header 2016-12-02 15:42:17 +08:00
Makefile Remove "-buildmode=pie" from platforms that don't support it 2020-05-19 16:00:37 -07:00
NOTICE Move libcontainer documenation to root of repo 2015-06-26 11:50:46 -07:00
PRINCIPLES.md Move libcontainer documenation to root of repo 2015-06-26 11:50:46 -07:00
README.md README.md: fix a dead link 2020-05-21 02:31:33 +09:00
SECURITY.md Apply review 2020-01-26 23:03:13 +09:00
VERSION VERSION: back to development 2020-01-23 03:19:29 +11:00
Vagrantfile Vagrantfile: use criu from stable repo 2020-05-09 13:54:27 -07:00
checkpoint.go Merge pull request #2413 from JFHwang/2392-spec-check 2020-05-19 08:11:22 +09:00
create.go Prepare startContainer() to have more action 2017-05-01 21:55:57 +03:00
delete.go Replace formatted errors when unneeded 2020-05-16 18:13:21 -07:00
events.go Replace formatted errors when unneeded 2020-05-16 18:13:21 -07:00
exec.go runc exec: don't enable terminal unless -t is set 2020-04-11 14:22:07 -07:00
go.mod Merge pull request #2409 from adrianreber/go-criu-4-0-0 2020-05-21 01:39:09 +09:00
go.sum Merge pull request #2409 from adrianreber/go-criu-4-0-0 2020-05-21 01:39:09 +09:00
init.go Improve nsexec logging 2019-04-22 17:53:52 +03:00
kill.go nit: do not use syscall package 2020-04-18 16:16:49 -07:00
list.go Replace formatted errors when unneeded 2020-05-16 18:13:21 -07:00
main.go Don't print errors twice 2020-05-02 18:13:07 -07:00
notify_socket.go sd-notify: do not hang when NOTIFY_SOCKET is used with create 2020-03-12 21:21:05 +01:00
pause.go Fix some typos 2018-12-09 23:52:54 +08:00
ps.go Replace formatted errors when unneeded 2020-05-16 18:13:21 -07:00
restore.go runc checkpoint: fix --status-fd to accept fd 2020-05-11 15:36:50 -07:00
rlimit_linux.go Fix the value corresponding to rlimitmap [key] 2020-02-07 13:02:14 +08:00
rootless_linux.go cgroup v2: support rootless systemd 2020-05-08 12:39:20 +09:00
run.go Prepare startContainer() to have more action 2017-05-01 21:55:57 +03:00
signals.go nit: do not use syscall package 2020-04-18 16:16:49 -07:00
spec.go Merge pull request #1911 from theSuess/linter-fixes 2018-11-13 12:13:34 -05:00
start.go sd-notify: do not hang when NOTIFY_SOCKET is used with create 2020-03-12 21:21:05 +01:00
state.go Check args numbers before application start 2016-11-29 11:18:51 +08:00
tty.go tty: clean up epollConsole closing 2018-09-21 11:55:36 +10:00
update.go Replace formatted errors when unneeded 2020-05-16 18:13:21 -07:00
utils.go Don't print errors twice 2020-05-02 18:13:07 -07:00
utils_linux.go validateProcessSpec: prevent SEGV when config is valid json, but invalid. 2020-05-18 09:38:22 -07:00

README.md

runc

Introduction

runc is a CLI tool for spawning and running containers according to the OCI specification.

Releases

runc depends on and tracks the runtime-spec repository. We will try to make sure that runc and the OCI specification major versions stay in lockstep. This means that runc 1.0.0 should implement the 1.0 version of the specification.

You can find official releases of runc on the release page.

Currently, the following features are not considered to be production-ready:

  • Support for cgroup v2

Security

The reporting process and disclosure communications are outlined here.

Security Audit

A third-party security audit was performed by Cure53; you can see the full report here.

Building

runc currently supports the Linux platform on various architectures. It must be built with Go version 1.13 or higher.

In order to enable seccomp support you will need to install libseccomp on your platform.

e.g. libseccomp-devel for CentOS, or libseccomp-dev for Ubuntu

# create a 'github.com/opencontainers' directory in your GOPATH/src
cd github.com/opencontainers
git clone https://github.com/opencontainers/runc
cd runc

make
sudo make install

You can also use go get to install to your GOPATH, assuming that you have a github.com parent folder already created under src:

go get github.com/opencontainers/runc
cd $GOPATH/src/github.com/opencontainers/runc
make
sudo make install

runc will be installed to /usr/local/sbin/runc on your system.

Build Tags

runc supports optional build tags for compiling support of various features, with some of them enabled by default (see BUILDTAGS in top-level Makefile).

To change build tags from the default, set the BUILDTAGS variable for make, e.g.

make BUILDTAGS='seccomp apparmor'

Build Tag   Feature                              Enabled by default   Dependency
seccomp     Syscall filtering                    yes                  libseccomp
selinux     selinux process and mount labeling   yes
apparmor    apparmor profile support             yes
nokmem      disable kernel memory accounting     no

Running the test suite

runc currently supports running its test suite via Docker. To run the suite just type make test.

make test

There are additional make targets for running the tests outside of a container, but this is not recommended because the tests are written with the expectation that they can write and remove files anywhere.

You can run a specific test case by setting the TESTFLAGS variable.

# make test TESTFLAGS="-run=SomeTestFunction"

You can run a specific integration test by setting the TESTPATH variable.

# make test TESTPATH="/checkpoint.bats"

You can run a specific rootless integration test by setting the ROOTLESS_TESTPATH variable.

# make test ROOTLESS_TESTPATH="/checkpoint.bats"

You can run a test using your container engine's flags by setting CONTAINER_ENGINE_BUILD_FLAGS and CONTAINER_ENGINE_RUN_FLAGS variables.

# make test CONTAINER_ENGINE_BUILD_FLAGS="--build-arg http_proxy=http://yourproxy/" CONTAINER_ENGINE_RUN_FLAGS="-e http_proxy=http://yourproxy/"

Dependencies Management

runc uses Go Modules for dependency management. Please refer to the Go Modules documentation for how to add or update dependencies. When updating dependencies, be sure that you are running Go 1.14 or newer.

# Update vendored dependencies
make vendor
# Verify all dependencies
make verify-dependencies

Using runc

Creating an OCI Bundle

In order to use runc you must have your container in the format of an OCI bundle. If you have Docker installed you can use its export method to acquire a root filesystem from an existing Docker container.

# create the top most bundle directory
mkdir /mycontainer
cd /mycontainer

# create the rootfs directory
mkdir rootfs

# export busybox via Docker into the rootfs directory
docker export $(docker create busybox) | tar -C rootfs -xvf -

Once the root filesystem is populated, generate a spec in the form of a config.json file inside your bundle. runc provides a spec command that generates a base template spec, which you can then edit. To find features and documentation for fields in the spec, please refer to the specs repository.

runc spec
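
As a quick sanity check before running anything, the bundle layout described above can be verified with a small shell function. This is only a sketch and not part of runc: check_bundle is a hypothetical helper, and it assumes the root filesystem lives in the default rootfs directory rather than in a custom root path set in config.json.

```shell
# Hypothetical helper (not shipped with runc): verify that a directory
# looks like the bundle built above.
# Assumes the default layout: config.json next to a rootfs/ directory.
check_bundle() {
    bundle=${1:-.}
    if [ ! -f "$bundle/config.json" ]; then
        echo "missing $bundle/config.json (run 'runc spec' in the bundle)"
        return 1
    fi
    if [ ! -d "$bundle/rootfs" ]; then
        echo "missing $bundle/rootfs directory"
        return 1
    fi
    echo "$bundle looks like an OCI bundle"
}
```

For the example above, this would be called as check_bundle /mycontainer.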

Running Containers

Assuming you have an OCI bundle from the previous step you can execute the container in two different ways.

The first way is to use the convenience command run that will handle creating, starting, and deleting the container after it exits.

# run as root
cd /mycontainer
runc run mycontainerid

If you used the unmodified runc spec template this should give you a sh session inside the container.

The second way to start a container is to use the spec's lifecycle operations. This gives you more control over how the container is created and managed while it is running. It also launches the container in the background, so for the simple examples here you will have to edit config.json to remove the terminal setting. The process field in your config.json should look like the following, with "terminal": false and "args": ["sleep", "5"].

        "process": {
                "terminal": false,
                "user": {
                        "uid": 0,
                        "gid": 0
                },
                "args": [
                        "sleep", "5"
                ],
                "env": [
                        "PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
                        "TERM=xterm"
                ],
                "cwd": "/",
                "capabilities": {
                        "bounding": [
                                "CAP_AUDIT_WRITE",
                                "CAP_KILL",
                                "CAP_NET_BIND_SERVICE"
                        ],
                        "effective": [
                                "CAP_AUDIT_WRITE",
                                "CAP_KILL",
                                "CAP_NET_BIND_SERVICE"
                        ],
                        "inheritable": [
                                "CAP_AUDIT_WRITE",
                                "CAP_KILL",
                                "CAP_NET_BIND_SERVICE"
                        ],
                        "permitted": [
                                "CAP_AUDIT_WRITE",
                                "CAP_KILL",
                                "CAP_NET_BIND_SERVICE"
                        ],
                        "ambient": [
                                "CAP_AUDIT_WRITE",
                                "CAP_KILL",
                                "CAP_NET_BIND_SERVICE"
                        ]
                },
                "rlimits": [
                        {
                                "type": "RLIMIT_NOFILE",
                                "hard": 1024,
                                "soft": 1024
                        }
                ],
                "noNewPrivileges": true
        },
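
The two edits to the process field can also be scripted instead of made by hand. The sketch below assumes jq is installed, which is not a runc requirement; set_sleep_process is a hypothetical helper name, and it rewrites the file through a temporary copy so a failed jq run does not truncate config.json.

```shell
# Hypothetical helper: switch a generated config.json to a detached
# "sleep 5" process, as required for the lifecycle examples.
# Assumes jq is available on PATH.
set_sleep_process() {
    config=${1:-config.json}
    tmp="$config.tmp"
    # Write to a temporary file first; only replace the original on success.
    jq '.process.terminal = false | .process.args = ["sleep", "5"]' "$config" > "$tmp" \
        && mv "$tmp" "$config"
}
```

Run it from the bundle directory as set_sleep_process, or pass the path to config.json explicitly.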

Now we can go through the lifecycle operations in your shell.

# run as root
cd /mycontainer
runc create mycontainerid

# verify that the container is created and in the "created" state
runc list

# start the process inside the container
runc start mycontainerid

# after 5 seconds, verify that the container has exited and is now in the stopped state
runc list

# now delete the container
runc delete mycontainerid

This allows higher-level systems to augment the container's creation logic with setup of various settings after the container is created and/or before it is deleted. For example, the container's network stack is commonly set up after create but before start.
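
A minimal sketch of that pattern in shell: create the container, run a setup step, and only then start it. The function run_with_setup and its hook argument are hypothetical names, not part of runc; the RUNC variable is likewise an assumption, added so the runc invocation can be overridden. The function must be called from the bundle directory, just like the commands above.

```shell
# RUNC may be overridden, e.g. RUNC="runc --root /tmp/runc".
RUNC=${RUNC:-runc}

# Hypothetical wrapper: create, run a setup hook, then start.
#   $1  container id
#   $2  command to run between create and start; it receives the id
run_with_setup() {
    id=$1
    setup_hook=$2
    $RUNC create "$id" || return 1
    if ! "$setup_hook" "$id"; then
        # Setup failed: remove the created container instead of starting it.
        $RUNC delete --force "$id"
        return 1
    fi
    $RUNC start "$id"
}
```

A network setup script, for example, could be passed as the second argument; cleanup after the container exits is still done with runc delete as shown above.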

Rootless containers

runc has the ability to run containers without root privileges. This is called rootless. You need to pass some parameters to runc in order to run rootless containers. See below and compare with the previous version.

Note: In order to use this feature, "User Namespaces" must be compiled and enabled in your kernel. There are various ways to do this depending on your distribution:

  • Confirm CONFIG_USER_NS=y is set in your kernel configuration (normally found in /proc/config.gz)
  • Arch/Debian: echo 1 > /proc/sys/kernel/unprivileged_userns_clone
  • RHEL/CentOS 7: echo 28633 > /proc/sys/user/max_user_namespaces
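
The checks in the list above can be combined into a small script that reports whether user namespaces appear to be usable. This is a sketch under assumptions: check_userns and sysctl_positive are hypothetical helpers, and the script only reads the two sysctl files mentioned above (it does not inspect the kernel configuration). The unprivileged_userns_clone file exists only on some distributions, so its absence is treated as fine.

```shell
# Hypothetical helper: succeed if the file at $1 holds a positive integer.
sysctl_positive() {
    [ -r "$1" ] || return 1
    value=$(cat "$1")
    case "$value" in
        ''|*[!0-9]*) return 1 ;;
    esac
    [ "$value" -gt 0 ]
}

# Hypothetical helper: report whether user namespaces look enabled.
# Both paths can be overridden, which is mainly useful for testing.
check_userns() {
    max_file=${1:-/proc/sys/user/max_user_namespaces}
    clone_file=${2:-/proc/sys/kernel/unprivileged_userns_clone}
    if ! sysctl_positive "$max_file"; then
        echo "user namespaces look disabled: $max_file is missing or 0"
        return 1
    fi
    # This knob is distribution-specific; only check it when present.
    if [ -e "$clone_file" ] && ! sysctl_positive "$clone_file"; then
        echo "unprivileged user namespaces look disabled: $clone_file is 0"
        return 1
    fi
    echo "user namespaces look enabled"
}
```

Calling check_userns with no arguments reads the real files under /proc/sys.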

Run the following commands as an ordinary user:

# Same as the first example
mkdir ~/mycontainer
cd ~/mycontainer
mkdir rootfs
docker export $(docker create busybox) | tar -C rootfs -xvf -

# The --rootless parameter instructs runc spec to generate a configuration for a rootless container, which will allow you to run the container as a non-root user.
runc spec --rootless

# The --root parameter tells runc where to store the container state. It must be writable by the user.
runc --root /tmp/runc run mycontainerid

Supervisors

runc can be used with process supervisors and init systems to ensure that containers are restarted when they exit. An example systemd unit file looks something like this.

[Unit]
Description=Start My Container

[Service]
Type=forking
ExecStart=/usr/local/sbin/runc run -d --pid-file /run/mycontainerid.pid mycontainerid
ExecStopPost=/usr/local/sbin/runc delete mycontainerid
WorkingDirectory=/mycontainer
PIDFile=/run/mycontainerid.pid

[Install]
WantedBy=multi-user.target

License

The code and docs are released under the Apache 2.0 license.