runc/libcontainer
Antonio Murdaca 75acc7c7c3
libcontainer: selinux: fix DupSecOpt and DisableSecOpt
`label.InitLabels` takes options as a string slice in the form of:

    user:system_u
    role:system_r
    type:container_t
    level:s0:c4,c5

However, `DupSecOpt` and `DisableSecOpt` were still adding a docker
specifc `label=` in front of every option. That leads to `InitLabels`
not being able to correctly init selinux labels in this scenario for
instance:

    label.InitLabels(DupSecOpt([%OPTIONS%]))

if `%OPTIONS` has options prefixed with `label=`, that's going to fail.
Fix this by removing that docker specific `label=` prefix.

Signed-off-by: Antonio Murdaca <runcom@redhat.com>
2017-02-06 17:29:42 +01:00
..
apparmor Updating error condition in applying apparmor profile 2016-05-04 19:10:55 +05:30
cgroups Merge pull request #1278 from datawolf/scanner 2017-01-20 17:49:44 +00:00
configs Fix go_vet errors 2017-01-06 10:20:27 +08:00
criurpc libcontainer: update criurpc.proto 2016-02-19 02:38:02 +03:00
devices Don't add device to list if it doesn't exist anymore 2016-12-07 11:08:00 -08:00
integration *: fix go-vet failures 2017-01-04 09:48:32 +11:00
keys libcontainer: rename keyctl package to keys 2016-07-25 20:59:26 -03:00
label Revert "DupSecOpt needs to match InitLabels" 2017-02-01 09:14:20 +01:00
nsenter Set init processes as non-dumpable 2017-01-11 09:56:56 -08:00
seccomp move error check out of the for loop 2017-01-18 05:02:39 +00:00
selinux libcontainer: selinux: fix DupSecOpt and DisableSecOpt 2017-02-06 17:29:42 +01:00
specconv Bump runtime-spec to v1.0.0-rc3 2016-12-17 14:02:35 +08:00
stacktrace fix typos 2016-11-30 13:31:36 +08:00
system Fix issue in `GetProcessStartTime` 2016-10-20 11:34:21 -07:00
user Cleanup: remove redundant code 2017-01-09 01:56:14 -05:00
utils Remove a compiler warning in some environments 2017-01-24 14:06:15 +00:00
xattr Fixing xattr test step issue 2015-11-29 09:24:42 +05:30
README.md Merge pull request #1284 from stevenh/godoc 2017-01-30 10:56:58 -08:00
SPEC.md Do not create /dev/fuse by default 2016-08-12 13:00:24 +01:00
capabilities_ambient.go Move ambient capabilties behind build tag 2016-11-02 10:59:59 -07:00
capabilities_linux.go Move ambient capabilties behind build tag 2016-11-02 10:59:59 -07:00
capabilities_noambient.go Move ambient capabilties behind build tag 2016-11-02 10:59:59 -07:00
compat_1.5_linux.go Don't set /proc/<PID>/setgroups to deny in Go1.5 2015-08-03 14:59:15 -04:00
console.go Merge pull request #1018 from cyphar/console-rewrite 2016-12-07 14:37:19 -08:00
console_freebsd.go console: don't chown(2) the slave PTY 2016-12-01 15:49:36 +11:00
console_linux.go runc: implement --console-socket 2016-12-01 15:49:36 +11:00
console_solaris.go console: don't chown(2) the slave PTY 2016-12-01 15:49:36 +11:00
console_windows.go console: don't chown(2) the slave PTY 2016-12-01 15:49:36 +11:00
container.go Correct container.Destroy() docs 2017-02-03 16:18:29 +00:00
container_linux.go libcontainer: init: only pass stateDirFd when creating a container 2017-02-02 00:41:11 +11:00
container_linux_test.go Fix trivial style errors reported by `go vet` and `golint` 2016-04-12 08:13:16 +00:00
container_solaris.go Get runc to build clean on Solaris 2016-04-12 16:13:08 -07:00
container_windows.go Windows: Refactor Container interface 2015-11-02 15:12:16 -08:00
criu_opts_unix.go Fix trivial style errors reported by `go vet` and `golint` 2016-04-12 08:13:16 +00:00
criu_opts_windows.go Windows: Factor down criu_opts 2015-10-23 12:58:59 -07:00
error.go Fix the outdated comment for Error interface 2017-01-03 15:06:47 +08:00
error_test.go [unittest] add extra ErrorCode in TestErrorCode testcase 2016-09-22 20:15:54 +08:00
factory.go Update import paths for new repository 2015-06-21 19:29:59 -07:00
factory_linux.go Merge pull request #1293 from stevenh/resolve-initarg 2017-02-03 19:25:52 +08:00
factory_linux_test.go Serialize CommandHooks to state 2016-03-03 16:57:51 +00:00
generic_error.go libcontainer: refactor syncT handling 2016-12-01 15:46:04 +11:00
generic_error_test.go Move libcontainer into subdirectory 2015-06-21 19:29:15 -07:00
init_linux.go Merge pull request #1285 from stevenh/signal-wait 2017-02-06 16:41:24 +08:00
message_linux.go *: console rewrite 2016-12-01 15:49:36 +11:00
network_linux.go libcontainer: network_linux.go: fix go vet 2015-11-30 12:31:18 +01:00
notify_linux.go libcontainer: Add support for memcg pressure notifications 2015-12-28 13:36:55 -05:00
notify_linux_test.go libcontainer: Add support for memcg pressure notifications 2015-12-28 13:36:55 -05:00
process.go tests: fix all the things 2016-12-01 15:49:37 +11:00
process_linux.go libcontainer: init: only pass stateDirFd when creating a container 2017-02-02 00:41:11 +11:00
restored_process.go Add signal API to Container interface 2015-08-03 17:07:29 -07:00
rootfs_linux.go Do not create cgroup dir name from combining subsystems 2017-01-11 15:27:58 +08:00
rootfs_linux_test.go Remove check for binding to / 2016-09-29 15:26:09 -07:00
setgroups_linux.go Don't set /proc/<PID>/setgroups to deny in Go1.5 2015-08-03 14:59:15 -04:00
setns_init_linux.go libcontainer: init: only pass stateDirFd when creating a container 2017-02-02 00:41:11 +11:00
standard_init_linux.go Set init processes as non-dumpable 2017-01-11 09:56:56 -08:00
state_linux.go Correct docs typo for restoredState. 2017-02-03 16:19:01 +00:00
state_linux_test.go Fix signal handling for unit tests 2016-05-31 11:10:47 -07:00
stats.go Move libcontainer into subdirectory 2015-06-21 19:29:15 -07:00
stats_freebsd.go Move libcontainer into subdirectory 2015-06-21 19:29:15 -07:00
stats_linux.go Update import paths for new repository 2015-06-21 19:29:59 -07:00
stats_solaris.go Get runc to build clean on Solaris 2016-04-12 16:13:08 -07:00
stats_windows.go Move libcontainer into subdirectory 2015-06-21 19:29:15 -07:00
sync.go *: console rewrite 2016-12-01 15:49:36 +11:00

README.md

libcontainer

GoDoc

Libcontainer provides a native Go implementation for creating containers with namespaces, cgroups, capabilities, and filesystem access controls. It allows you to manage the lifecycle of the container performing additional operations after the container is created.

Container

A container is a self contained execution environment that shares the kernel of the host system and which is (optionally) isolated from other containers in the system.

Using libcontainer

Because containers are spawned in a two step process you will need a binary that will be executed as the init process for the container. In libcontainer, we use the current binary (/proc/self/exe) to be executed as the init process, and use arg "init", we call the first step process "bootstrap", so you always need a "init" function as the entry of "bootstrap".

In addition to the go init function the early stage bootstrap is handled by importing nsenter.

import (
	_ "github.com/opencontainers/runc/libcontainer/nsenter"
)

func init() {
	if len(os.Args) > 1 && os.Args[1] == "init" {
		runtime.GOMAXPROCS(1)
		runtime.LockOSThread()
		factory, _ := libcontainer.New("")
		if err := factory.StartInitialization(); err != nil {
			logrus.Fatal(err)
		}
		panic("--this line should have never been executed, congratulations--")
	}
}

Then to create a container you first have to initialize an instance of a factory that will handle the creation and initialization for a container.

factory, err := libcontainer.New("/var/lib/container", libcontainer.Cgroupfs, libcontainer.InitArgs(os.Args[0], "init"))
if err != nil {
	logrus.Fatal(err)
	return
}

Once you have an instance of the factory created we can create a configuration struct describing how the container is to be created. A sample would look similar to this:

defaultMountFlags := syscall.MS_NOEXEC | syscall.MS_NOSUID | syscall.MS_NODEV
config := &configs.Config{
	Rootfs: "/your/path/to/rootfs",
	Capabilities: []string{
		"CAP_CHOWN",
		"CAP_DAC_OVERRIDE",
		"CAP_FSETID",
		"CAP_FOWNER",
		"CAP_MKNOD",
		"CAP_NET_RAW",
		"CAP_SETGID",
		"CAP_SETUID",
		"CAP_SETFCAP",
		"CAP_SETPCAP",
		"CAP_NET_BIND_SERVICE",
		"CAP_SYS_CHROOT",
		"CAP_KILL",
		"CAP_AUDIT_WRITE",
	},
	Namespaces: configs.Namespaces([]configs.Namespace{
		{Type: configs.NEWNS},
		{Type: configs.NEWUTS},
		{Type: configs.NEWIPC},
		{Type: configs.NEWPID},
		{Type: configs.NEWUSER},
		{Type: configs.NEWNET},
	}),
	Cgroups: &configs.Cgroup{
		Name:   "test-container",
		Parent: "system",
		Resources: &configs.Resources{
			MemorySwappiness: nil,
			AllowAllDevices:  nil,
			AllowedDevices:   configs.DefaultAllowedDevices,
		},
	},
	MaskPaths: []string{
		"/proc/kcore",
		"/sys/firmware",
	},
	ReadonlyPaths: []string{
		"/proc/sys", "/proc/sysrq-trigger", "/proc/irq", "/proc/bus",
	},
	Devices:  configs.DefaultAutoCreatedDevices,
	Hostname: "testing",
	Mounts: []*configs.Mount{
		{
			Source:      "proc",
			Destination: "/proc",
			Device:      "proc",
			Flags:       defaultMountFlags,
		},
		{
			Source:      "tmpfs",
			Destination: "/dev",
			Device:      "tmpfs",
			Flags:       syscall.MS_NOSUID | syscall.MS_STRICTATIME,
			Data:        "mode=755",
		},
		{
			Source:      "devpts",
			Destination: "/dev/pts",
			Device:      "devpts",
			Flags:       syscall.MS_NOSUID | syscall.MS_NOEXEC,
			Data:        "newinstance,ptmxmode=0666,mode=0620,gid=5",
		},
		{
			Device:      "tmpfs",
			Source:      "shm",
			Destination: "/dev/shm",
			Data:        "mode=1777,size=65536k",
			Flags:       defaultMountFlags,
		},
		{
			Source:      "mqueue",
			Destination: "/dev/mqueue",
			Device:      "mqueue",
			Flags:       defaultMountFlags,
		},
		{
			Source:      "sysfs",
			Destination: "/sys",
			Device:      "sysfs",
			Flags:       defaultMountFlags | syscall.MS_RDONLY,
		},
	},
	UidMappings: []configs.IDMap{
		{
			ContainerID: 0,
			HostID: 1000,
			Size: 65536,
		},
	},
	GidMappings: []configs.IDMap{
		{
			ContainerID: 0,
			HostID: 1000,
			Size: 65536,
		},
	},
	Networks: []*configs.Network{
		{
			Type:    "loopback",
			Address: "127.0.0.1/0",
			Gateway: "localhost",
		},
	},
	Rlimits: []configs.Rlimit{
		{
			Type: syscall.RLIMIT_NOFILE,
			Hard: uint64(1025),
			Soft: uint64(1025),
		},
	},
}

Once you have the configuration populated you can create a container:

container, err := factory.Create("container-id", config)
if err != nil {
	logrus.Fatal(err)
	return
}

To spawn bash as the initial process inside the container and have the processes pid returned in order to wait, signal, or kill the process:

process := &libcontainer.Process{
	Args:   []string{"/bin/bash"},
	Env:    []string{"PATH=/bin"},
	User:   "daemon",
	Stdin:  os.Stdin,
	Stdout: os.Stdout,
	Stderr: os.Stderr,
}

err := container.Run(process)
if err != nil {
	container.Destroy()
	logrus.Fatal(err)
	return
}

// wait for the process to finish.
_, err := process.Wait()
if err != nil {
	logrus.Fatal(err)
}

// destroy the container.
container.Destroy()

Additional ways to interact with a running container are:

// return all the pids for all processes running inside the container.
processes, err := container.Processes()

// get detailed cpu, memory, io, and network statistics for the container and
// it's processes.
stats, err := container.Stats()

// pause all processes inside the container.
container.Pause()

// resume all paused processes.
container.Resume()

// send signal to container's init process.
container.Signal(signal)

// update container resource constraints.
container.Set(config)

// get current status of the container.
status, err := container.Status()

// get current container's state information.
state, err := container.State()

Checkpoint & Restore

libcontainer now integrates CRIU for checkpointing and restoring containers. This let's you save the state of a process running inside a container to disk, and then restore that state into a new process, on the same machine or on another machine.

criu version 1.5.2 or higher is required to use checkpoint and restore. If you don't already have criu installed, you can build it from source, following the online instructions. criu is also installed in the docker image generated when building libcontainer with docker.

Code and documentation copyright 2014 Docker, inc. Code released under the Apache 2.0 license. Docs released under Creative commons.