jasder/runc - runc - 军科开源项目托管

Commit Graph

Author	SHA1	Message	Date
Kir Kolyshkin	f160352682	libct/cgroup: prep to rm GetClosestMountpointAncestor This function is not very efficient, does not really belong to cgroup package, and is only used once (from fs/cpuset.go). Prepare to remove it by replacing with the implementation based on the parser from github.com/moby/sys/mountinfo parser. This commit is here to make sure the proposed replacement passes the unit test. Funny, but the unit test need to be slightly modified since it supplies the wrong mountinfo (space as the first character, empty line at the end). Validated by $ go test -v -run Ance === RUN TestGetClosestMountpointAncestor --- PASS: TestGetClosestMountpointAncestor (0.00s) PASS ok github.com/opencontainers/runc/libcontainer/cgroups 0.002s Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-13 16:26:16 -07:00
Kir Kolyshkin	85d4264d8a	Merge pull request #2390 from lifubang/threadedordomain cgroupv2: don't enable threaded mode by default LGTMs: AkihiroSuda, cyphar, kolyshkin	2020-05-13 14:30:25 -07:00
Kir Kolyshkin	4b71877f99	Merge pull request #2292 from Creatone/creatone/extend-intelrdt Add RDT Memory Bandwidth Monitoring (MBM) and Cache Monitoring Technology (CMT) statistics.	2020-05-13 13:33:55 -07:00
Kir Kolyshkin	41855317b6	Merge pull request #2271 from katarzyna-z/kk-cpuacct-usage-all Add reading of information from cpuacct.usage_all	2020-05-13 13:33:05 -07:00
lifubang	fe0669b26d	don't enable threaded mode by default Because in threaded mode, we can't enable the memory controller -- it isn't thread-aware. Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-05-13 16:27:36 +08:00
Akihiro Suda	2b9a36ee8c	Merge pull request #2398 from pkagrawal/master Honor spec.Process.NoNewPrivileges in specconv.CreateLibcontainerConfig	2020-05-12 15:05:55 +09:00
Pradyumna Agrawal	4aa9101477	Honor spec.Process.NoNewPrivileges in specconv.CreateLibcontainerConfig The change ensures that the passed in value of NoNewPrivileges under spec.Process is reflected in the container config generated by specconv.CreateLibcontainerConfig Closes #2397 Signed-off-by: Pradyumna Agrawal <pradyumnaa@vmware.com>	2020-05-11 13:38:14 -07:00
Kir Kolyshkin	714c91e9f7	Simplify cgroup path handing in v2 via unified API This unties the Gordian Knot of using GetPaths in cgroupv2 code. The problem is, the current code uses GetPaths for three kinds of things: 1. Get all the paths to cgroup v1 controllers to save its state (see (linuxContainer).currentState(), (LinuxFactory).loadState() methods). 2. Get all the paths to cgroup v1 controllers to have the setns process enter the proper cgroups in `(*setnsProcess).start()`. 3. Get the path to a specific controller (for example, `m.GetPaths()["devices"]`). Now, for cgroup v2 instead of a set of per-controller paths, we have only one single unified path, and a dedicated function `GetUnifiedPath()` to get it. This discrepancy between v1 and v2 cgroupManager API leads to the following problems with the code: - multiple if/else code blocks that have to treat v1 and v2 separately; - backward-compatible GetPaths() methods in v2 controllers; - - repeated writing of the PID into the same cgroup for v2; Overall, it's hard to write the right code with all this, and the code that is written is kinda hard to follow. The solution is to slightly change the API to do the 3 things outlined above in the same manner for v1 and v2: 1. Use `GetPaths()` for state saving and setns process cgroups entering. 2. Introduce and use Path(subsys string) to obtain a path to a subsystem. For v2, the argument is ignored and the unified path is returned. This commit converts all the controllers to the new API, and modifies all the users to use it. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 12:04:06 -07:00
Kir Kolyshkin	2c8d668eee	Merge pull request #2387 from kolyshkin/g-knot-prepare cgroup refactoring LGTMs: AkihiroSuda, mrunalp.	2020-05-08 12:03:22 -07:00
Kir Kolyshkin	1d143562d2	libct/cgroups/fs: access m.paths under lock 1. Prevent theoretical "concurrent map access" error to m.paths. 2. There is no need to call m.Paths -- we can access m.paths directly. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:09:55 -07:00
Kir Kolyshkin	51e1a0842d	libct/cgroups/systemd/v1: privatize v1 manager This patch was generated entirely by gorename -- nothing to review here. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:09:48 -07:00
Kir Kolyshkin	d827e323b0	libct/cgroups/systemd/v1: add NewLegacyManager Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:07:40 -07:00
Kir Kolyshkin	fc620fdf81	libct/cgroups/fs: privatize Manager and its fields This was generated entirely by gorename -- nothing to review here. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:07:00 -07:00
Kir Kolyshkin	5935bf8c21	libct/cgroups/fs: introduce NewManager() ...and use it from libcontainer/factory_linux.go. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:06:05 -07:00
Kir Kolyshkin	24f945e08d	libct/cgroups/systemd/v2: return a public interface Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:06:02 -07:00
Kir Kolyshkin	63854b0ea8	newSetnsProcess: reuse state.CgroupPaths c.cgroupManager.GetPaths() are called twice here: once in currentState() and then in newSetnsProcess(). Reuse the result of the first call, which is stored into state.CgroupPaths. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:05:59 -07:00
Kir Kolyshkin	9a3e632625	notify: simplify usage Instead of passing the whole map of paths, pass the path to the memory controller which these functions actually require. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-05-08 10:05:58 -07:00
Alice Frosi	b18a9650f8	test: update devicefilter tests The test cases need to take into account the assembly modifications. The instruction: LdXMemH dst: r2 src: r1 off: 0 imm: 0 has been replaced with: LdXMemW dst: r2 src: r1 off: 0 imm: 0 And32Imm dst: r2 imm: 65535 Signed-off-by: Alice Frosi <afrosi@de.ibm.com>	2020-05-08 07:31:05 +01:00
Alice Frosi	128cb60f58	ebpf: fix big endian issue for s390x Load the full 32 bits word and take the lower 16 bits, instead of reading just 16 bits. Same fix as `07bae05e61` Signed-off-by: Alice Frosi <afrosi@de.ibm.com>	2020-05-08 07:31:05 +01:00
Akihiro Suda	bf15cc99b1	cgroup v2: support rootless systemd Tested with both Podman (master) and Moby (master), on Ubuntu 19.10 . $ podman --cgroup-manager=systemd run -it --rm --runtime=runc \ --cgroupns=host --memory 42m --cpus 0.42 --pids-limit 42 alpine / # cat /proc/self/cgroup 0::/user.slice/user-1001.slice/user@1001.service/user.slice/libpod-132ff0d72245e6f13a3bbc6cdc5376886897b60ac59eaa8dea1df7ab959cbf1c.scope / # cat /sys/fs/cgroup/user.slice/user-1001.slice/user@1001.service/user.slice/libpod-132ff0d72245e6f13a3bbc6cdc5376886897b60ac59eaa8dea1df7ab959cbf1c.scope/memory.max 44040192 / # cat /sys/fs/cgroup/user.slice/user-1001.slice/user@1001.service/user.slice/libpod-132ff0d72245e6f13a3bbc6cdc5376886897b60ac59eaa8dea1df7ab959cbf1c.scope/cpu.max 42000 100000 / # cat /sys/fs/cgroup/user.slice/user-1001.slice/user@1001.service/user.slice/libpod-132ff0d72245e6f13a3bbc6cdc5376886897b60ac59eaa8dea1df7ab959cbf1c.scope/pids.max 42 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-05-08 12:39:20 +09:00
Akihiro Suda	492cfd8bf9	Merge pull request #2352 from lifubang/eventsv2 fix runc events error in cgroup v2	2020-05-08 12:51:05 +09:00
lifubang	657407ff23	fix runc events error in cgroup v2 Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-05-07 22:18:46 +08:00
Sebastiaan van Stijn	b48bbdd08d	vendor: opencontainers/selinux v1.5.1, update deprecated uses full diff: https://github.com/opencontainers/selinux/v1.4.0...v1.5.1 Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-05 15:53:40 +02:00
Katarzyna Kujawa	407e9f9d0d	Add reading of information from cpuacct.usage_all Remove logrus logs from tests Signed-off-by: Katarzyna Kujawa <katarzyna.kujawa@intel.com>	2020-05-05 08:51:12 +02:00
Mrunal Patel	a57358e016	Merge pull request #2370 from lifubang/swap0 let runc disable swap in cgroup v2	2020-05-04 16:57:12 -07:00
Sebastiaan van Stijn	402d645c5c	Simplify ticks, as the value is a constant See for example in the Musl libc source code https://git.musl-libc.org/cgit/musl/tree/src/conf/sysconf.c#n29 This removes the cgo dependency for the system package. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-04 23:05:46 +02:00
Sebastiaan van Stijn	9df0b5e268	libcontainer: RunningInUserNS() use sync.Once Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-04 15:53:33 +02:00
Mrunal Patel	6161d255b6	Merge pull request #2375 from tedyu/wait-lazy-close Close fd in case fd.Write() returns error	2020-05-03 09:03:40 -07:00
lifubang	a70f354680	let runc disable swap in cgroup v2 In cgroup v2, when memory and memorySwap set to the same value which is greater than zero, runc should write zero in `memory.swap.max` to disable swap. Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-05-03 20:57:36 +08:00
Ted Yu	db29dce076	Close fd in case fd.Write() returns error Signed-off-by: Ted Yu <yuzhihong@gmail.com>	2020-05-02 20:06:08 -07:00
Sebastiaan van Stijn	64ca54816c	libcontainer: simplify error message The error message was including both the rootfs path, and the full mount path, which also includes the path of the rootfs. This patch removes the rootfs path from the error message, as it was redundant, and made the error message overly verbose Before this patch (errors wrapped for readability): ``` container_linux.go:348: starting container process caused: process_linux.go:438: container init caused: rootfs_linux.go:58: mounting "/foo.txt" to rootfs "/var/lib/docker/overlay2/de506d67da606b807009e23b548fec60d72359c77eec88785d8c7ecd54a6e4b2/merged" at "/var/lib/docker/overlay2/de506d67da606b807009e23b548fec60d72359c77eec88785d8c7ecd54a6e4b2/merged/usr/share/nginx/html" caused: not a directory: unknown ``` With this patch applied: ``` container_linux.go:348: starting container process caused: process_linux.go:438: container init caused: rootfs_linux.go:58: mounting "/foo.txt" to rootfs at "/var/lib/docker/overlay2/de506d67da606b807009e23b548fec60d72359c77eec88785d8c7ecd54a6e4b2/merged/usr/share/nginx/html" caused: not a directory: unknown ``` Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-03 02:59:46 +02:00
Sebastiaan van Stijn	2adfd20ac9	libcontainer: don't double-quote errors genericError.Error() was formatting the underlying error using `%q`; as a result, quotes in underlying errors were escaped multiple times, which caused the output to become hard to read, for example (wrapped for readability): ``` container_linux.go:345: starting container process caused "process_linux.go:430: container init caused \"rootfs_linux.go:58: mounting \\\"/foo.txt\\\" to rootfs \\\"/var/lib/docker/overlay2/f49a0ae0ec6646c818dcf05dbcbbdd79fc7c42561f3684fbb1fc5d2b9d3ad192/merged\\\" at \\\"/var/lib/docker/overlay2/f49a0ae0ec6646c818dcf05dbcbbdd79fc7c42561f3684fbb1fc5d2b9d3ad192/merged/usr/share/nginx/html\\\" caused \\\"not a directory\\\"\"": unknown ``` With this patch applied: ``` container_linux.go:348: starting container process caused: process_linux.go:438: container init caused: rootfs_linux.go:58: mounting "/foo.txt" to rootfs "/var/lib/docker/overlay2/de506d67da606b807009e23b548fec60d72359c77eec88785d8c7ecd54a6e4b2/merged" at "/var/lib/docker/overlay2/de506d67da606b807009e23b548fec60d72359c77eec88785d8c7ecd54a6e4b2/merged/usr/share/nginx/html" caused: not a directory: unknown ``` Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-05-03 02:55:15 +02:00
Kir Kolyshkin	c3b0b13fe9	cgroups/fs2: don't always parse /proc/self/cgroup Function defaultPath always parses /proc/self/cgroup, but the resulting value is not always used. Avoid unnecessary reading/parsing by moving the code to just before its use. Modify the test case accordingly. [v2: test: use UnifiedMountpoint, skip test if not on v2] Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-04-28 22:16:36 -07:00
Kir Kolyshkin	0a4dcc0203	Merge pull request #2331 from lifubang/StartTransientUnit check that StartTransientUnit/StopUnit succeeds LGTMs: @AkihiroSuda @kolyshkin Closes #2313, #2309	2020-04-28 10:47:52 -07:00
lifubang	bfa1b2aab3	check that StartTransientUnit and StopUnit succeeds Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-04-28 15:46:28 +08:00
Akihiro Suda	60c647e3b8	fs2: fix cgroup.subtree_control EPERM on rootless + add CI Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-27 13:30:15 +09:00
Paweł Szulik	799d94818d	intelrdt: Add Cache Monitoring Technology stats Signed-off-by: Paweł Szulik <pawel.szulik@intel.com>	2020-04-25 09:43:48 +02:00
Kir Kolyshkin	b19f9cecfe	Merge pull request #2343 from lifubang/updateSystemdScope fix data inconsistency when using runc update in systemd driven cgroup	2020-04-24 23:34:19 -07:00
Akihiro Suda	0fd8d468ea	Merge pull request #2318 from lifubang/linuxResources cgroupv2: use default allowed devices when linux resources is null	2020-04-25 09:00:23 +09:00
Mrunal Patel	634e51b52c	Merge pull request #2335 from kolyshkin/cgroupv2-cpt Fix cgroupv2 checkpoint/restore	2020-04-24 08:47:36 -07:00
Akihiro Suda	49ca1fd074	Merge pull request #2347 from kolyshkin/v2-allow-all-devs cgroupv2: allow to set EnableAllDevices=true	2020-04-24 16:09:40 +09:00
Mrunal Patel	c420a3ec7f	Merge pull request #2324 from kolyshkin/criu-freezer libcontainer: fix Checkpoint wrt cgroupv2	2020-04-23 19:24:38 -07:00
Kir Kolyshkin	440244268b	Merge pull request #2330 from KentaTada/use-linuxnamespace-const libcontainer: use consts of Namespace from runtime-spec	2020-04-23 18:58:29 -07:00
Kir Kolyshkin	55d5c99ca7	libct/mountToRootfs: rm useless code To make a bind mount read-only, it needs to be remounted. This is what the code removed does, but it is not needed here. We have to deal with three cases here: 1. cgroup v2 unified mode. In this case the mount is real mount with fstype=cgroup2, and there is no need to have a bind mount on top, as we pass readonly flag to the mount as is. 2. cgroup v1 + cgroupns (enableCgroupns == true). In this case the "mount" is in fact a set of real mounts with fstype=cgroup, and they are all performed in mountCgroupV1, with readonly flag added if needed. 3. cgroup v1 as is (enableCgroupns == false). In this case mountCgroupV1() calls mountToRootfs() again with an argument from the list obtained from getCgroupMounts(), i.e. a bind mount with the same flags as the original mount has (plus unix.MS_BIND \| unix.MS_REC), and mountToRootfs() does remounting (under the case "bind":). So, the code which this patch is removing is not needed -- it essentially does nothing in case 3 above (since the bind mount is already remounted readonly), and in cases 1 and 2 it creates an unneeded extra bind mount on top of a real one (or set of real ones). Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-04-23 16:49:12 -07:00
Kir Kolyshkin	20959b1666	libcontainer/integration/checkpoint_test: simplify Since commit `9280e3566d` it is not longer needed to have `cgroup2' mount. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-04-23 15:22:32 -07:00
lifubang	1d4ccc8e0c	fix data inconsistent when runc update in systemd driven cgroup v1 Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-04-23 19:32:57 +08:00
lifubang	7682a2b2a5	fix data inconsistent when runc update in systemd driven cgroup v2 Signed-off-by: lifubang <lifubang@acmcoder.com>	2020-04-23 19:32:07 +08:00
Kenta Tada	4474795388	libcontainer: use x/sys/unix instead of the hardcoded value PR_SET_CHILD_SUBREAPER is defined in x/sys/unix. Signed-off-by: Kenta Tada <Kenta.Tada@sony.com>	2020-04-23 10:49:51 +09:00
Kir Kolyshkin	9280e3566d	checkpoint/restore: fix cgroupv2 handling In case of cgroupv2 unified hierarchy, the /sys/fs/cgroup mount is the real mount with fstype of cgroup2 (rather than a set of external bind mounts like for cgroupv1). So, we should not add it to the list of "external bind mounts" on both checkpoint and restore. Without this fix, checkpoint integration tests fail on cgroup v2. Also, same is true for cgroup v1 + cgroupns. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-04-22 11:26:43 -07:00
Kir Kolyshkin	75a92ea615	cgroupv2: allow to set EnableAllDevices=true In this case we just do not install any eBPF rules checking the devices. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-04-22 11:05:36 -07:00

1 2 3 4 5 ...

1486 Commits