jasder/runc - runc - 军科开源项目托管

Commit Graph

Author	SHA1	Message	Date
Aleksa Sarai	93e5c4d320	merge branch 'pr-2232' Aleksa Sarai (1): libcontainer: dual-license nsenter/cloned_binary.c LGTMs: @mrunalp @AkihiroSuda Closes #2232	2020-03-04 11:10:49 +11:00
Qiang Huang	3b7e32feba	Merge pull request #2210 from Zyqsempai/2164-remove-deprecated-systemd-resources Exchange deprecated systemd resources with the appropriate for cgroupv2	2020-02-29 10:13:55 +08:00
Aleksa Sarai	98de84265d	libcontainer: dual-license nsenter/cloned_binary.c The new license is Apache-2.0 OR LPGL-2.1-or-later. This is necessary for libcrun to be relicensed under the LGPL-2.1[1], and all of the relevant copyright holders have agreed to relicense this code under the dual license: * Aleksa Sarai [2] * Christian Brauner [3] * Justin Cormack [4] Because it is still dual-licensed as an Apache-2.0 work, this doesn't affect it's usability within runc or any other dependent projects. [1]: https://github.com/containers/crun/issues/256 [2]: https://github.com/containers/crun/issues/256#issuecomment-589498088 [3]: https://github.com/containers/crun/issues/256#issuecomment-589605034 [4]: https://github.com/containers/crun/issues/256#issuecomment-589504231 Signed-off-by: Aleksa Sarai <asarai@suse.de>	2020-02-22 00:17:07 +11:00
Aleksa Sarai	0f32b03dda	merge branch 'pr-2192' Boris Popovschi (2): Fix skip message for cgroupv2 Fix MAJ:MIN io.stat parsing order LGTMs: @hqhq @cyphar Closes #2192	2020-02-21 16:00:17 +11:00
Boris Popovschi	4b8134f63b	Convert blkioWeight to io.weight properly Signed-off-by: Boris Popovschi <zyqsempai@mail.ru>	2020-02-18 15:44:07 +02:00
Kir Kolyshkin	1cd71dfd71	systemd properties: support for *Sec values Some systemd properties are documented as having "Sec" suffix (e.g. "TimeoutStopSec") but are expected to have "USec" suffix when passed over dbus, so let's provide appropriate conversion to improve compatibility. This means, one can specify TimeoutStopSec with a numeric argument, in seconds, and it will be properly converted to TimeoutStopUsec with the argument in microseconds. As a side bonus, even float values are converted, so e.g. TimeoutStopSec=1.5 is possible. This turned out a bit more tricky to implement when I was originally expected, since there are a handful of numeric types in dbus and each one requires explicit conversion. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-02-17 16:07:19 -08:00
Kir Kolyshkin	4c5c3fb960	Support for setting systemd properties via annotations In case systemd is used to set cgroups for the container, it creates a scope unit dedicated to it (usually named `runc-$ID.scope`). This patch adds an ability to set arbitrary systemd properties for the systemd unit via runtime spec annotations. Initially this was developed as an ability to specify the `TimeoutStopUSec` property, but later generalized to work with arbitrary ones. Example usage: add the following to runtime spec (config.json): ``` "annotations": { "org.systemd.property.TimeoutStopUSec": "uint64 123456789", "org.systemd.property.CollectMode":"'inactive-or-failed'" }, ``` and start the container (e.g. `runc --systemd-cgroup run $ID`). The above will set the following systemd parameters: * `TimeoutStopSec` to 2 minutes and 3 seconds, * `CollectMode` to "inactive-or-failed". The values are in the gvariant format (see [1]). To figure out which type systemd expects for a particular parameter, see systemd sources. In particular, parameters with `USec` suffix require an `uint64` typed argument, while gvariant assumes int32 for a numeric values, therefore the explicit type is required. NOTE that systemd receives the time-typed parameters as USec but shows them (in `systemctl show`) as Sec. For example, the stop timeout should be set as `TimeoutStopUSec` but is shown as `TimeoutStopSec`. [1] https://developer.gnome.org/glib/stable/gvariant-text.html Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-02-17 16:07:19 -08:00
Mrunal Patel	81ef5024f8	Merge pull request #2213 from Zyqsempai/2166-convert-cpu-weight-poperly Added conversion for cpu.weight v2	2020-02-17 07:49:39 -08:00
Boris Popovschi	7c439cc6f6	Added conversion for cpu.weight v2 Signed-off-by: Boris Popovschi <zyqsempai@mail.ru>	2020-02-12 11:32:34 +02:00
Andrei Vagin	269ea385a4	restore: fix a race condition in process.Wait() Adrian reported that the checkpoint test stated failing: === RUN TestCheckpoint --- FAIL: TestCheckpoint (0.38s) checkpoint_test.go:297: Did not restore the pipe correctly: The problem here is when we start exec.Cmd, we don't call its wait method. This means that we don't wait cmd.goroutines ans so we don't know when all data will be read from process pipes. Signed-off-by: Andrei Vagin <avagin@gmail.com>	2020-02-10 10:21:08 -08:00
Boris Popovschi	3b992087b8	Fix skip message for cgroupv2 Signed-off-by: Boris Popovschi <zyqsempai@mail.ru>	2020-02-03 14:27:12 +02:00
Mrunal Patel	2fc03cc11c	Merge pull request #2207 from cyphar/fix-double-volume-attack rootfs: do not permit /proc mounts to non-directories	2020-01-22 08:06:10 -08:00
Aleksa Sarai	3291d66b98	rootfs: do not permit /proc mounts to non-directories mount(2) will blindly follow symlinks, which is a problem because it allows a malicious container to trick runc into mounting /proc to an entirely different location (and thus within the attacker's control for a rename-exchange attack). This is just a hotfix (to "stop the bleeding"), and the more complete fix would be finish libpathrs and port runc to it (to avoid these types of attacks entirely, and defend against a variety of other /proc-related attacks). It can be bypased by someone having "/" be a volume controlled by another container. Fixes: CVE-2019-19921 Signed-off-by: Aleksa Sarai <asarai@suse.de>	2020-01-17 14:00:30 +11:00
Aleksa Sarai	f6fb7a0338	merge branch 'pr-2133' Julia Nedialkova (1): Handle ENODEV when accessing the freezer.state file LGTMs: @crosbymichael @cyphar Closes #2133	2020-01-17 02:07:19 +11:00
Boris Popovschi	5b96f314ba	Exchanged deprecated systemd resources with the appropriate for cgroupv2 Signed-off-by: Boris Popovschi <zyqsempai@mail.ru>	2020-01-15 18:09:33 +02:00
Boris Popovschi	cf9b7c33e1	Fix MAJ:MIN io.stat parsing order Signed-off-by: Boris Popovschi <zyqsempai@mail.ru>	2020-01-15 14:39:14 +02:00
Akihiro Suda	55f8c254be	temporarily disable CRIU tests Ubuntu kernel is temporarily broken: https://github.com/opencontainers/runc/pull/2198#issuecomment-571124087 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-01-14 11:18:44 +09:00
Akihiro Suda	5c20ea1472	fix merging #2177 and #2169 A new method was added to the cgroup interface when #2177 was merged. After #2177 got merged, #2169 was merged without rebase (sorry!) and compilation was failing: libcontainer/cgroups/fs2/fs2.go:208:22: container.Cgroup undefined (type *configs.Config has no field or method Cgroup) Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-01-14 11:13:25 +09:00
Mrunal Patel	5cc0deaf7a	Merge pull request #2169 from AkihiroSuda/split-fs cgroup2: split fs2 from fs	2020-01-13 16:23:27 -08:00
Michael Crosby	2b52db7527	Merge pull request #2177 from devimc/topic/libcontainer/kata-containers libcontainer: export and add new methods to allow cgroups manipulation	2020-01-02 11:47:12 -05:00
Jordan Liggitt	8541d9cf3d	Fix race checking for process exit and waiting for exec fifo Signed-off-by: Jordan Liggitt <liggitt@google.com>	2019-12-18 18:48:18 +00:00
Julio Montes	8ddd892072	libcontainer: add method to get cgroup config from cgroup Manager `configs.Cgroup` contains the configuration used to create cgroups. This configuration must be saved to disk, since it's required to restore the cgroup manager that was used to create the cgroups. Add method to get cgroup configuration from cgroup Manager to allow API users save it to disk and restore a cgroup manager later. fixes #2176 Signed-off-by: Julio Montes <julio.montes@intel.com>	2019-12-17 22:46:03 +00:00
Julio Montes	cd7c59d042	libcontainer: export createCgroupConfig A `config.Cgroups` object is required to manipulate cgroups v1 and v2 using libcontainer. Export `createCgroupConfig` to allow API users to create `config.Cgroups` objects using directly libcontainer API. Signed-off-by: Julio Montes <julio.montes@intel.com>	2019-12-17 22:46:03 +00:00
Aleksa Sarai	7496a96825	merge branch 'pr-2086' * Kurnia D Win (1): fix permission denied LGTMs: @crosbymichael @cyphar Closes #2086	2019-12-17 20:49:52 +11:00
Aleksa Sarai	201b063745	merge branch 'pr-2141' Radostin Stoyanov (1): criu: Ensure other users cannot read c/r files LGTMs: @crosbymichael @cyphar Closes #2141	2019-12-07 09:32:58 +11:00
Akihiro Suda	ec49f98d72	fs2: support legacy device spec (to pass CI) Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-06 15:53:07 +09:00
Akihiro Suda	88e8350de2	cgroup2: split fs2 from fs split fs2 package from fs, as mixing up fs and fs2 is very likely to result in unmaintainable code. Inspired by containerd/cgroups#109 Fix #2157 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-12-06 15:42:10 +09:00
Aleksa Sarai	5e63695384	merge branch 'pr-2174' Sascha Grunert (1): Expose network interfaces via runc events LGTMs: @cyphar @mrunalp Closes #2174	2019-12-06 13:07:44 +11:00
Michael Crosby	8bb10af481	Merge pull request #2165 from AkihiroSuda/travis-f31 .travis.yml: add Fedora 31 vagrant box (for cgroup2)	2019-12-05 16:26:51 -05:00
Sascha Grunert	41a20b5852	Expose network interfaces via runc events The libcontainer network statistics are unreachable without manually creating a libcontainer instance. To retrieve them via the CLI interface of runc, we now expose them as well. Signed-off-by: Sascha Grunert <sgrunert@suse.com>	2019-12-05 13:20:51 +01:00
Akihiro Suda	faf1e44ea9	cgroup2: ebpf: increase RLIM_MEMLOCK to avoid BPF_PROG_LOAD error Fix #2167 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-11-07 15:43:27 +09:00
Mrunal Patel	46def4cc4c	Merge pull request #2154 from jpeach/2008-remove-static-build-tag Remove the static_build build tag.	2019-11-04 17:10:59 -08:00
Akihiro Suda	ccd4436fc4	.travis.yml: add Fedora 31 vagrant box (for cgroup2) As the baby step, only unit tests are executed. Failing tests are currently skipped and will be fixed in follow-up PRs. Fix #2124 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-31 16:53:01 +09:00
Akihiro Suda	faf673ee45	cgroup2: port over eBPF device controller from crun The implementation is based on https://github.com/containers/crun/blob/0.10.2/src/libcrun/ebpf.c Although ebpf.c is originally licensed under LGPL-3.0-or-later, the author Giuseppe Scrivano agreed to relicense the file in Apache License 2.0: https://github.com/opencontainers/runc/issues/2144#issuecomment-543116397 See libcontainer/cgroups/ebpf/devicefilter/devicefilter_test.go for tested configurations. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-31 14:01:46 +09:00
Qiang Huang	e57a774066	Merge pull request #2149 from AkihiroSuda/cgroup2-ps cgroup2: implement `runc ps`	2019-10-31 09:44:39 +08:00
Qiang Huang	d239ca8425	Merge pull request #2148 from AkihiroSuda/cg2-ignore-cpuset-when-no-config cgroup2: cpuset_v2: skip Apply when no limit is specified	2019-10-29 21:57:58 +08:00
Mrunal Patel	03cf145f5a	Merge pull request #2159 from AkihiroSuda/cgroup2-mount-in-userns cgroup2: allow mounting /sys/fs/cgroup in UserNS without unsharing CgroupNS	2019-10-28 19:19:09 -07:00
Akihiro Suda	74a3fe5d1b	cgroup2: do not parse /proc/cgroups /proc/cgroups is meaningless for v2 and should be ignored. https://github.com/torvalds/linux/blob/v5.3/Documentation/admin-guide/cgroup-v2.rst#deprecated-v1-core-features * Now GetAllSubsystems() parses /sys/fs/cgroup/cgroup.controller, not /proc/cgroups. The function result also contains "pseudo" controllers: {"devices", "freezer"}. As it is hard to detect availability of pseudo controllers, pseudo controllers are always assumed to be available. * Now IOGroupV2.Name() returns "io", not "blkio" Fix #2155 #2156 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-28 00:00:33 +09:00
Akihiro Suda	9c81440fb5	cgroup2: allow mounting /sys/fs/cgroup in UserNS without unsharing CgroupNS Bind-mount /sys/fs/cgroup when we are in UserNS but CgroupNS is not unshared, because we cannot mount cgroup2. This behavior correspond to crun v0.10.2. Fix #2158 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-27 23:09:41 +09:00
James Peach	13919f5dfd	Remove the static_build build tag. The `static_build` build tag was introduced in `e9944d0f` to remove build warnings related to systemd cgroup driver dependencies. Since then, those dependencies have changed and building the systemd cgroup driver no longer imports dlopen. After this change, runc builds will always include the systemd cgroup driver. This fixes #2008. Signed-off-by: James Peach <jpeach@apache.org>	2019-10-26 08:28:45 +11:00
Michael Crosby	c4d8e1688c	Merge pull request #2140 from crosbymichael/fs-unified Set unified mountpoint in find mnt func	2019-10-24 15:20:47 -04:00
Akihiro Suda	dbd771e475	cgroup2: implement `runc ps` Implemented `runc ps` for cgroup v2 , using a newly added method `m.GetUnifiedPath()`. Unlike the v1 implementation that checks `m.GetPaths()["devices"]`, the v2 implementation does not require the device controller to be available. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-19 01:59:24 +09:00
Akihiro Suda	d918e7f408	cpuset_v2: skip Apply when no limit is specified Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-19 00:33:31 +09:00
Akihiro Suda	033936ef76	io_v2.go: remove blkio v1 code Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-10-18 21:33:48 +09:00
Radostin Stoyanov	a610a84821	criu: Ensure other users cannot read c/r files No checkpoint files should be readable by anyone else but the user creating it. Signed-off-by: Radostin Stoyanov <rstoyanov1@gmail.com>	2019-10-17 07:49:38 +01:00
Michael Crosby	b28f58f31b	Set unified mountpoint in find mnt func This is needed for the fsv2 cgroups to work when there is a unified mountpoint. Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-10-15 15:40:03 -04:00
Radostin Stoyanov	f017e0f9e1	checkpoint: Set descriptors.json file mode to 0600 Prevent unprivileged users from being able to read descriptors.json Signed-off-by: Radostin Stoyanov <rstoyanov1@gmail.com>	2019-10-12 19:29:44 +01:00
Aleksa Sarai	1b8a1eeec3	merge branch 'pr-2132' Support different field counts of cpuaact.stats LGTMs: @crosbymichael @cyphar Closes #2132	2019-10-02 01:50:47 +10:00
Aleksa Sarai	d463f6485b	*: verify that operations on /proc/... are on procfs This is an additional mitigation for CVE-2019-16884. The primary problem is that Docker can be coerced into bind-mounting a file system on top of /proc (resulting in label-related writes to /proc no longer happening). While we are working on mitigations against permitting the mounts, this helps avoid our code from being tricked into writing to non-procfs files. This is not a perfect solution (after all, there might be a bind-mount of a different procfs file over the target) but in order to exploit that you would need to be able to tweak a config.json pretty specifically (which thankfully Docker doesn't allow). Specifically this stops AppArmor from not labeling a process silently due to /proc/self/attr/... being incorrectly set, and stops any accidental fd leaks because /proc/self/fd/... is not real. Signed-off-by: Aleksa Sarai <asarai@suse.de>	2019-09-30 09:06:48 +10:00
tianye15	28e58a0f6a	Support different field counts of cpuaact.stats Signed-off-by: skilxnTL <tylxltt@gmail.com>	2019-09-29 10:20:58 +08:00
Julia Nedialkova	e63b797f38	Handle ENODEV when accessing the freezer.state file ...when checking if a container is paused Signed-off-by: Julia Nedialkova <julianedialkova@hotmail.com>	2019-09-27 17:02:56 +03:00
blacktop	84373aaa56	Add SCMP_ACT_LOG as a valid Seccomp action (#1951 ) Signed-off-by: blacktop <blacktop@users.noreply.github.com>	2019-09-26 11:03:03 -04:00
Michael Crosby	331692baa7	Only allow proc mount if it is procfs Fixes #2128 This allows proc to be bind mounted for host and rootless namespace usecases but it removes the ability to mount over the top of proc with a directory. ```bash > sudo docker run --rm apparmor docker: Error response from daemon: OCI runtime create failed: container_linux.go:346: starting container process caused "process_linux.go:449: container init caused \"rootfs_linux.go:58: mounting \\\"/var/lib/docker/volumes/aae28ea068c33d60e64d1a75916cf3ec2dc3634f97571854c9ed30c8401460c1/_data\\\" to rootfs \\\"/var/lib/docker/overlay2/a6be5ae911bf19f8eecb23a295dec85be9a8ee8da66e9fb55b47c841d1e381b7/merged\\\" at \\\"/proc\\\" caused \\\"\\\\\\\"/var/lib/docker/overlay2/a6be5ae911bf19f8eecb23a295dec85be9a8ee8da66e9fb55b47c841d1e381b7/merged/proc\\\\\\\" cannot be mounted because it is not of type proc\\\"\"": unknown. > sudo docker run --rm -v /proc:/proc apparmor docker-default (enforce) root 18989 0.9 0.0 1288 4 ? Ss 16:47 0:00 sleep 20 ``` Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2019-09-24 11:00:18 -04:00
Jonathan Rudenberg	af7b6547ec	libcontainer/nsenter: Don't import C in non-cgo file Signed-off-by: Jonathan Rudenberg <jonathan@titanous.com>	2019-09-11 17:03:07 +00:00
Giuseppe Scrivano	718a566e02	cgroup: support mount of cgroup2 convert a "cgroup" mount to "cgroup2" when the system uses cgroups v2 unified hierarchy. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2019-09-06 17:57:14 +02:00
Sebastiaan van Stijn	eb86f6037e	bump syndtr/gocapability d98352740cb2c55f81556b63d4a1ec64c5a319c2 relevant changes: - syndtr/gocapability#14 capability: Deprecate NewPid and NewFile for NewPid2 and NewFile2 - syndtr/gocapability#16 Fix capHeader.pid type Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-09-06 01:44:26 +02:00
Mrunal Patel	92ac8e3f84	Merge pull request #2113 from giuseppe/cgroupv2 libcontainer: initial support for cgroups v2	2019-09-05 13:14:29 -07:00
Giuseppe Scrivano	524cb7c318	libcontainer: add systemd.UnifiedManager Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2019-09-05 13:02:27 +02:00
Giuseppe Scrivano	ec11136828	libcontainer, cgroups: rename systemd.Manager to LegacyManager Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2019-09-05 13:02:26 +02:00
Giuseppe Scrivano	1932917b71	libcontainer: add initial support for cgroups v2 allow to set what subsystems are used by libcontainer/cgroups/fs.Manager. subsystemsUnified is used on a system running with cgroups v2 unified mode. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2019-09-05 13:02:25 +02:00
Mrunal Patel	92d851e03b	Merge pull request #2123 from carlosedp/riscv64 Bump x/sys and update syscall for initial Risc-V support	2019-09-04 14:10:26 -07:00
Carlos de Paula	4316e4d047	Bump x/sys and update syscall to start Risc-V support Signed-off-by: Carlos de Paula <me@carlosedp.com>	2019-08-29 12:09:08 -03:00
Akihiro Suda	0bc069d795	nsenter: fix clang-tidy warning nsexec.c:148:3: warning: Initialized va_list 'args' is leaked [clang-analyzer-valist.Unterminated] Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-08-29 00:18:02 +09:00
Akihiro Suda	b225ef58fb	nsenter: minor clean up Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-08-28 19:50:35 +09:00
Daniel J Walsh	e4aa73424b	Rename cgroups_windows.go to cgroups_unsupported.go Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2019-08-26 18:13:52 -04:00
Mrunal Patel	c61c7370f9	Merge pull request #2103 from sipsma/cgnil cgroups/fs: check nil pointers in cgroup manager	2019-08-26 14:05:44 -07:00
Mrunal Patel	68d73f0a2e	Merge pull request #2107 from sashayakovtseva/public-get-devices Make get devices function public	2019-08-26 09:58:10 -07:00
Kenta Tada	c740965a18	libcontainer: update masked paths of /proc This commit updates the masked paths of /proc. Related issues: * https://github.com/moby/moby/pull/37404 * https://github.com/moby/moby/pull/38299 * https://github.com/moby/moby/pull/36368 Signed-off-by: Kenta Tada <Kenta.Tada@sony.com>	2019-08-26 12:25:56 +09:00
Mrunal Patel	3525eddec5	Merge pull request #2117 from filbranden/detection1 Remove libcontainer detection for systemd features	2019-08-25 13:15:15 -07:00
Filipe Brandenburger	518c855833	Remove libcontainer detection for systemd features Transient units (and transient slice units) have been available for quite a long time and RHEL 7 with systemd v219 (likely the oldest OS we care about at this point) supports that. A system running a systemd without these features is likely to break a lot of other stuff that runc/libcontainer care about. Regarding delegated slices, modern systemd doesn't allow it and runc/libcontainer run fine on it, so we might as well just stop requesting it on older versions of systemd which allowed it. (Those versions never really changed behavior significantly when that option was passed anyways.) Signed-off-by: Filipe Brandenburger <filbranden@gmail.com>	2019-08-22 21:53:24 -07:00
Filipe Brandenburger	588f040a77	Avoid the dependency on cgo through go-systemd/util package This dependency is only needed in package "github.com/coreos/go-systemd/util" and we only use it for IsRunningSystemd(), which is a simple Go function that just stats a file. Let's just borrow it here, so we remove the dependency and can remove that package from vendored build. This also removes dependencies on dlopen and on trying to find libsystemd.so or libsystemd-login.so in the system. Tested that this still builds and works as expected. Signed-off-by: Filipe Brandenburger <filbranden@gmail.com>	2019-08-22 21:07:24 -07:00
sashayakovtseva	afc24792dc	Make get devices function public Signed-off-by: sashayakovtseva <sasha@sylabs.io>	2019-08-15 17:16:47 +03:00
Erik Sipsma	9c822e4847	cgroups/fs: check nil pointers in cgroup manager Signed-off-by: Erik Sipsma <sipsma@amazon.com>	2019-08-14 09:50:45 -07:00
Mrunal Patel	2e94378464	Merge pull request #2094 from sipsma/2093-nodotudev Skip searching /dev/.udev for device nodes.	2019-08-05 10:41:54 -07:00
Erik Sipsma	f08cdaeec9	Skip searching /dev/.udev for device nodes. Closes: #2093 Signed-off-by: Erik Sipsma <sipsma@amazon.com>	2019-07-31 19:41:33 +00:00
Andreas Stocker	808e809f8a	doc: First process in container needs `Init: true` `Init` on the `Process` struct specifies whether the process is the first process in the container. This needs to be set to `true` when running the container. Signed-off-by: Andreas Stocker <astocker@anexia-it.com>	2019-07-29 22:24:28 +02:00
Kurnia D Win	5e0e67d76c	fix permission denied when exec as root and config.Cwd is not owned by root, exec will fail because root doesn't have the caps. So, Chdir should be done before setting the caps. Signed-off-by: Kurnia D Win <kurnia.d.win@gmail.com>	2019-07-18 12:49:36 +07:00
Mrunal Patel	b4a0b1d737	Merge pull request #2065 from odinuge/master Fix cgroup hugetlb size prefix for kB	2019-06-06 12:38:57 -07:00
Kenta Tada	b54fd85bbf	libcontainer: change seccomp test for clone syscall This commit changes the value of seccomp test for clone syscall. Also hardcoded values should be changed because it is unclear to understand what flags are tested. Related issues: * https://github.com/containerd/containerd/pull/3314 * https://github.com/moby/moby/pull/39308 * https://github.com/opencontainers/runtime-tools/pull/694 Signed-off-by: Kenta Tada <Kenta.Tada@sony.com>	2019-06-04 18:52:00 +09:00
Odin Ugedal	6f77e35daf	Export list of HugePageSizeUnits This will allow others to import it instead of copying it. Signed-off-by: Odin Ugedal <odin@ugedal.com>	2019-05-30 20:17:30 +02:00
Odin Ugedal	c6445b1c1c	Add tests for GetHugePageSize Add tests to avoid regressions Signed-off-by: Odin Ugedal <odin@ugedal.com>	2019-05-30 17:27:32 +02:00
Odin Ugedal	273e7b74a7	Fix cgroup hugetlb size prefix for kB The hugetlb cgroup control files (introduced here in 2012: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=abb8206cb0773) use "KB" and not "kB" (https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/mm/hugetlb_cgroup.c?h=v5.0#n349). The behavior in the kernel has not changed since the introduction, and the current code using "kB" will therefore fail on devices with small amounts of ram (see https://github.com/kubernetes/kubernetes/issues/77169) running a kernel with config flag CONFIG_HUGETLBFS=y As seen from the code in "mem_fmt" inside hugetlb_cgroup.c, only "KB", "MB" and "GB" are used, so the others may be removed as well. Here is a real world example of the files inside the "/sys/kernel/mm/hugepages/" directory: - "hugepages-64kB" - "hugepages-2048kB" - "hugepages-32768kB" - "hugepages-1048576kB" And the corresponding cgroup files: - "hugetlb.64KB._____" - "hugetlb.2MB._____" - "hugetlb.32MB._____" - "hugetlb.1GB._____" Signed-off-by: Odin Ugedal <odin@ugedal.com>	2019-05-29 21:52:43 +02:00
Mrunal Patel	5ef781c2e7	Merge pull request #2061 from KentaTada/add-cgroup-namespace-test libcontainer: fix TestGetContainerState to check configs.NEWCGROUP	2019-05-22 16:09:38 -07:00
Qiang Huang	c8337777b6	Merge pull request #2042 from xiaochenshen/rdt-add-missing-destroy libcontainer: intelrdt: add missing destroy handler in defer func	2019-05-21 09:48:00 +08:00
Kenta Tada	65032b55b1	libcontainer: fix TestGetContainerState to check configs.NEWCGROUP This test needs to handle the case of configs.NEWCGROUP as Namespace's type. Signed-off-by: Kenta Tada <Kenta.Tada@sony.com>	2019-05-21 09:10:38 +09:00
Mrunal Patel	2484581dd7	Merge pull request #2035 from cyphar/bindmount-types specconv: always set "type: bind" in case of MS_BIND	2019-05-07 15:47:58 -07:00
Mrunal Patel	a0ecf749ee	Merge pull request #2047 from filbranden/systemd7 Move systemd.Manager initialization into a function in that module	2019-05-07 15:08:41 -07:00
Filipe Brandenburger	46351eb3d1	Move systemd.Manager initialization into a function in that module This will permit us to extend the internals of systemd.Manager to include further information about the system, such as whether cgroupv1, cgroupv2 or both are in effect. Furthermore, it allows a future refactor of moving more of UseSystemd() code into the factory initialization function. Signed-off-by: Filipe Brandenburger <filbranden@gmail.com>	2019-05-01 13:22:19 -07:00
Georgi Sabev	a146081828	Write logs to stderr by default Minor refactoring to use the filePair struct for both init sock and log pipe Co-authored-by: Julia Nedialkova <julianedialkova@hotmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2019-04-24 15:18:14 +03:00
Georgi Sabev	68b4ff5b37	Simplify bail logic & minor nsexec improvements Co-authored-by: Julia Nedialkova <julianedialkova@hotmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2019-04-24 15:16:11 +03:00
Xiaochen Shen	17b37ea3fa	libcontainer: intelrdt: add missing destroy handler in defer func In the exception handling of initProcess.start(), we need to add the missing IntelRdtManager.Destroy() handler in defer func. Signed-off-by: Xiaochen Shen <xiaochen.shen@intel.com>	2019-04-24 16:41:51 +08:00
Georgi Sabev	475aef10f7	Remove redundant log function Bump logrus so that we can use logrus.StandardLogger().Logf instead Co-authored-by: Julia Nedialkova <julianedialkova@hotmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2019-04-22 17:54:55 +03:00
Georgi Sabev	ba3cabf932	Improve nsexec logging * Simplify logging function * Logs contain __FUNCTION__:__LINE__ * Bail uses write_log Co-authored-by: Julia Nedialkova <julianedialkova@hotmail.com> Co-authored-by: Danail Branekov <danailster@gmail.com> Signed-off-by: Georgi Sabev <georgethebeatle@gmail.com>	2019-04-22 17:53:52 +03:00
Aleksa Sarai	8296826da5	specconv: always set "type: bind" in case of MS_BIND We discovered in umoci that setting a dummy type of "none" would result in file-based bind-mounts no longer working properly, which is caused by a restriction for when specconv will change the device type to "bind" to work around rootfs_linux.go's ... issues. However, bind-mounts don't have a type (and Linux will ignore any type specifier you give it) because the type is copied from the source of the bind-mount. So we should always overwrite it to avoid user confusion. Signed-off-by: Aleksa Sarai <asarai@suse.de>	2019-04-08 15:08:08 +10:00
Danail Branekov	c486e3c406	Address comments in PR 1861 Refactor configuring logging into a reusable component so that it can be nicely used in both main() and init process init() Co-authored-by: Georgi Sabev <georgethebeatle@gmail.com> Co-authored-by: Giuseppe Capizzi <gcapizzi@pivotal.io> Co-authored-by: Claudia Beresford <cberesford@pivotal.io> Signed-off-by: Danail Branekov <danailster@gmail.com>	2019-04-04 14:57:28 +03:00
Marco Vedovati	feebfac358	Remove pipe close before exec. Pipe close before exec is not necessary as os.Pipe() is calling pipe2 with O_CLOEXEC option. Signed-off-by: Marco Vedovati <mvedovati@suse.com>	2019-04-04 14:53:30 +03:00
Marco Vedovati	9a599f62fb	Support for logging from children processes Add support for children processes logging (including nsexec). A pipe is used to send logs from children to parent in JSON. The JSON format used is the same used by logrus JSON formatted, i.e. children process can use standard logrus APIs. Signed-off-by: Marco Vedovati <mvedovati@suse.com>	2019-04-04 14:53:23 +03:00
Michael Crosby	11fc498ffa	Merge pull request #2023 from LittleLightLittleFire/2022-fix-runc-zombie-process-regression Fixes regression causing zombie runc:[1:CHILD] processes	2019-03-22 14:06:31 -04:00
Mrunal Patel	dd22a84864	Merge pull request #2012 from rhatdan/selinux Need to setup labeling of kernel keyrings.	2019-03-20 21:17:18 -07:00
Alex Fang	eab5330908	Fixes regression causing zombie runc:[1:CHILD] processes Whenever processes are spawned using nsexec, a zombie runc:[1:CHILD] process will always be created and will need to be reaped by the parent Signed-off-by: Alex Fang <littlelightlittlefire@gmail.com>	2019-03-21 13:43:38 +11:00

1 2 3 4 5 ...

1378 Commits