Lennart Poettering

1️⃣3️⃣ Here's the 13th installment of posts highlighting key new features of the upcoming v256 release of systemd.

ssh is widely established as *the* mechanism for controlling Linux systems remotely, both interactively and with automated tools. It not only provides means for secure authentication and communication for a tty/shell, but also does this for file transfers (sftp), and IPC communication (D-Bus or Varlink).

Like 9 May at 12:49 | Open on mastodon.social

Lennart Poettering

It relies on TCP as network transport, which is great for remote operation around the globe but really sucks for local communication with a VM and similar, as it usually requires delegation of an address space, dhcp lease, dns and so on, which while manageable are certainly a major source of mistakes, fragility and headaches. In particular it means that logging into a system to debug networking doesnt really work since without working networking you cant even log in. Sad!

9 May at 12:50 | Open on mastodon.social

Show 3 replies

Lennart Poettering

7️⃣ Here's the 7th installment of my series of posts highlighting key new features of the upcoming v256 release of systemd.

In systemd we put a lot of focus on operating with disk images, specifically file system images that carry an expressive GPT partition table – something that we call DDIs ("Discoverable Disk Images").

Like 1 May at 6:03 | Open on mastodon.social

Lennart Poettering

DDIs are supposed to carry dm-verity authentication information, i.e. every single access to them is typically cryptographically protected, and linked back to a set of signing keys maintained by the system (ideally in the kernel keyring). systemd uses DDIs for the system itself, for systemd-nspawn containers, for systemd portable services, for systemd-sysext system extensions, for systemd-confext configuration extensions and more.

1 May at 6:03 | Open on mastodon.social

Show 6 replies

Lennart Poettering

5️⃣ Here's the 5th installment of my series of posts highlighting key new features of the upcoming v256 release of systemd.

I am pretty sure all of you are well aware of the venerable "sudo" tool that is a key component of most Linux distributions since a long time. At the surface it's a tool that allows an unprivileged user to acquire privileges temporarily, from within their existing login sessions, for just one command, or maybe for a subshell.

"sudo" is very very useful, as it…

Like 29 April at 7:27 | Open on mastodon.social

Show previous comments

Sebastian Wick

@pid_eins suid programs executing in the environment of the parent process means that I might become root in a user namespace and get the filesystem view of the current mount ns. This won't work with your approach, will it?

30 April at 20:48 | Open on fosstodon.org

Show 3 replies

Matthias Klumpp

@pid_eins This is very nice - especially having it provide a clean context, that saves a ton of headaches and sanitization I need to do when using "sudo" in other programs! (not like that's an amazing thing anyway, but occasionally it's useful and justified)
Only having run0 coloring / changing the output of the called command is not something I always like, but maybe that can be optionally disabled...

30 April at 21:19 | Open on mastodon.social

Matthew Miller

@pid_eins

Is there an equivalent to "sudo -e"?

1 May at 10:51 | Open on hachyderm.io

Show 5 replies

Lennart Poettering

This is such a bad bad API compat breakage:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=e81cd5a983bb35dabd38ee472cf3fea1c63e0f23

It's used all over the place in userspace. In systemd we use it:

1. to detect if a block device has partition scanning off or on
2. In our udev test suite, to validate devices are in order
3. udev rules use it for some feature checks (in older versions of systemd).

And it's even a frickin documented userspace API:

https://www.kernel.org/doc/html/v5.5/block/capability.html

So much about that nonsensical "we don't break userspace" kernel mantra.

This is such a bad bad API compat breakage:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=e81cd5a983bb35dabd38ee472cf3fea1c63e0f23

It's used all over the place in userspace. In systemd we use it:

1. to detect if a block device has partition scanning off or on
2. In our udev test suite, to validate devices are in order
3. udev rules use it for some feature checks (in older versions of systemd).

Expand text...

Like 8 April at 14:37 | Open on mastodon.social

Lennart Poettering

Anyone knows where the kernel's github/gitlab project is? Would love to file an issue or placeholder revert PR, but somehow I cannot find it! Anyone?

(Yes, this is a joke, I am fully aware of the concept of mailing lists – as a historical concept from the 2005 era... Yes, I am too lazy to figuring out how to report this properly. Hence social media it is.)

8 April at 14:52 | Open on mastodon.social

Show 1 reply

Lennart Poettering

Credit where credit is due! I'd really like to take a minute and thank Jia Tan how they helped us to finally get sd_notify() support merged into OpenSSH upstream!

https://bugzilla.mindrot.org/show_bug.cgi?id=2641

Thank you, Jia, you rock!

Like 3 April at 8:07 | Open on mastodon.social

Show previous comments

Ikey Doherty

@pid_eins don't forget, next time glibc enables a CVE it'll also be systemds fault for being linked to it :rofl:

Glad to see sane concepts landing anyway ^^

3 April at 9:04 | Open on fosstodon.org

jakob reading solarpunk

@pid_eins I'm sure they're going to make employee of the month!

3 April at 9:29 | Open on mastodon.social

Neko :verified: :verified:

@pid_eins LOL

3 April at 9:36 | Open on mastodon.cloud

Lennart Poettering

PSA: In context of the xzpocalypse we now added an example reimplementation of sd_notify() to our man page:

https://www.freedesktop.org/software/systemd/man/devel/sd_notify.html#Notes

It's pretty comprehensive (i.e. uses it for reload notification too), but still relatively short.

In the past, I have been telling anyone who wanted to listen that if all you want is sd_notify() then don't bother linking to libsystemd, since the protocol is stable and should be considered the API, not our C wrapper around it. After all, the protocol is so trivial

PSA: In context of the xzpocalypse we now added an example reimplementation of sd_notify() to our man page:

https://www.freedesktop.org/software/systemd/man/devel/sd_notify.html#Notes

It's pretty comprehensive (i.e. uses it for reload notification too), but still relatively short.

In the past, I have been telling anyone who wanted to listen that if all you want is sd_notify() then don't bother linking to libsystemd, since the protocol is stable and should be considered the API, not our C wrapper...

Expand text...

Like 2 April at 16:59 | Open on mastodon.social

Lennart Poettering

that one can explain it in one sentence: send an AF_UNIX datagram containing READY=1 to a socket whose path you find in the $NOTIFY_SOCKET env var.

But apparently turning that sentence (which appears in similar fashion in the man page) into code is not trivial, hence this new example code.

Hence, copy away, the thing is MIT licensed. And the protocol has been stable for a decade, and I am pretty sure it's going to remain stable for another decade at least.

2 April at 17:01 | Open on mastodon.social

Show 7 replies

Lennart Poettering

Quiz: How many inode types are there on Linux?

You might think the answer to this is 7, i.e. regular files, directories, symlinks, block device nodes, char device nodes, fifos, and sockets. But you are actually are wrong: there's an 8th one. There's the concept of an anonymous inode on Linux which has the file type of zero. You can easily acquire fds to inodes of this type via eventfd(). If you call fstat() on such fds, then (.st_mode & S_IFMT) == 0 will hold. 🤯

Like 12 March at 9:35 | Open on mastodon.social

Lennart Poettering

And I am pretty sure there's a lot of software you might be able to break given that they do not expect this case on the most basic of fs concepts.

Also note that these anonymous inodes are not actually as anonymous as one might think: because open fds appear in /proc/self/fd/ as magic symlinks you can easily get am fs path when you call stat() on will return you a zero inode type.

Double 🤯🤯

12 March at 9:37 | Open on mastodon.social

Show 4 replies

HAMMER SMASHED FILESYSTEM 🇺🇦

@pid_eins i can assure you i didn't think it's 7

12 March at 9:54 | Open on metalhead.club

Lennart Poettering

I blogged. Or well, I actually didn't. I just posted a guest post by @daandemeyer on my blog, about the excellent developments in mkosi land:

https://0pointer.net/blog/a-re-introduction-to-mkosi-a-tool-for-generating-os-images.html

Enjoy!

Like 23 January at 17:36 | Open on mastodon.social

Lennart Poettering

Here's another little feature we scheduled for the next systemd release. Everyone knows SSH well, and it's great to connect to hosts remotely, and even do file transfer. It's probably *the* single most relevant way to talk to some host for administration and various other tasks. It's a bit fragile though: it requires networking, and that even if we talk to a local VM or full OS container. But precisely networking is one of the things you might want to administer via SSH, hence you have a cyclic…

Like 9 January at 10:05 | Open on mastodon.social

Lennart Poettering

…and risky dependency. But for the VM and full OS container case there's no real need to use SSH via the network: these things run on the local system, hence why bother with IP? To address that we are adding a small generator (that means: a plugin for systemd that generates units on the fly, based on system state, configuration) which binds SSH to a local AF_VSOCK socket in a VM, and to an AF_UNIX socket in a container. You can then use these to directly connect to the system without involving…

9 January at 10:07 | Open on mastodon.social

Show 4 replies

Lennart Poettering

I recently implemented a fun little feature for systemd: inspired by MacOS' "target disk mode", a tiny tool called systemd-storagetm, that exposes all local block devices as NVMe-TCP devices, as they pop up. The idea is that if available in your initrd you can just boot into that (instead of into your full OS), and can access your disks via NVMe-TCP (in case you wonder what that is: it's the new hot shit for exposing block devices over the network, kinda like iSCSI, NBD, …, but cool).

Like 30 Oct 2023 at 13:01 | Open on mastodon.social

Lennart Poettering

Link is here: https://github.com/systemd/systemd/pull/29748

30 Oct 2023 at 13:01 | Open on mastodon.social

Show 5 replies

Eric Curtin

@pid_eins super cool. NVMe-oF ftw! I'm actually looking for a way of trimming down some of the dependencies in systemd-udevd and udevadm to make it even smaller for an super small inirtd. I only want systemd-udevd to initialize local storage devices for my used case. The Fedora systemd-udevd is dynamically linked to a large systemd .so is there a way of building it against smaller systemd libs like libudev etc.?

30 Oct 2023 at 13:22 | Open on social.treehouse.systems

Lennart Poettering

LWN just posted @bluca's summary of the image-base Linux summit in Berlin. Enjoy:

https://lwn.net/SubscriberLink/946526/a1c7bb28c62c9667/

Like 16 Oct 2023 at 15:26 | Open on mastodon.social

Lennart Poettering

We recently added a new document to the systemd website focussing on one specific facet of the service manager: the fdstore. A concept that people should really use more to facilitate "seamless" service restarts and various other things. Please have a look:

https://systemd.io/FILE_DESCRIPTOR_STORE/

Like 19 Sep 2023 at 16:42 | Open on mastodon.social

Lennart Poettering

Here's a fun new feature we are working on in systemd: userspace-only reboot. In order to reduce grey-out times on image-based OS updates to next to nothing we are making a reboot happen where kernel stays as it is, but userspace shuts down as usual, then possibly transitions into a new rootfs, and starts up again with an initial transaction as it would on a classic system boot. During the transition selected services can pass along their fds and listening sockets, to pass "live" resources…

Like 27 Apr 2023 at 21:03 | Open on mastodon.social

Lennart Poettering

…from the old system to the new system. This means: super-fast switching from one OS version to the next, with all service code restarted cleanly and comprehensively, but with selected resources passed through untouched, so that they can continue to operate. And it wasn't even that hard to implement: https://github.com/systemd/systemd/pull/27435

Or in other words: let's not wait for hardware, firmware, boot loader, kernel, initrd to reinitialize on a reboot, let's just focus on userspace alone.

27 Apr 2023 at 21:05 | Open on mastodon.social

Show 9 replies

Lennart Poettering

So, we are now living in a world where people generate feature requests based on random ideas AI might have? → https://github.com/systemd/systemd/issues/26212

And I thought AI was supposed to reduce the workload on humans, not add more on top!

Like 26 Jan 2023 at 10:36 | Open on mastodon.social

Show previous comments

翠星石

@pid_eins >random ideas AI might have?
The "AI" didn't have the idea, it's just regurgitating a systemd related idea someone wrote about in the text dataset.

The "AI" things are solely dedicated to making things even more proprietary, so more workload on humans pretty much.

Welcome to the *fed*iverse Lennart, we have free software and ですぅ。

@pid_eins >random ideas AI might have?
The "AI" didn't have the idea, it's just regurgitating a systemd related idea someone wrote about in the text dataset.

Expand text...

26 Jan 2023 at 12:12 | Open on freesoftwareextremist.com

tusooa :Cat_girls_Emoji_004: 西风

@pid_eins
>*he* suggested
no wonder it's bad.

26 Jan 2023 at 12:12 | Open on kazv.moe

PrivateGER :owo:

@pid_eins@mastodon.social I especially love how chatgpt just made up some random shit because doing it properly would apparently have been too much work

26 Jan 2023 at 12:12 | Open on plasmatrap.com

Lennart Poettering

PSA for C devs: if your library exposes a function that takes a pointer, and you add a "const" to that pointer later on, then yes, that's an API break. Why? Because the prototype of the function changed enough so that anyone taking a pointer of your function won't be able to assign it to the variable they intend to store it in. Yes, C API compat is hard. (libbpf, I am looking at you 👀👀👀, btw)

Like 16 Nov 2022 at 15:33 | Open on mastodon.social

Show previous comments

Marc-Antoine Perennou

@pid_eins is it considered less critical if it’s provided with a typedef of the fn signature and the typedef gets updated too?

16 Nov 2022 at 15:45 | Open on mastodon.social

Show 1 reply

Toke Høiland-Jørgensen

@pid_eins Well, libbpf has always had an... interesting... approach to backwards compatibility. It's supposed to be better going forward, now that it's reached v1.0. I guess time will tell...

16 Nov 2022 at 22:10 | Open on social.kernel.org

Simon Ser

@pid_eins I've just done that with the Pixman API, and honestly, I don't care. Use an explicit cast if you do.

17 Nov 2022 at 11:39 | Open on octodon.social