If you run "guix pull" today, you get a package graph...

If you run "guix pull" today, you get a package graph of more than 22,000 nodes rooted in a 357-byte program---something that had never been achieved, to our knowledge, since the birth of Unix: a Full-Source Bootstrap.

#GnuMes
#bootstrappable
#BootstrappableBuilds
#ReproducibleBuilds
@fsf
@fsfe

Like 1 26 Apr 2023 at 13:50 | Open on todon.nl

74 comments

dave

@janneke holy shit I didn't know that this much progress had been made in bootstrapping. I thought we were still maybe years away from full source bootstrap. incredible work!

26 Apr 2023 at 13:54 | Open on toot.cat

dave

@janneke does this work for all architectures that guix supports or a subset?

26 Apr 2023 at 14:05 | Open on toot.cat

Janneke

@dthompson This is currently only i686-linux and x86_64-linux.

Work has been ongoing for ARMv7 (and AArch64) for quite some time now, but is stalled, probably until we have RISC-V.

26 Apr 2023 at 14:17 | Open on todon.nl

dave

@janneke thanks! just saw the blog post which answers the question, too. I guess I just needed to wait a few minutes ;)

26 Apr 2023 at 14:18 | Open on toot.cat

J. Ryan Stinnett

@dthompson @janneke Which blog post is this? I’d like to read more about this work. 🙂

26 Apr 2023 at 14:34 | Open on merveilles.town

J. Ryan Stinnett

@dthompson @janneke Ah I suppose it must be https://guix.gnu.org/blog/2023/the-full-source-bootstrap-building-from-source-all-the-way-down/ 😄

26 Apr 2023 at 16:44 | Open on merveilles.town

Janneke

@fsf @fsfe
And here's the blog post:

https://guix.gnu.org/en/blog/2023/the-full-source-bootstrap-building-from-source-all-the-way-down/

26 Apr 2023 at 14:16 | Open on todon.nl

Sergey Bugaev

@janneke not to underpaint the importance and coolness of this achievement, here's an uninformed question that you probably get a lot: how does this work wrt to depending on a Linux kernel (which is tons of C), some basic userland (or can it run as PID 1-and-only?), and x86 hardware (which... who knows what it does) to run this 357 byte binary?

If you can't trust a compiler to build your program correctly, why can you trust a kernel and some hardware to run your binary correctly?

26 Apr 2023 at 15:15 | Open on floss.social

Andrius Štikonas

@bugaevc @janneke https://github.com/fosslinux/live-bootstrap project has some initial code to bootstrap Linux. It can build Linux but we still need to kexec into it (which shouldn't be too hard).

26 Apr 2023 at 15:22 | Open on fosstodon.org

Janneke

@bugaevc
Good question! Of course: you can't.

There is currently no good answer to that other than that we chose to start on getting rid of the obviously unnecessary and "easy" binary seeds first. Or: different people have different interests and competences, if we start then eventually we'll probably get there someday. There are some ideas, though.

The least elegant but easiest "solution" would be to revert to Diverse Double Compliing (DDC, https://dwheeler.com/trusting-trust/). The low level tools (stage0, m2-planet, and mes) can easily do cross builds. You could build on different architectures, and kernels if you like and compare package checksums.

We did something like this for Mes (all x86_64-linux, though) at the fifth reproducible builds conference (RB-V, https://guix.gnu.org/en/blog/2019/reproducible-builds-summit-5th-edition/)

Running as PID 1: During the same RB-V conference, Ludovic Courtès prototyped building a Guix package in the initial ramdisk. After the build the package is discarded, but before that its checksum is printed and can be checked with a build under GNU/Linux.

People have been working to build tiny kernels, such as: https://github.com/ironmeld/boot2now.

Also, Stage0 was designed to also run on the Knight VM, one could imagine running that on simpler hardware, or running the VM on different machines/architectures, dunno.

@bugaevc
Good question! Of course: you can't.

Expand text...

26 Apr 2023 at 15:37 | Open on todon.nl

theruran 🌐🏴

@janneke @bugaevc The folks in #bootstrappable @liberachat are working towards resolving those questions. A POSIX kernel capable of building Linux, and a bootstrap from UEFI are some projects off the top of my head.

They want to get to a FPGA softcore bootstrap, then a manually constructed CPU in TTL to bootstrap from.

But yeah, there are many parts to work on that would improve our (collective) situation, such as bootstrapping GHC: @nomeata https://mastodon.online/@nomeata/110263917613134533

26 Apr 2023 at 16:15 | Open on hackers.town

Sergey Bugaev

@theruran @janneke I was thinking something along these lines:

find an "open source hardware" board where you can somehow verify the hardware aren't playing games on you (in particular not running all of your code in a nearly undetectable hypervisor, like we know Intel does...), probably some RISC-V board

26 Apr 2023 at 17:40 | Open on floss.social

Sergey Bugaev

@theruran @janneke

run you bootstrapping code on it with no OS whatsoever; hopefully it doesn't need much from the OS

you'd have to build in a serial driver or something like that (blinking LEDs is cool but you can't input program source this way), not that I have any idea about hardware

26 Apr 2023 at 17:41 | Open on floss.social

theruran 🌐🏴

@bugaevc @janneke the #Monster6502 was mentioned as a possibility if not inspiration:

https://monster6502.com/

26 Apr 2023 at 17:43 | Open on hackers.town

theruran 🌐🏴

@bugaevc @janneke and #GNUHurd could be another approach, right? it can host GCC to build Linux already?

26 Apr 2023 at 17:46 | Open on hackers.town

Sergey Bugaev

@theruran @janneke the Hurd surely can run GCC and cross-compile Linux; but I'm not sure you would be winning much, for two reasons:

1. It's nowhere near as trivial to do "syscalls" as on Linux — on Linux you place some values into some registers and perform "int 0x80" or "syscall", and that's it, you've called write or exit. On the Hurd, these all are implemented in glibc on top of Mach IPC, and that needs quite a lot of code to happen.

26 Apr 2023 at 19:14 | Open on floss.social

Sergey Bugaev

@theruran @janneke Here's a project of mine where I simply print "Hello world" without relying on glibc: https://github.com/bugaevc/hello-hurd — but that too is written in C, imagine writing it all in hex.

2. Linux is huge, but you can build it in a minimal configuration (see https://tiny.wiki.kernel.org/). Mach may be a microkernel, but it's minimal in functionality, not size. In fact it's a meme in the microkernel community just how large for a microkernel Mach is. But I don't have any numbers to quantify this.

26 Apr 2023 at 19:17 | Open on floss.social

Ludovic Courtès

@bugaevc Speaking of the role of the kernel, an interesting question is how to implement isolated builds on the #Hurd—see “Isolated build environments” at https://guix.gnu.org/en/blog/2020/childhurds-and-substitutes/ for an overview.

I’m curious what you think of this!