Advent of Code platform · show and tell

The platform statically allocates memory on startup. While Roc is running, there are no allocations. As default, it allocates one GiB of memory, but other sizes can be used via the argument --memory. The platform does not deallocate any memory. Last year, I activated deallocations on day 17, since this was necessary for my solution. It is possible, that I will implement it as well this year. But it will be optional.

I think that the platform is a good example for a zig-platform, since it has a working build.zig file, that creates all files necessary for the surgical linker and the legacy linker for linux and mac (I don't have a mac, so I could only test it with linux, but I should work). The platform does not use C-malloc and fiends, but a zig allocator (but without deallocate at the moment). As soon as expect works like dbg and does not need shm_open, mmap and getppid anymore, it should be possible to use the platform without libc.

Jasper Woudenberg (Nov 10 2024 at 14:37):

Perfect timing! I was just talking to a friend yesterday who's considering doing AoC in Roc this year, just sent them the link to this.

Also excited to take a look at how you did the Zig bits. I'm working on a zig platform on Linux and never got the surgical linker working. Also wanted to take a look at how I might run allocation through a Zig allocator but never felt sure enough of what I was doing to try it out, so will definitly check out how you did it!

Jasper Woudenberg (Nov 11 2024 at 21:54):

I didn't get the surgical linking to work unfortunately, keep running into an error. I think it's a better error then the one I had before though: previously when using the surgical linker I got a crash when zig host code allocated memory. Now I get a nice error linking to a Roc issue about surgical linking not support absolute relocations. Either way I can use the legacy linker to work around it for now.

I'm liking working with the Zig build system! Wondering if we should set up a Zig package for Roc platform development. It could include roc.Build namespace with Roc-specific build steps, a roc.std for the roc standard library integrations, maybe some comptime helpers for calling host-exposed functions, and a roc.mem with your allocator-for-Roc code.

Luke Boswell (Nov 11 2024 at 22:01):

Luke Boswell (Nov 11 2024 at 22:02):

Jasper Woudenberg (Nov 11 2024 at 22:29):

I was aware of your zig platform template, it was a great help getting started with my platform work!

Yeah, I meant a package, as in something you could add to the build.zig.zon file of a zig platform to get all the batteries for roc platform development.

Luke Boswell (Nov 11 2024 at 22:39):

Luke Boswell (Nov 11 2024 at 22:40):

I'm hoping we can land this rebuild host PR soon (like very soon), and then I can merge the latest into that branch and it should be pretty close.

Luke Boswell (Nov 11 2024 at 22:55):

It's something we kind of need in the repository for that upgrade. My current workaround, is to copy the builtins into each of the test platforms - once at the start of running the tests. I experimented with a package, but that was deviating a bit from the builtins.

I'm starting to think it might be a good idea to make the builtins depend on a roc_std zig package that lives in the repo, and the test platform also use that, and then also anyone else who is making a zig platform.

Luke Boswell (Nov 11 2024 at 22:59):

@Brendan Hansknecht does it sound like a terrible idea to have the builtins and zig test platform use a common zig package for the primitives? Could we separate out all the extra builtin stuff easily enough?

Jasper Woudenberg (Nov 11 2024 at 23:06):

One question: could it lead to versioning issues if zig platforms use one version of the standard library and the applications people write another?

Luke Boswell (Nov 11 2024 at 23:12):

Brendan Hansknecht (Nov 11 2024 at 23:59):

I think it would be great for there to be a zig package for all the builtin types.

Brendan Hansknecht (Nov 11 2024 at 23:59):

Brendan Hansknecht (Nov 12 2024 at 00:00):

Yes it could. Your platform version is tied to a range of roc versions. Outside of those versions, it would interact incorrectly and break

Oskar Hahn (Nov 13 2024 at 21:13):

I implemented deallocations. The current default is not to deallocate. Without the argument --deallocate, roc_dealloc is a noop.

I tested the platform on some of my AoC2023 solutions. I am surprised, that there is no real performance boost if deallocations are skipped. In many cases, the solution was even a bit faster with deallocations.

teskje (Nov 13 2024 at 21:32):

Could it be that the os needs to page in less memory when you are using a smaller part of the backing buffer? Like, when you deallocate and then allocate again, you can reuse already paged-in memory whereas when you don't deallocate each allocation might have to page in new memory

Brendan Hansknecht (Nov 13 2024 at 21:39):

Cause allocators may start doing smart things but fall back to slower paths if nothing is being freed

Brendan Hansknecht (Nov 13 2024 at 21:40):

Also, you still have the full cost of recounting in roc. The recounting will cost much more than the dealloc generally

Brendan Hansknecht (Nov 13 2024 at 21:40):

If you preallocated an arena and paged in roughly the memory needed, you should see some gains

Oskar Hahn (Nov 13 2024 at 21:47):

So I would think, that the os creates all the pages anyway. Or is it smart enough only to give the memory, when it is actually used?

Oskar Hahn (Nov 13 2024 at 21:48):

I don't think that the roc refcounting could change anything, since roc does not know that roc_dealloc is a noop.

Oskar Hahn (Nov 13 2024 at 21:51):

I would have guessed, that it could be CPU cache locality. Less memory could mean less cache misses.

The difference was not so big. Maybe with a better benchmark I would get other numbers. I am currently only measuring the time that roc__solutionForHost_1_exposed_generic needs.

Brendan Hansknecht (Nov 13 2024 at 22:02):

Brendan Hansknecht (Nov 13 2024 at 22:03):

We technically could give platforms control over this setting. The compiler currently has an internal config to change recounting to a no-op. That said, with recounting as a no-op, it also breaks uniqueness analysis...so that may actually be even worse for perf.

Oskar Hahn (Nov 13 2024 at 22:13):

The difference is really small, so I could be mistaken. But it seems, that when I set the hole memory buffer with @memset(buffer, 'x');, then without deallocations is either faster or the same.

Oskar Hahn (Nov 13 2024 at 22:37):

You are right. I tested the difference between preallocated memory and using the zig GeneralPurposeAllocator:

Brendan Hansknecht (Nov 14 2024 at 00:03):

Wow, deallocation costs way more than I expected. I wonder how much of that is simply that zig's GPA is pretty bad currently vs would happen with other allocators.

Isaac Van Doren (Nov 14 2024 at 00:05):

Brendan Hansknecht (Nov 14 2024 at 00:31):

Oskar Hahn (Nov 14 2024 at 07:46):

I did more testing. It seems, there are many factors that drastically change the numbers. For example, in the middle of my tests, my laptop battery was empty and I plugged in the power cable. This probably disabled some power safe mode in the CPU and the outcome was twice as fast. The following numbers are all with a power cable. I only included one without the cable for comparison. All the tests are with my AoC2023 solution for day 1 part 2. Maybe other solutions would allocate and free memory in a different way.

And please keep in mind, that I have no experience with benchmarks. I probably did many mistakes (for example running the benchmark directly after running zig build with a hot CPU. If you want to use the numbers, you should rerun them yourself.

Here are my numbers

GPA zig-debug roc-no-optimize deallocate without Power Cabel
2.884s

GPA zig-debug roc-no-optimize deallocate with Power Cabel
1.727s

GPA zig-debug roc-no-optimize no-dealloc
261.733ms

GPA zig-debug roc-optimize deallocate
1.623s

GPA zig-debug roc-optimize no-dealloc
225.813ms

GPA zig-ReleaseSafe roc-no-optimize deallocate
922.863ms

GPA zig-ReleaseSafe roc-no-optimize no-dealloc
18.917ms

GPA zig-ReleaseSafe roc-optimize deallocate
922.98ms

GPA zig-ReleaseSafe roc-optimize no-dealloc
4.586ms

GPA zig-ReleaseFast roc-no-optimize deallocate
905.64ms

GPA zig-ReleaseFast roc-no-optimize no-dealloc
6.787ms

GPA zig-ReleaseFast roc-optimize deallocate
918.644ms

GPA zig-ReleaseFast roc-optimize no-dealloc
4.849ms

c-malloc zig-Debug roc-optimize deallocate
5.078ms

c-malloc zig-Debug roc-optimize no-dealloc
2.451ms

c-malloc zig-ReleaseSafe roc-optimize deallocate
4.758ms

c-malloc zig-ReleaseSafe roc-optimize no-dealloc
2.589ms

c-malloc zig-ReleaseFast roc-optimize deallocate
4.758ms

c-malloc zig-ReleaseFast roc-optimize no-dealloc
2.576ms

fixed-buffer-allocater zig-Debug roc-optimize deallocate
8.057ms

fixed-buffer-allocater zig-Debug roc-optimize no-dealloc
6.31ms

fixed-buffer-allocater zig-ReleaseSafe roc-optimize deallocate
1.829ms

fixed-buffer-allocater zig-ReleaseSafe roc-optimize no-dealloc
1.668ms

fixed-buffer-allocater zig-ReleaseFast roc-optimize deallocate
1.503ms

fixed-buffer-allocater zig-ReleaseFast roc-optimize no-dealloc
1.536ms

jdz_allocator zig-Debug roc-optimize deallocate
25.22ms

jdz_allocator zig-Debug roc-optimize no-dealloc
8.521ms

jdz_allocator zig-ReleaseSafe roc-optimize deallocate
2.671ms

jdz_allocator zig-ReleaseSafe roc-optimize no-dealloc
2.336ms

jdz_allocator zig-ReleaseFast roc-optimize deallocate
2.503ms

jdz_allocator zig-ReleaseFast roc-optimize no-dealloc
2.216ms

Brendan Hansknecht (Nov 14 2024 at 08:10):

Brendan Hansknecht (Nov 14 2024 at 08:11):

There are plans to make gpa performant. I know there is a larger tracking issue on the zig GitHub, but it really is a very primitive allocator currently that is not very fast. Some people suggested renaming it to debug allocator for now, but they decided not to cause the plan is to improve it until it is a good general purpose allocator. But in zig today gpa is general not great to use except for debugging

Oskar Hahn (Nov 16 2024 at 13:01):

I am currently thinking about the API of the platform. The current version is [Part1, Part2] -> List U8. For this to work, you have to embed the input file into your roc program.

What I am thinking about is to change this to two functions: part1 : Str -> List U8 and part2 : Str -> List U8. So the platform would provide the puzzle input to the Roc script.

The platform could either look for an input file next to the Roc script or read from stdin. If you read the FAQ Can I copy/redistribute part of Advent of Code?, then reading from stdin sounds like the only allowed option if you want to publish your solutions in a repo.

What do you think? If you want to use the platform for AoC, what would be your preference?

Anton (Nov 16 2024 at 13:13):

I'd definitely prefer to read from an input file, you could set up an example repo for cloning that already has a gitignore for input files so they don't get uploaded.

Brendan Hansknecht (Nov 16 2024 at 17:08):

This is where the fancy solution would be to write a script that steals cookies from the browser and auto downloads the file to an ignored folder. Would open up the login page if the cookies can't be found.

Oskar Hahn (Nov 16 2024 at 17:18):

Brendan Hansknecht (Nov 16 2024 at 17:23):

Yeah, yt-dlp made me think of it. They steal cookies from browser to enable downloading videos from patreon among other protected sites

Matt Harden (Nov 28 2024 at 21:03):

Total Roc beginner here. What's the reason for creating a platform for AoC? Why not use one of the existing "standard" platforms like basic-cli? Is it because you also want to experiment with Roc platforms while you develop for AoC?

Brendan Hansknecht (Nov 28 2024 at 21:05):

Just gives a really simple and tailored experience. Could easily do AOC with the basic-cli platform.

Matt Harden (Nov 28 2024 at 21:08):

Is it possible to wrap one platform in another? What if I wanted to use all the code in basic-cli but replace main.roc with my own? Would I have to fork it or can I wrap it somehow?

Oskar Hahn (Nov 28 2024 at 21:21):

I think you can replace the term platform with the term framework in other languages.

For example in python, you can use Django to create a webpage with one input element. For you use a much simpler framework.

I would say, basic-cli is like Django. You can do many things with it. But for AoC, you just need to transform an input string.

I like simple solutions. And a small AoC framework/platform seems much simpler then basic-cli.

Matt Harden (Nov 28 2024 at 21:23):

Ah, I see the motivation then. Thanks. Are there things in basic-cli that go beyond Roc builtins, or does your platform drop parts of the builtin stuff for simplicity's sake?

Matt Harden (Nov 28 2024 at 21:34):

Never mind; I see all the extra complexity in basic-cli. The Roc platform concept is kind of interesting / new to me; I'm used to all of the things in basic-cli just being always available but I like the idea of a platform being as simple as possible for the purpose.

Luke Boswell (Nov 28 2024 at 21:42):

If you're interested I've also got a template for AoC I shared. It's using basic-cli and is just a roc package.

Technically you could use any platform that provides the required effects to pass in a module parameters to the package, but realistically basic-cli is probably the easiest rn.

Matt Harden (Nov 28 2024 at 21:43):

Matt Harden (Nov 28 2024 at 21:52):

I think I'll try my hand at a super simple Go based platform. I like the idea of a solution "part" being just a function from a string (or maybe list of strings) to another string. The platform can handle reading the input and writing the output.

Oskar Hahn (Dec 01 2024 at 17:51):

I changed the signature of the platform. Now, it requires the functions part1 : Str -> Result Str _ and part2 : Str -> Result Str _

To work with List U8 was not so nice in the tests, since the values are not shown as string, but as a List of ascii codes.

Oskar Hahn (Dec 06 2024 at 14:45):

I found out today, that the zig FixedBufferAllocator does not support deallocation. It can free the last allocation, but not anything before that.

Here is the zig issue. I also asked in there help discord channel. It seems, that there is no FixBufferAllocator, that can deallocate.

Anthony Bullard (Dec 06 2024 at 14:50):

That seems strange, I thought that TigerBeetle used a FixedBufferAllocator for the entire lifetime of the database

Anthony Bullard (Dec 06 2024 at 14:51):

But I guess what they actually say is they _never allocate_ while the database is running, after setup

Anthony Bullard (Dec 06 2024 at 14:51):

So maybe they do, but then initialize all the structs within that layout and never worry about deallocation from then on

Oskar Hahn (Dec 06 2024 at 14:51):

Anthony Bullard (Dec 06 2024 at 14:52):

I think there DB just has a very specific structure where they can precisely lay out everything from the beginning

Anthony Bullard (Dec 06 2024 at 14:52):

And they use asserts EVERYWHERE to make sure none of their invariants are broken during development

Anthony Bullard (Dec 06 2024 at 14:53):

Brendan Hansknecht (Dec 06 2024 at 16:34):

Brendan Hansknecht (Dec 06 2024 at 16:35):

Brendan Hansknecht (Dec 06 2024 at 16:36):

@Oskar Hahn what is your goal for the allocator? Why use a fixed buffer in the first place?

Oskar Hahn (Dec 07 2024 at 08:01):

I thought, it would be faster. And for advent of code, it seemed like a good trade of, to allocate one big chunk of memory at the beginning

Brendan Hansknecht (Dec 07 2024 at 08:44):

That's fair. Depending on the exact goals, something like the allocator for roc wasm4 may work. It is written in zig and very light weight. Could give it a chunk of memory and it will still properly free. The allocator it's a copy of was specifically designed to be light weight for embedded.

Oskar Hahn (Dec 08 2024 at 15:23):

There could be pools specific sizes, like 100 bytes, 100 kb etc. For big values, it could fall back to a "normal" allocator.

Of cause, this is a lot of wasted space, but this would work with a preallocated memory.

Since the memory is big enough, the wast should not be a problem. I also don't think that it is a CPU-cache problem, since each memory is packed. Only the space between to allocations is bigger then needed. But there is no guaranty anyway, that two allocations are next to each other.

Brendan Hansknecht (Dec 08 2024 at 16:37):

Yeah, all allocators use various levels of pools. Generally after a certain size they fall back on more direct allocation via mmap.

Brendan Hansknecht (Dec 08 2024 at 16:38):

The allocator for roc wasm4 assumes a fixed size memory limit and has no fallback, but otherwise should be quite fast and generally light on resources

Brendan Hansknecht (Dec 08 2024 at 16:39):

More robust allocators like tcmalloc, mimalloc and jbz allocator have a lot of tricks, but they also have a lot of complexity due to expecting multithreaded complex applications.

Brendan Hansknecht (Dec 08 2024 at 16:42):

Jasper Woudenberg (Dec 19 2024 at 20:32):

Following up on the original post here, for anyone look for more zig platform examples. I've taken some hints from your build.zig file, Oskar, then added some additional features to the one I'm using for a Zig-based Roc platform:

Stream: show and tell

Topic: Advent of Code platform

Oskar Hahn (Nov 10 2024 at 14:30):

Jasper Woudenberg (Nov 10 2024 at 14:37):

Jasper Woudenberg (Nov 11 2024 at 21:54):

Luke Boswell (Nov 11 2024 at 22:01):

Luke Boswell (Nov 11 2024 at 22:02):

Jasper Woudenberg (Nov 11 2024 at 22:29):

Luke Boswell (Nov 11 2024 at 22:39):

Luke Boswell (Nov 11 2024 at 22:40):

Luke Boswell (Nov 11 2024 at 22:40):

Luke Boswell (Nov 11 2024 at 22:55):

Luke Boswell (Nov 11 2024 at 22:59):

Jasper Woudenberg (Nov 11 2024 at 23:06):

Luke Boswell (Nov 11 2024 at 23:12):

Brendan Hansknecht (Nov 11 2024 at 23:59):

Brendan Hansknecht (Nov 11 2024 at 23:59):

Brendan Hansknecht (Nov 12 2024 at 00:00):

Oskar Hahn (Nov 13 2024 at 21:13):

teskje (Nov 13 2024 at 21:32):

Brendan Hansknecht (Nov 13 2024 at 21:39):

Brendan Hansknecht (Nov 13 2024 at 21:39):

Brendan Hansknecht (Nov 13 2024 at 21:40):

Brendan Hansknecht (Nov 13 2024 at 21:40):

Oskar Hahn (Nov 13 2024 at 21:47):

Oskar Hahn (Nov 13 2024 at 21:48):

Oskar Hahn (Nov 13 2024 at 21:51):

Brendan Hansknecht (Nov 13 2024 at 22:02):

Brendan Hansknecht (Nov 13 2024 at 22:02):

Brendan Hansknecht (Nov 13 2024 at 22:03):

Oskar Hahn (Nov 13 2024 at 22:13):

Oskar Hahn (Nov 13 2024 at 22:37):

Brendan Hansknecht (Nov 14 2024 at 00:03):

Isaac Van Doren (Nov 14 2024 at 00:05):

Brendan Hansknecht (Nov 14 2024 at 00:31):

Oskar Hahn (Nov 14 2024 at 07:46):

Brendan Hansknecht (Nov 14 2024 at 08:10):

Brendan Hansknecht (Nov 14 2024 at 08:11):

Oskar Hahn (Nov 16 2024 at 13:01):

Anton (Nov 16 2024 at 13:13):

Brendan Hansknecht (Nov 16 2024 at 17:08):

Oskar Hahn (Nov 16 2024 at 17:18):

Brendan Hansknecht (Nov 16 2024 at 17:23):

Matt Harden (Nov 28 2024 at 21:03):

Brendan Hansknecht (Nov 28 2024 at 21:05):

Matt Harden (Nov 28 2024 at 21:08):

Oskar Hahn (Nov 28 2024 at 21:21):

Matt Harden (Nov 28 2024 at 21:23):

Matt Harden (Nov 28 2024 at 21:34):

Luke Boswell (Nov 28 2024 at 21:42):

Matt Harden (Nov 28 2024 at 21:43):

Matt Harden (Nov 28 2024 at 21:52):

Oskar Hahn (Dec 01 2024 at 17:51):

Oskar Hahn (Dec 06 2024 at 14:45):

Anthony Bullard (Dec 06 2024 at 14:50):

Anthony Bullard (Dec 06 2024 at 14:51):

Anthony Bullard (Dec 06 2024 at 14:51):

Oskar Hahn (Dec 06 2024 at 14:51):

Anthony Bullard (Dec 06 2024 at 14:52):

Anthony Bullard (Dec 06 2024 at 14:52):

Anthony Bullard (Dec 06 2024 at 14:53):

Brendan Hansknecht (Dec 06 2024 at 16:34):

Brendan Hansknecht (Dec 06 2024 at 16:34):

Brendan Hansknecht (Dec 06 2024 at 16:35):

Brendan Hansknecht (Dec 06 2024 at 16:36):

Oskar Hahn (Dec 07 2024 at 08:01):

Brendan Hansknecht (Dec 07 2024 at 08:44):

Oskar Hahn (Dec 08 2024 at 15:23):

Brendan Hansknecht (Dec 08 2024 at 16:37):

Brendan Hansknecht (Dec 08 2024 at 16:38):

Brendan Hansknecht (Dec 08 2024 at 16:39):

Brendan Hansknecht (Dec 08 2024 at 16:42):

Jasper Woudenberg (Dec 19 2024 at 20:32):