fix(cache): handle missing cache hits when chaining two run steps #25788

bfredl · 2025-11-02T09:58:41Z

fixes #19817
This is the same as #19974 but un-bitrotted.

This improves the efficiency of the cache when chaining multiple commands, like:

const step1 = b.addRunArtifact(tool_fast);
step1.addFileArg(b.path("src/input.c"));
const output1 = step1.addOutputFileArg("output1.h");

const step2 = b.addRunArtifact(tool_slow);
step2.addFileArg(output1);
const chained_output = step2.addOutputFileArg("output2.h");

Assume that step2 takes much longer than step1. If we make a change to "src/input.c" which produces an "output1.h" identical to one produced from a previous input, one would expect step2 not to rerun, as the cached output2.h only depends on the content of output1.h.

However, this does not work yet as the hash of src/input.c leaks into the file name of the cached output1.h, which the second run step interprets as a different cache key. Not using the ".zig-build/o/{HASH}" part of the file name in the hash key fixes this.

https://github.com/bfredl/zig-run4run is updated for zig 0.16 as a demonstration. For example, in src/foo.c, changing implementation_of_foo() to new_implementation_of_foo() should only rerun the first step, not the second.
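The effect described above can be sketched in isolation. This is a minimal illustration, not the code from this PR: `CachedFile`, `keyWithPath`, and `keyContentOnly` are hypothetical names, and the real manifest hashing in `std.Build` is more involved. It only shows why mixing the `o/{HASH}` directory into the key prevents a cache hit even when the file contents are identical.

```zig
const std = @import("std");
const Sha256 = std.crypto.hash.sha2.Sha256;

/// Hypothetical stand-in for a cached intermediate file.
const CachedFile = struct {
    /// Path inside the cache; the directory name depends on the upstream inputs.
    path: []const u8,
    contents: []const u8,
};

/// Key that mixes in the full path, so the upstream input hash leaks into it.
fn keyWithPath(file: CachedFile) [Sha256.digest_length]u8 {
    var h = Sha256.init(.{});
    h.update(file.path);
    h.update(file.contents);
    var out: [Sha256.digest_length]u8 = undefined;
    h.final(&out);
    return out;
}

/// Key that depends only on the contents of the file.
fn keyContentOnly(file: CachedFile) [Sha256.digest_length]u8 {
    var h = Sha256.init(.{});
    h.update(file.contents);
    var out: [Sha256.digest_length]u8 = undefined;
    h.final(&out);
    return out;
}

test "identical output1.h in two different hash directories" {
    // "aaaa" and "bbbb" are placeholder directory names, not real cache hashes.
    const before = CachedFile{ .path = "o/aaaa/output1.h", .contents = "#define FOO 1\n" };
    const after = CachedFile{ .path = "o/bbbb/output1.h", .contents = "#define FOO 1\n" };

    const p1 = keyWithPath(before);
    const p2 = keyWithPath(after);
    const c1 = keyContentOnly(before);
    const c2 = keyContentOnly(after);

    // The path-sensitive key changes, so step2 would rerun.
    try std.testing.expect(!std.mem.eql(u8, &p1, &p2));
    // The content-only key is stable, so step2 could reuse its cached output.
    try std.testing.expect(std.mem.eql(u8, &c1, &c2));
}
```

Saved to a file, the sketch can be exercised with `zig test`.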

Fixes an issue in tryFindProgram where it would fail if the PATH environment variable contained relative paths, due to its incorrect assumption that the full_path argument is always an absolute path. Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30619 Co-authored-by: hixuyuming Co-committed-by: hixuyuming

See https://codeberg.org/ziglang/zig/issues/30637 for details. Works around https://gcc.gnu.org/bugzilla/show_bug.cgi?id=86673 Co-authored-by: Laurin-Luis Lehning Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30645 Reviewed-by: Andrew Kelley Co-authored-by: e820 Co-committed-by: e820

When linking libc, it should be the libc that manages the heap. The main Wasm memory might have been configured as non-growable, which makes `WasmAllocator` a poor default and causes the common `DebugAllocator` use case to fail with OOM errors unless the user uses `std_options` to override the default page allocator. Additionally, on Emscripten, growing Wasm memory without notifying the JS glue code will cause array buffers to get detached and lead to spurious crashes.

The goal of this internal refactor is to fix some bugs in cancelation and allow group tasks to clean up their own resources eagerly. The latter will become a guarantee of the `std.Io` interface, which is important so that groups can be used to "detach" tasks. This commit changes the API which POSIX system calls use internally (the functions formerly called `beginSyscall` etc), but does not update the usage sites yet.

The most interesting thing here is the replacement of the pthread futex implementation with an implementation based on thread park/unpark APIs. Thread parking tends to be the primitive provided by systems which do not have a futex primitive, such as NetBSD, so this implementation is far more efficient than the pthread one. It is also useful on Windows, where `RtlWaitOnAddress` is itself a userland implementation based on thread park/unpark; we can implement it ourselves including support for features which Windows' implementation lacks, such as cancelation and waking a number of waiters with 1<n<infinity. Compared to the pthread implementation, this thread-parking-based one also supports full robust cancelation.

Thread parking also turns out to be useful for implementing `sleep`, so is now used for that on Windows and NetBSD.

This commit also introduces proper cancelation support for most Windows operations. The most notable omission right now is DNS lookups through `GetAddrInfoEx`, just because they're a little more work due to having a unique cancelation mechanism---but the machinery is all there, so I'll finish gluing it together soon.

As of this commit, there are very few parts of `Io.Threaded` which do not support full robust cancelation. The only ones which actually really matter (because they could block for a prolonged period of time) are DNS lookups on Windows (as discussed above) and futex waits on WASM.

As of this branch, the performance impact of robust cancelation is now negligible (and in fact entirely unmeasurable in almost all cases), so there is no good reason to not enable it in all cases. The performance issues before were primarily down to a typo in the robust cancelation logic which resulted in every canceled syscall potentially being sent hundreds of signals in quick succession, because the delay between signals started out at 1ns instead of 1us!

This commit includes some API changes which I agreed with Andrew as a follow-up to the recent `Io.Group` changes:

* `Io.Group.await` *does* propagate cancelation to group tasks; it then waits for them to complete, and *also* returns `error.Canceled`. The assertion that group tasks handle `error.Canceled` "correctly" means this behavior is loosely analogous to how awaiting a future works. The important thing is that the semantics of `Group.await` and `Future.await` are similar, and `error.Canceled` will always be visible to the caller (assuming correct API usage).
* `Io.Group.awaitUncancelable` is removed.
* `Future.await` calls `recancel` only if the "child" task (the future being awaited) did not acknowledge cancelation. If it did, then it is assumed that the future will propagate `error.Canceled` through `await` as needed.

…es, and better Windows and NetBSD support' (#30634) from std.Io.Threaded-groups-2 into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30634 Reviewed-by: Andrew Kelley Resolves: https://codeberg.org/ziglang/zig/issues/30049

…ckaddr (#30722) Resolves ziglang/zig#30672 - UB caused by `std.Io.Threaded.netLookupFallible` incorrectly initializing `PosixAddress`/`WsaAddress` from `*sockaddr`. Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30722 Co-authored-by: moriazoso Co-committed-by: moriazoso

ABI detection previously did not take into account the non-standard directory structure of Android. This has been fixed. The API level is detected by running `getprop ro.build.version.sdk`, since we don't want to depend on bionic, and reading system properties ourselves is not trivially possible.

The previous logic was made really messy by the fact that upon entry to the step eval worker, the step may not be ready to run, we may be racing with other workers doing the same check, and we had already acquired our RSS requirement even though we might not run. It also required iterating all dependencies each time we were called to check whether we were even ready to run yet.

A much better strategy is for each step to have an atomic counter representing how many of its dependencies are yet to complete. When a step completes (successfully or otherwise), it decrements this value for all of its dependants, and if it drops any to 0, it schedules that step to run. This means each step is scheduled exactly once, and only when all of its dependencies have finished, reducing redundant checks and hence contention.

If the step being scheduled needs to claim RSS which isn't available, then it is instead added to `memory_blocked_steps`, which is iterated by the step worker after a step with an RSS claim finishes.

This logic is more concise than before, simpler to understand, generally more efficient, and fixes a bug in the RSS tracking. As a nice side effect, it should also play a little bit nicer with `Io.Threaded`'s scheduling strategy, because we no longer spawn extremely short-lived tasks all the time as we previously did.

Resolves: https://codeberg.org/ziglang/zig/issues/30742
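The counter-based strategy in this commit message can be sketched roughly as follows. This is an illustration under stated assumptions, not the build runner's actual code: `Step` here is a hypothetical, stripped-down struct (not `std.Build.Step`), `complete` is an invented helper, and RSS claims and `memory_blocked_steps` are left out entirely.

```zig
const std = @import("std");

/// Hypothetical, simplified step; the real `std.Build.Step` has many more fields.
const Step = struct {
    name: []const u8,
    /// How many dependencies of this step have not completed yet.
    pending_deps: std.atomic.Value(u32),
    /// Steps that depend on this one.
    dependants: []const *Step,
};

/// Called exactly once when `step` finishes, successfully or otherwise.
/// Decrements the counter of every dependant and writes each dependant whose
/// counter dropped to zero into `ready`. Returns how many were written.
/// The caller must make `ready` at least as long as `step.dependants`.
fn complete(step: *Step, ready: []*Step) usize {
    var n: usize = 0;
    for (step.dependants) |dep| {
        // fetchSub returns the previous value, so 1 means this was the last
        // outstanding dependency and `dep` can be scheduled now.
        if (dep.pending_deps.fetchSub(1, .acq_rel) == 1) {
            ready[n] = dep;
            n += 1;
        }
    }
    return n;
}

test "a dependant becomes ready only after its last dependency" {
    var c = Step{ .name = "c", .pending_deps = std.atomic.Value(u32).init(2), .dependants = &.{} };
    const deps = [_]*Step{&c};
    var a = Step{ .name = "a", .pending_deps = std.atomic.Value(u32).init(0), .dependants = &deps };
    var b = Step{ .name = "b", .pending_deps = std.atomic.Value(u32).init(0), .dependants = &deps };

    var ready: [1]*Step = undefined;
    // After `a` finishes, `c` still waits on `b`.
    try std.testing.expectEqual(@as(usize, 0), complete(&a, &ready));
    // After `b` finishes, `c` is handed out exactly once.
    try std.testing.expectEqual(@as(usize, 1), complete(&b, &ready));
    try std.testing.expectEqualStrings("c", ready[0].name);
}
```

Because only the worker that observes the counter drop from 1 to 0 hands the step out, no step needs to re-scan its dependencies to find out whether it may run.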

The implementation of HostName.validate was too generous. It considered strings like ".example.com", "exa..mple.com", and "-example.com" to be valid hostnames, which is incorrect according to RFC 1123 (the currently accepted standard). Reviewed-on: ziglang#25710
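To make the rejected cases concrete, here is a small sketch of a label-by-label check in the spirit of RFC 1123. It is not the actual `HostName.validate` implementation: `isValidHostName` is a hypothetical name, the 255-byte and 63-byte limits are the usual DNS limits taken as assumptions, and a trailing dot is rejected here purely for simplicity.

```zig
const std = @import("std");

/// Hypothetical helper: returns true if `name` consists of dot-separated
/// labels that are non-empty, at most 63 bytes long, made of ASCII letters,
/// digits and '-', and that neither start nor end with '-'.
fn isValidHostName(name: []const u8) bool {
    if (name.len == 0 or name.len > 255) return false;
    var labels = std.mem.splitScalar(u8, name, '.');
    while (labels.next()) |label| {
        // An empty label covers a leading dot, a trailing dot, and "..".
        if (label.len == 0 or label.len > 63) return false;
        if (label[0] == '-' or label[label.len - 1] == '-') return false;
        for (label) |c| {
            if (!std.ascii.isAlphanumeric(c) and c != '-') return false;
        }
    }
    return true;
}

test "examples from the commit message are rejected" {
    try std.testing.expect(!isValidHostName(".example.com"));
    try std.testing.expect(!isValidHostName("exa..mple.com"));
    try std.testing.expect(!isValidHostName("-example.com"));
    try std.testing.expect(isValidHostName("example.com"));
}
```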

alexrp · 2026-01-09T17:35:09Z

https://codeberg.org/ziglang/zig/pulls/30762

bfredl force-pushed the cache4 branch 2 times, most recently from 8b75cfe to a9e757d Compare November 9, 2025 12:55

bfredl force-pushed the cache4 branch from a9e757d to b934e4c Compare November 11, 2025 12:46

PeterMcKinnis and others added 27 commits December 30, 2025 15:09

fix lockStderr API calls in test_runner fuzz code

96ba0ab

std: remove fs.getAppDataDir with no replacement

e956948

This API is a bit too opinionated for the Zig standard library. Applications should contain this logic instead.

Update posix.getRandomBytesDevURandom to use linux.statx

9d497d0

Fixes #30631

Make Io.Mutex an extern struct

4ad8bc3

libc: remove fmod, fmodf and fmodl

814b1e9

These symbols are already provided by compiler_rt

libc: remove log/f, log2/f and log10/f

eab93d3

These symbols are already provided by compiler_rt

git.zig: Process data packets of all lengths, discarding unrecognized…

2bd0288

… packets This addresses the regression specific to GitHub's chunked transfer encoding for larger repositories while leaving existing functionality intact.

crypto.edwards25519: optimize rejectLowOrder

1baa127

Reject low-order points by checking projective coordinates directly instead of using affine coordinates. Equivalent, but saves CPU cycles (~254 field multiplications total before, 3 field multiplications after).

std: use decl literals to improve endian ergonomics

53ebfde

Merge pull request 'crypto.edwards25519: optimize rejectLowOrder' (#3…

1bf2975

…0650) from jedisct1/zig:ed25519rej into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30650

std.Build: crashes in the test runner are fatal errors

d024d9f

Revert "Use mmap std.heap.page_allocator impl when compiling for …

0422619

…Wasm + libc" This reverts commit c9fa8e4. This commit was failing CI checks. This failure was unfortunately not noticed before merge, due in part to the build runner bug fixed in the last commit.

std.Io.Threaded: update to new internal syscall API

a1d4120

std.Io: more tests

f27134d

std.Io.Group: tweak documentation and vtable API

b8a09bc

std.Io.net: don't swallow 'error.Canceled'

3bb2f7b

This was missed when updating to the new group cancelation API, and caused illegal behavior in many cases (the condition was simply that a DNS query returned a second result before a connection was successfully established).

std.http.test: fix memory leaks on OOM

f7f0b9d

codegen.wasm: fix 64-bit saturating shl

b3c4984

Previously, 64-bit '<<|' operations were emitting 64-bit shifts with one 64-bit operand and one 32-bit operand, which is illegal. Instead, as in the lowering for regular shifts, we need to cast the RHS in this case.

incr-check: make sure to always show the target

4de3357

Change the log implementation to prepend the current target and update to all logs which happen during an update. Makes progress on ziglang#22510, but does not fully resolve it.

xtexx and others added 27 commits January 8, 2026 04:54

link.Elf2: fix incorrect expected node length

cdaf279

Change-Id: Iba9c4bf2cfa4ff1b82dd5f0828c57711f238f1bf

fix redundant safety checks being emitted for slicing

d2d8b96

std: find a better home for the "preopens" concept

6a5bb3e

std.Io.Threaded: clock_nanosleep is not linux-only

4319c89

clock_nanosleep is specified by POSIX but not implemented on these hereby shamed operating systems:

* macOS
* OpenBSD (which defines TIMER_ABSTIME for some reason...?)

std.c: use {} rather than void for absent functions

130fc7e

ci: skip incremental tests on loongarch64-linux

cc38acf

https://codeberg.org/ziglang/zig/issues/30748

Merge branch "remove many std.posix functions"

6f7968f

Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30741

std.meta.hasUniqueRepresentation: consider enum tag type

ac91799

Closes #30731

Merge pull request 'enable x86_64-openbsd CI' (#30733) from alexrp/…

d1be8b1

…zig:openbsd-ci into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30733

std.c.SIG: make it non-exhaustive

1face9a

54a8496 did this for std.os.linux.SIG but I neglected to also do it for std.c.SIG

Merge pull request 'std.Io.Threaded: clock_nanosleep is not linux-onl…

b0570b8

…y' (#30746) from clock_nanosleep into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30746

std.Io: move some decls around

70af303

This file has changed a lot since the previous release, and I resisted the urge to do this until the conflicts would be minimized.

std.Io: add doc comments

20baf04

std.Io.Threaded: fix init for single-threaded

09028ba

std.Io.Threaded: refactor some error handling

ecea8cc

feat(libzigc): add nan, nanf, nanl and bsearch

7f6eab2

* also remove musl implementation

Fix format on uefi guid type, was hitting unreachable

e8af0f2

link.Wasm: reserve sufficient capacity for @tagName function code

6069161

Resolves: https://codeberg.org/ziglang/zig/issues/30748

std.Io.Threaded: Raise specific error when DNS lookup returns no A/…

b929078

…AAAA records closes ziglang#25948

std.c: Make clock_nanosleep available on serenity

27039a0

Follow-up to https://codeberg.org/ziglang/zig/pulls/30746. The TIMER_ABSTIME value was adjusted to match other systems in SerenityOS/serenity#26543.

Merge pull request 'crypto.ed25519.Signer: get an std.io parameter ra…

7c0b42b

…ther than entropy' (#30736) from jedisct1/zig:edsigned into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30736 Reviewed-by: Andrew Kelley

Merge pull request 'crypto.scrypt: accept an std.Io parameter instead…

721bdb6

… of direct entropy' (#30738) from jedisct1/zig:scryptfixes into master Reviewed-on: https://codeberg.org/ziglang/zig/pulls/30738 Reviewed-by: Andrew Kelley

bfredl force-pushed the cache4 branch from b934e4c to e965df5 Compare January 9, 2026 10:25

alexrp closed this Jan 9, 2026

20 participants