`ci`: only *write ccache in "push to master" jobs by ochafik · Pull Request #11661 · ggml-org/llama.cpp

ochafik · 2025-02-04T16:02:50Z

According to https://github.com/ggerganov/llama.cpp/actions/caches, we're Approaching total cache storage limit (88.08 GB of 10 GB Used)

With this PR, instead of letting each and every branch write their branch-specific outputs to ccache (and probably overwrite each other w/ weird race conditions), we restrict it to pushes to master (hopefully less concurrency). Also proposing to expire cache after 12h but not sure that's needed (risk is if there's no push for over 12h, then nobody will get any ccache to read from).

cc/ @slaren (follow up to #11516)

…f unused

slaren · 2025-02-04T16:07:58Z

I don't think exceeding the cache size is necessarily a problem, that's expected, since caches are immutable and every commit adds a new set of caches. As long as the size of all the caches created in a single commit is a few times lower than the max total cache size, so that the cache for the latest master commit and the caches of open PRs are kept, it should be fine. Creating caches for PRs is desirable since it improves the build times of subsequent commits to the PR.

ochafik · 2025-02-04T17:45:37Z

I don't think exceeding the cache size is necessarily a problem, that's expected, since caches are immutable and every commit adds a new set of caches. As long as the size of all the caches created in a single commit is a few times lower than the max total cache size, so that the cache for the latest master commit and the caches of open PRs are kept, it should be fine. Creating caches for PRs is desirable since it improves the build times of subsequent commits to the PR.

I'm weary about the following problems:

Cache eviction is currently random and likely drops an entire job's cache at a time, causing that job to randomly take 5+ more minutes for whoever runs it next. We likely never gets to the fine-grained, per file expiration setup in the other PR.
Different PRs which jobs run in parallel may overwrite each other's update of the shared cache (not sure what parallelism we have, I'd assume there's clusters of ppl with similar active hours)
We currently can't realistically cache the various heavy SDK install files that are now contributing to some of the longest runs

If we only cached the main branch, we could cache said sdk downloads (reducing long tail), and PRs would get a % of cache hit proportional with the amount of files they modified, with a predictable pattern. PRs with lots of header changes would pay a higher compilation price but would reap benefits from long tail SDK-installing jobs being much faster, and a possible majority of PRs (TBC) would still have a high, predictible cache hit rate.

(I'm wondering how to interpret the https://github.com/ggerganov/llama.cpp/actions/metrics/performance metrics, but job queue time is on the rise, and avg time hasn't budged)

ochafik · 2025-02-04T17:51:11Z

@slaren Anyway, if you're willing to experiment, we could push something like this (+ maybe cache some sdk downloads) and see in which direction performance metrics budge after a week / revert if it's worse.

slaren · 2025-02-04T18:06:40Z

On a related note, evict-old-files probably does not work with scache (the action only has an implementation for ccache), which may be why the windows CUDA cache files are so big.

https://github.com/hendrikmuhs/ccache-action/blob/a1209f81afb8c005c13b4296c32e363431bffea5/src/save.ts#L58

CISC · 2025-10-31T10:35:55Z

This is tempting to resurrect, we get absolutely swamped with caches on busy days, esp. the windows CUDA caches are problematic as they've now grown to 0.5G each.

ggerganov · 2026-03-25T11:27:06Z

I think this was superseded by #18207

ochafik added 2 commits February 3, 2025 02:23

Only write ccache when pushing to master, and evict files after 12h o…

106d2b3

…f unused

Merge branch 'master' into ci-write-less-ccache

421e1f0

ochafik changed the title ~~ci: only write ccache in release jobs (but keep reading from them)~~ ci: only *write ccache when pushing to master Feb 4, 2025

ochafik changed the title ~~ci: only *write ccache when pushing to master~~ ci: only *write ccache in "push to master" jobs Feb 4, 2025

github-actions Bot added the devops improvements to build systems and github actions label Feb 4, 2025

slaren approved these changes Feb 4, 2025

View reviewed changes

CISC mentioned this pull request Dec 19, 2025

ci : only save ccache on master #18207

Merged

ggerganov closed this Mar 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`ci`: only *write ccache in "push to master" jobs#11661

`ci`: only *write ccache in "push to master" jobs#11661
ochafik wants to merge 2 commits intoggml-org:masterfrom
ochafik:ci-write-less-ccache

ochafik commented Feb 4, 2025 •

edited

Loading

Uh oh!

slaren commented Feb 4, 2025

Uh oh!

ochafik commented Feb 4, 2025 •

edited

Loading

Uh oh!

ochafik commented Feb 4, 2025

Uh oh!

slaren commented Feb 4, 2025

Uh oh!

CISC commented Oct 31, 2025

Uh oh!

ggerganov commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ochafik commented Feb 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

slaren commented Feb 4, 2025

Uh oh!

ochafik commented Feb 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ochafik commented Feb 4, 2025

Uh oh!

slaren commented Feb 4, 2025

Uh oh!

CISC commented Oct 31, 2025

Uh oh!

ggerganov commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ochafik commented Feb 4, 2025 •

edited

Loading

ochafik commented Feb 4, 2025 •

edited

Loading