Conversation


@razzeee razzeee commented Dec 13, 2025

…porary file to prevent OOM for large files.

Hey,

my C is very bad, so this was found and done with AI after looking at the reports in flatpak/flatpak#6255.

I would expect this to be slightly less performant, but to work more reliably on systems with less RAM. Most of what we do here also seems to have been done elsewhere in the codebase here and there, so going this route doesn't seem unprecedented.

I haven't been able to runtime-test this and I'm unsure how, especially how to validate that it still works in the wild (e.g. that it doesn't break deltas).


openshift-ci bot commented Dec 13, 2025

Hi @razzeee. Thanks for your PR.

I'm waiting for a github.com member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.


Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This is a great approach to solving the OOM issue with large deltas. Using a memory-mapped temporary file is a solid strategy to reduce memory pressure. The implementation of reading the patched data in chunks is also well done.

However, I've found a critical issue with the handling of zero-sized output files. The new implementation using mmap will fail in this case, which was handled correctly by the previous g_malloc0 implementation. This would cause a regression for any delta that results in a zero-byte file.

I've added a few detailed comments with suggestions on how to fix this by special-casing when content_size is zero. Please take a look. With these changes, this should be a very robust improvement!


@simonmicro simonmicro left a comment


I would be interested to learn more about your reasoning behind changing the code around _ostree_repo_bare_content_write.

guint64 bytes_remaining = state->content_size;
while (bytes_remaining > 0)
{
guchar chunk_buf[8192];


Have you considered bigger block sizes like experimented with in src/libostree/ostree-repo-commit.c:1232?

Comment on lines +485 to +487
do
bytes_read = read (tmpf.fd, chunk_buf, chunk_size);
while (G_UNLIKELY (bytes_read == -1 && errno == EINTR));


Even though reading a chunk this small should not take "long enough" for a badly placed interrupt to land, you retry on EINTR here. Could you consider also patching the read in src/libostree/ostree-repo-commit.c:1240 the same way?

if (!_ostree_repo_bare_content_write (repo, &state->content_out, buf, state->content_size,
cancellable, error))
return FALSE;
/* Now read from tmpfile and write to repository (if content_size > 0) */


Why was this done? Previously, the (potentially) large output buffer was written into the repository via _ostree_repo_bare_content_write. Now this code reads the patched content back into memory chunk by chunk and then writes it to disk. But since the source was changed to a (potentially) memory-mapped buffer, this is not needed; I would even say it introduces an unnecessary copy.

Furthermore, the glnx_loop_write part of _ostree_repo_bare_content_write may now perform additional calls to write, as the chunk sizes may not align with its retry logic; see https://github.com/GNOME/libglnx/blob/de29c5f7d9df8d57b4f5caa9920f5d4caa7a8cfc/glnx-fdio.c#L752
