storage: GetBlob: write to a local tempfile #1167

Merged
lsm5 merged 1 commit into containers:master from vrothberg:getblob-optimization
Mar 2, 2021

Conversation

@vrothberg
Member

@vrothberg vrothberg commented Mar 2, 2021

When reading blobs (e.g., layers) from the storage, write the data to a
temporary file first and return its file descriptor.

The main goal is to keep the time of the storage being locked as short
as possible. With this approach we are disk-bound which is generally
faster than being network bound.

The motivation for this change is the desire to allow accessing the
storage during pushes. We have attempted to optimize certain execution
paths in containers/storage, but those require the callers to know
whether data will be manipulated or merely read, which is far from
trivial. The daemon-less nature of the storage also requires writing
certain data.

This approach is entirely transparent to the storage.

Signed-off-by: Valentin Rothberg rothberg@redhat.com

@vrothberg
Member Author

vrothberg commented Mar 2, 2021

@rhatdan @giuseppe @nalind @lsm5 PTAL

@rhatdan
Member

rhatdan commented Mar 2, 2021

LGTM

Collaborator

@mtrmac mtrmac left a comment


(Just a drive-by, I don’t have an opinion on the performance trade-off.)

@lsm5
Member

lsm5 commented Mar 2, 2021

@vrothberg could you perhaps suggest a test to measure this change? And would it make sense to include such a test in CI ? /cc @ashcrow

@vrothberg
Member Author

vrothberg commented Mar 2, 2021

@vrothberg could you perhaps suggest a test to measure this change? And would it make sense to include such a test in CI ? /cc @ashcrow

It's rather tough to test such performance-oriented changes. A simple manual test is to force a long copy operation to a registry (e.g., podman push) and, in parallel, access the storage (e.g., podman images). The expected result is that the second operation is blocked only for a short period of time, in contrast to being blocked for the entire copy/push operation as before.
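For reference, the manual test could look like this. The image name is a placeholder; any large image pushed to a slow or remote registry works:

```
# Terminal 1: start a long-running push (large image, slow/remote registry).
podman push example.com/big:latest

# Terminal 2, while the push is still running: listing images should
# return promptly instead of blocking until the push finishes.
time podman images
```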

@vrothberg
Member Author

toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading: https://www.docker.com/increase-rate-limit

Let's wait a bit :^)

@vrothberg vrothberg force-pushed the getblob-optimization branch 2 times, most recently from 1da9d8f to 61d68ad on March 2, 2021 at 14:13
@TomSweeneyRedHat
Member

LGTM
assuming rate limits let the test through.

Member

@lsm5 lsm5 left a comment


Change LGTM. I'll try a test with this change and get back if I see anything. Guess we gotta wait anyway for the rate-limiting issue.

@vrothberg vrothberg force-pushed the getblob-optimization branch from 61d68ad to 20f7287 on March 2, 2021 at 16:40
@vrothberg
Member Author

Green, and happy.

Member

@lsm5 lsm5 left a comment


Tested locally just now, this is really nice. podman images and other commands don't have to wait for podman push anymore!

LGTM.

@lsm5
Member

lsm5 commented Mar 2, 2021

I want to hit merge, but just checking if it's ok to Rebase and merge or do you prefer a merge commit be added?

@vrothberg
Member Author

I want to hit merge, but just checking if it's ok to Rebase and merge or do you prefer a merge commit be added?

"Rebase + merge" is the default in c/image. Thanks for reviewing and testing!

@akostadinov

How about using a hard link to avoid unnecessary disk writes?

@giuseppe
Member

How about using a hard link to avoid unnecessary disk writes?

We'd need to make sure the same file system is used. For that, we could use the new staging-directory API that was added to c/storage to address exactly this issue.

@mtrmac
Collaborator

mtrmac commented Jun 14, 2021

How about using a hard link to avoid unnecessary disk writes?

The data is typically a stream (formed at runtime with tar-split data), not a file that could be linked.

@vrothberg vrothberg deleted the getblob-optimization branch June 17, 2021 07:15