perf: speed up format v2.2 scans by adding shortcut for full page#5981
Conversation
ACTION NEEDED The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification. For details on the error, please inspect the "PR Title Check" action.
PR Review: perf: refine FullZip repetition index scheduling and caching

Overall: Clean refactoring that simplifies the API and improves I/O scheduling. No P0/P1 issues found.

Summary of Changes
Minor Observations
LGTM 👍
westonpace
left a comment
Can we at least leave in the option to disable it?
> Additionally, the rep index cache grows linearly; for 200k rows it occupies about 1.6 MiB. This cache will be managed by our global metadata cache. So I think it's totally ok for us to handle it.
I am a little bit worried about this. If we have billions of rows in a dataset won't that mean we need GBs of RAM (per string/binary column)?
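As a back-of-envelope check of this concern, extrapolating from the ~1.6 MiB per 200k rows figure quoted above (the arithmetic below is illustrative, not from the PR itself):

```rust
fn main() {
    // Linear growth reported in the PR: ~1.6 MiB of rep index per 200k rows.
    let bytes_per_row = 1.6 * 1024.0 * 1024.0 / 200_000.0; // ≈ 8.4 bytes/row
    // Extrapolated to a one-billion-row string/binary column:
    let gib_per_billion = bytes_per_row * 1e9 / (1024.0 * 1024.0 * 1024.0);
    // Roughly 8 GiB of rep index cache per such column, matching the worry
    // that billions of rows would mean GBs of RAM.
    println!(
        "{:.1} bytes/row, {:.1} GiB per billion rows",
        bytes_per_row, gib_per_billion
    );
}
```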
If the goal is to improve full scan performance I think there is potentially another way. If we know we are going to load an entire page then we could shortcut and just read the entire page. Then schedule_ranges could receive a special reader that just returns slices of the page data. This way we avoid needing two stages of I/O for full scans.
Yep, will do.
Seems like an interesting idea; I'll give it a try first.
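The full-page shortcut suggested above could look roughly like this: read the whole page with one I/O, then serve every subsequent range request as a slice of that buffer. This is a simplified sketch, not the actual Lance implementation; the struct name follows the later discussion, but the methods are illustrative.

```rust
use std::sync::Arc;

/// Sketch of the "full page shortcut": when a scan covers an entire page,
/// fetch the page bytes once, then answer range requests by slicing the
/// buffer instead of issuing a second round of I/O.
struct FullZipReadSource {
    /// The entire page, fetched with a single read.
    page: Arc<Vec<u8>>,
    /// Offset of the page within the file, so absolute file ranges can be
    /// translated into buffer-relative ranges.
    page_offset: u64,
}

impl FullZipReadSource {
    fn new(page: Vec<u8>, page_offset: u64) -> Self {
        Self {
            page: Arc::new(page),
            page_offset,
        }
    }

    /// Return the bytes for an absolute file range `[start, end)` by
    /// slicing the cached page buffer; no additional I/O is needed.
    fn read_range(&self, start: u64, end: u64) -> &[u8] {
        let s = (start - self.page_offset) as usize;
        let e = (end - self.page_offset) as usize;
        &self.page[s..e]
    }
}

fn main() {
    // A fake 16-byte "page" starting at file offset 100.
    let source = FullZipReadSource::new((0u8..16).collect(), 100);
    // Requesting file range [104, 108) returns bytes 4..8 of the page.
    assert_eq!(source.read_range(104, 108), &[4, 5, 6, 7]);
}
```

The point of the design is that `schedule_ranges` can keep issuing the same range requests it always has; only the backing reader changes, so full scans collapse from two I/O stages (rep index, then values) into one.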
Hi @westonpace, I added the shortcut as suggested. The latest bench results:
Compared to previous impls:
A bit slower on local, but I think that's fine.
westonpace
left a comment
I like the shortcut with FullZipReadSource, that looks great. Do we need to get rid of the caching ability though? I think it still might be useful for random access cases.
```rust
/// Cached state containing the decoded repetition index
cached_state: Option<Arc<FullZipCacheableState>>,
/// Whether to enable caching of repetition indices
enable_cache: bool,
```
Why get rid of this? It might still be useful for users that want 1 IOP random access on relatively small amounts of data?
I tried always enabling the cache before; now that we have the shortcut, we can bring the flag back.
…rep-index # Conflicts: # rust/lance-encoding/src/encodings/logical/primitive.rs
…rep-index # Conflicts: # rust/lance-encoding/src/encodings/logical/primitive.rs
westonpace
left a comment
Awesome, thanks for bearing with the review! Also, I'm really happy to see this fix get in, I always felt it was kind of embarrassing we were doing two IOPS in this full page case 😰. I just never got around to fixing it 😆
…nce-format#5981)

This PR addresses a long-standing issue in the Lance file format (both v2.1 and v2.2) where the rep index must be loaded before reading any full-zipped values. This can cause serious head-of-line (HoL) blocking, especially when data is stored on high-latency, high-throughput services like S3. @westonpace previously reported this issue in lance-format#3579.

We already support caching this index on user request (a feature I implemented earlier). Now I believe we should always cache it by default, as it is low-cost and highly beneficial.

Based on a full scan test using the FineWeb dataset, I observed the following improvements:

**Local dataset**
- p50: 900,363 µs → 542,266 µs (1.66x faster, −39.8%)
- p95: 951,820 µs → 579,505 µs (1.64x faster)
- p99: 994,691 µs → 713,062 µs (1.40x faster)

**S3 dataset**
- p50: 3,981,524 µs → 990,825 µs (4.02x faster, −75.1%)
- p95: 4,056,506 µs → 1,124,499 µs (3.61x faster)
- p99: 4,106,640 µs → 1,207,027 µs (3.40x faster)

Additionally, the rep index cache grows linearly; for 200k rows it occupies about 1.6 MiB. Since this cache is managed by our global metadata cache, I think it's totally ok for us to handle it.

This PR includes the following changes:

- always cache the repetition index when present and populate `cached_state` immediately
- split I/O submission so cached paths submit reads before awaiting, keeping the non-cached behavior as a fallback
- drop unused cache flag/parameter plumbing and update the full zip cache test expectations

**Parts of this PR were drafted with assistance from Codex (with `gpt-5.3-codex`) and amp (with `claude-4.6`), and fully reviewed and edited by me. I take full responsibility for all changes.**
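The "always cache the repetition index" change above can be sketched, very loosely, as a get-or-decode lookup against a shared metadata cache, so decoding happens at most once per page and later scans reuse the same `Arc`. All names here (`RepIndex`, `MetadataCache`, `get_or_decode`, and the decode step itself) are illustrative, not Lance's actual API.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

/// Stand-in for the decoded repetition index of one page.
#[derive(Debug, PartialEq)]
struct RepIndex {
    row_offsets: Vec<u64>,
}

/// Stand-in for the global metadata cache that owns decoded indices.
struct MetadataCache {
    entries: Mutex<HashMap<String, Arc<RepIndex>>>,
}

impl MetadataCache {
    fn new() -> Self {
        Self {
            entries: Mutex::new(HashMap::new()),
        }
    }

    /// Return the cached index for `key`, decoding it on first access only.
    fn get_or_decode(&self, key: &str, raw: &[u8]) -> Arc<RepIndex> {
        let mut entries = self.entries.lock().unwrap();
        entries
            .entry(key.to_string())
            .or_insert_with(|| {
                // Stand-in for the real repetition-index decode.
                let row_offsets = raw.iter().map(|b| *b as u64).collect();
                Arc::new(RepIndex { row_offsets })
            })
            .clone()
    }
}

fn main() {
    let cache = MetadataCache::new();
    let first = cache.get_or_decode("col0/page0", &[0, 4, 9]);
    let second = cache.get_or_decode("col0/page0", &[7, 7, 7]);
    // The second lookup hits the cache: same Arc, no re-decode.
    assert!(Arc::ptr_eq(&first, &second));
    assert_eq!(second.row_offsets, vec![0, 4, 9]);
}
```

Because the cache hands out `Arc` clones, populating `cached_state` immediately is cheap, and the cached path can submit its value reads without first awaiting a rep index I/O.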