
perf: submit I/O requests eagerly in FullZipScheduler #6513

Merged
Xuanwo merged 1 commit into lance-format:main from hushengquan:optimize-fullzip on Apr 14, 2026
Conversation

@hushengquan
Contributor

Summary

Refactor FullZipScheduler::create_page_load_task to accept a pre-submitted I/O future instead of deferring I/O submission until the async task executes. This allows I/O requests to be submitted immediately during scheduling, enabling the object store layer to batch and parallelize them. Closes #6504.

I/O Model Change

Before: Lazy I/O submission (serialized)

Previously, create_page_load_task received a FullZipReadSource::Remote(io) along with byte ranges and priority. The actual io.submit_request() call happened inside the async block, meaning the I/O request was not submitted until the future was first polled.

When decoding multiple pages (e.g. across many fragments), this created a sequential I/O pattern:

Page 1: [schedule] -> [poll] -> [submit I/O] -> [wait response] -> [decode]
Page 2:                                          [schedule] -> [poll] -> [submit I/O] -> [wait response] -> [decode]
Page 3:                                                                                   [schedule] -> [poll] -> ...

Each page's I/O request could only be submitted after the previous task started executing. The I/O scheduler had no visibility into upcoming requests, preventing it from batching or parallelizing them effectively.

After: Eager I/O submission (pipelined)

Now, io.submit_request() is called before constructing the PageLoadTask, and the resulting future is passed into create_page_load_task. All I/O requests for all pages are submitted upfront during the scheduling phase:

[schedule all pages] --> submit I/O page 1 -+
                     --> submit I/O page 2 -+
                     --> submit I/O page 3 -+  (all in-flight concurrently)
                     --> submit I/O page N -+
                                            |
                     [poll] -> [await page 1 response] -> [decode]
                     [poll] -> [await page 2 response] -> [decode]
                     [poll] -> [await page 3 response] -> [decode]

The object store layer can now see all pending requests at once and optimize I/O through batching, connection multiplexing, and parallel fetches. The async tasks only await the already-in-flight I/O futures.

Changes

  • rust/lance-encoding/src/encodings/logical/primitive.rs:
    • Changed the create_page_load_task signature to accept BoxFuture<'static, Result<Vec<Bytes>>> instead of FullZipReadSource + byte ranges + priority
    • Moved the io.submit_request() calls to happen eagerly at both call sites (schedule_ranges_with_rep_index and the non-rep-index path), before constructing the page load task

Performance

Tested with a multi-fragment dataset containing fixed-width columns (768-dim float32 vectors, 40 fragments, 50 rows/fragment):

Benchmark                  Before (p50)   After (p50)   Speedup
Fixed-width column scan    3453 ms        523 ms        6.6x

The improvement comes entirely from I/O pipelining — the decoding logic itself is unchanged. The effect is most pronounced with many fragments or pages, where the serialized I/O submission was the dominant bottleneck.

@codecov

codecov Bot commented Apr 14, 2026

Codecov Report

❌ Patch coverage is 90.90909% with 1 line in your changes missing coverage. Please review.

Files with missing lines                                Patch %   Lines
.../lance-encoding/src/encodings/logical/primitive.rs   90.90%    0 Missing and 1 partial ⚠️


Collaborator

@Xuanwo left a comment


Nice change, thank you!

@Xuanwo Xuanwo merged commit 9df651b into lance-format:main Apr 14, 2026
30 checks passed
@westonpace
Member

Belated second approval. Thanks for catching this! This is the third time this issue (wrapping something in an async move block only to have the execution semantics change) has bitten me in two months 😆

@hushengquan hushengquan deleted the optimize-fullzip branch April 16, 2026 02:22

Development

Successfully merging this pull request may close these issues.

FullZip scan latency regression on cloud storage (S3) due to lazy I/O submission

3 participants