perf!: remove shuffle buffer #5912
Merged
westonpace merged 3 commits into lance-format:main on Feb 10, 2026
Conversation
This removes a buffer in the shuffler that accumulated batches for batched writes to temporary storage. This was configured with a public `buffer_size` parameter, hence the breaking change.

Previously, when we shuffled data we accumulated this many batches for each partition in memory and then flushed them all to disk at once. This may have been intended as an optimization in the original implementation of the shuffler, which supported external shuffling through arbitrary object storage. However, the shuffler was subsequently hardcoded to use local disk (where this kind of buffering serves no benefit), and even on remote object storage we already have a layer of buffering in the storage writer.

Instead of buffering batches, just write them directly to the `FileWriter`. This results in much more predictable memory usage and also faster index builds.
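To make the memory-usage claim concrete, here is a minimal std-only sketch (hypothetical `MockWriter` type, not the Lance API) contrasting the old buffer-then-flush pattern with the new direct-write pattern. The same rows reach disk either way; only the peak number of batches held in memory per partition differs:

```rust
/// Stand-in for a file writer; counts rows written to "disk".
struct MockWriter {
    rows_written: usize,
}

impl MockWriter {
    fn write_batch(&mut self, batch_rows: usize) {
        self.rows_written += batch_rows;
    }
}

/// Old pattern: accumulate `buffer_size` batches, then flush them all at once.
/// Returns the peak number of batches buffered in memory.
fn shuffle_buffered(batches: &[usize], buffer_size: usize, writer: &mut MockWriter) -> usize {
    let mut buffer = Vec::new();
    let mut peak_buffered = 0;
    for (counter, &rows) in batches.iter().enumerate() {
        buffer.push(rows);
        peak_buffered = peak_buffered.max(buffer.len());
        if (counter + 1) % buffer_size == 0 {
            for rows in buffer.drain(..) {
                writer.write_batch(rows);
            }
        }
    }
    // Flush whatever is left over at the end.
    for rows in buffer.drain(..) {
        writer.write_batch(rows);
    }
    peak_buffered
}

/// New pattern: hand each batch straight to the writer; peak buffering is one batch.
fn shuffle_direct(batches: &[usize], writer: &mut MockWriter) -> usize {
    for &rows in batches {
        writer.write_batch(rows);
    }
    1
}

fn main() {
    let batches = vec![100; 10]; // ten batches of 100 rows each
    let mut w1 = MockWriter { rows_written: 0 };
    let mut w2 = MockWriter { rows_written: 0 };
    let peak_old = shuffle_buffered(&batches, 4, &mut w1);
    let peak_new = shuffle_direct(&batches, &mut w2);
    // Same data reaches disk either way; only peak memory differs.
    assert_eq!(w1.rows_written, w2.rows_written);
    println!("old peak buffered batches: {peak_old}, new: {peak_new}");
}
```

With many partitions, the old pattern's peak memory scales with `buffer_size * num_partitions`, which is what made usage hard to predict.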
Codecov Report: ✅ All modified and coverable lines are covered by tests.
westonpace (Member) approved these changes on Feb 9, 2026, leaving a comment:
Nice work! The writers will already do their own buffering if they need to, so I agree this extra layer of buffering is not needed.
```diff
 if !batches.is_empty() {
     partition_sizes[part_id] += batches.iter().map(|b| b.num_rows()).sum::<usize>();
-    futs.push(writer.write_batches(batches.iter()));
+    writers[part_id].write_batches(batches.iter()).await?;
```
westonpace (Member) commented:
We can do this in a follow-up, but it might be nice to still do all the writes in parallel, e.g. keep the `futs` Vec. `shuffled` is a `Vec` and not any kind of stream/iterator, so the data is all in memory already. (I think the important point is getting rid of the `if counter % self.buffer_size == 0`.)
```rust
let mut futs = vec![];
if !batches.is_empty() {
    futs.push(writers[part_id].write_batches(batches.iter()));
}
try_join_all(futs).await?;
```
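As a rough std-only sketch of that collect-then-join pattern (using scoped threads as a stand-in for async futures and `try_join_all`; the `parallel_sums` helper and its partition data are made up for illustration):

```rust
use std::thread;

/// Process each non-empty partition on its own thread, then join all results.
fn parallel_sums(partitions: &[Vec<usize>]) -> Vec<usize> {
    thread::scope(|s| {
        // Collect one handle per non-empty partition -- this mirrors pushing a
        // future into `futs` guarded by `if !batches.is_empty()`.
        let handles: Vec<_> = partitions
            .iter()
            .filter(|batches| !batches.is_empty())
            .map(|batches| s.spawn(move || batches.iter().sum::<usize>()))
            .collect();
        // Wait for everything at once, analogous to `try_join_all(futs).await?`.
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}

fn main() {
    let partitions = vec![vec![10, 20], vec![], vec![5]];
    let totals = parallel_sums(&partitions);
    println!("{totals:?}");
}
```

The key point is the same as in the suggestion: the writes start concurrently and are awaited together, rather than each one blocking the loop.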