perf: reduce peak memory during cosine IVF-PQ index training by wkalt · Pull Request #6016 · lance-format/lance

wkalt · 2026-02-25T20:54:49Z

Two optimizations that together eliminate the transient 2x memory peak on the training sample during cosine-distance index builds:

Add normalize_fsl_owned that L2-normalizes a FixedSizeListArray in-place via Buffer::into_mutable() when the buffer is uniquely owned, avoiding a full copy. Falls back to the existing copy path when the buffer is shared.
Skip arrow::compute::filter when all vectors are already finite, avoiding another full copy of the training data.

Two optimizations that together eliminate the transient 2x memory peak on the training sample during cosine-distance index builds: 1. Add `normalize_fsl_owned` that L2-normalizes a FixedSizeListArray in-place via `Buffer::into_mutable()` when the buffer is uniquely owned, avoiding a full copy. Falls back to the existing copy path when the buffer is shared. 2. Skip `arrow::compute::filter` when all vectors are already finite, avoiding another full copy of the training data. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

wkalt · 2026-02-26T00:05:16Z

this is an index build on 100M 384d vectors before and after the change. Change targets the IVF training portion at the start.

codecov · 2026-02-26T00:12:40Z

Codecov Report

❌ Patch coverage is 94.20290% with 8 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance-linalg/src/kernels.rs	94.54%	5 Missing and 1 partial ⚠️
rust/lance/src/index/vector/pq.rs	0.00%	0 Missing and 1 partial ⚠️
rust/lance/src/index/vector/utils.rs	96.00%	0 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

BubbleCal

LGTM!

…ormat#6016) Two optimizations that together eliminate the transient 2x memory peak on the training sample during cosine-distance index builds: 1. Add `normalize_fsl_owned` that L2-normalizes a FixedSizeListArray in-place via `Buffer::into_mutable()` when the buffer is uniquely owned, avoiding a full copy. Falls back to the existing copy path when the buffer is shared. 2. Skip `arrow::compute::filter` when all vectors are already finite, avoiding another full copy of the training data. --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

github-actions Bot added the performance label Feb 25, 2026

wkalt force-pushed the wkalt/perf-ivf-training-memory branch from bc57006 to 2a505cd Compare February 25, 2026 23:20

lint

566b1f2

BubbleCal approved these changes Feb 26, 2026

View reviewed changes

wkalt merged commit 3cad9cb into lance-format:main Feb 26, 2026
29 checks passed

andrea-reale mentioned this pull request Mar 30, 2026

emilk/fix write starvation rerun-io/lance#12

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: reduce peak memory during cosine IVF-PQ index training#6016

perf: reduce peak memory during cosine IVF-PQ index training#6016
wkalt merged 2 commits intolance-format:mainfrom
wkalt:wkalt/perf-ivf-training-memory

wkalt commented Feb 25, 2026

Uh oh!

wkalt commented Feb 26, 2026

Uh oh!

codecov Bot commented Feb 26, 2026

Uh oh!

BubbleCal left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wkalt commented Feb 25, 2026

Uh oh!

wkalt commented Feb 26, 2026

Uh oh!

codecov Bot commented Feb 26, 2026

Codecov Report

Uh oh!

BubbleCal left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants