perf: fast rotation for RQ quantization by BubbleCal · Pull Request #6024 · lance-format/lance

BubbleCal · 2026-02-26T11:08:10Z

this introduces a new rotation algorithm which can do rotation in O(dlogd) time (O(d^2) before), and reduces the memory footprint of rotation matrix from d^2 floats to d bits

this doesn't reduce recall, and significantly improves performance:

query performance is 93% faster, and 307% faster than IVF_PQ
indexing is 5.4x faster, and 11.9x faster than IVF_PQ

github-actions · 2026-02-26T11:09:27Z

PR Review: Fast Rotation for RQ Quantization

This PR introduces a new FHT-Kac rotation algorithm for RaBitQ, improving time complexity from O(d²) to O(d log d) and reducing memory from d² floats to d bits. The implementation looks solid overall.

P1 Issues

Potential panic in builder.rs:444-445
```
let input = column
    .as_slice()
    .expect("RabitQ input vectors should be contiguous");
```
Using .expect() here can cause a runtime panic if data isn't contiguous. This should return a proper Result error instead.
Minor: Python documentation typo (python/dataset.py:2866)
Documentation says "Rabit Quantization" but should be "RaBitQ" for consistency with the rest of the codebase.

Observations (not blocking)

The default_rotation_type_compat() returning Matrix for backwards compatibility with older metadata is good.
SIMD AVX2 optimization with scalar fallback is well implemented.
Test coverage for rotation type preservation through optimize operations is appreciated.

The performance improvements are impressive. LGTM pending the panic-to-error change.

…g/rq-fast-rotation-option

codecov · 2026-02-26T12:21:46Z

Codecov Report

❌ Patch coverage is 82.99180% with 83 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance-index/src/vector/bq/builder.rs	81.21%	35 Missing and 2 partials ⚠️
rust/lance-index/src/vector/bq/storage.rs	72.00%	19 Missing and 2 partials ⚠️
rust/lance/src/index/vector.rs	0.00%	14 Missing ⚠️
rust/lance-index/src/vector/bq/rotation.rs	94.73%	5 Missing and 2 partials ⚠️
rust/lance-index/src/vector/bq.rs	87.87%	4 Missing ⚠️

📢 Thoughts on this report? Let us know!

Xuanwo · 2026-02-26T13:28:05Z

+    codes.fill(0);
+    for (bit_idx, value) in rotated.iter().enumerate() {
+        if value.is_sign_positive() {
+            codes[bit_idx / u8::BITS as usize] |= 1u8 << (bit_idx % u8::BITS as usize);


It looks like if dim is not byte-aligned, we could hit panic here, for example, if dim is 129 (is this possible?)

I ask codex to have a repro:

use std::sync::Arc; use arrow::datatypes::Float32Type; use arrow_array::{ArrayRef, FixedSizeListArray, Float32Array}; use arrow_schema::{DataType, Field}; use lance_index::vector::bq::builder::RabitQuantizer; use lance_index::vector::bq::RQRotationType; use lance_index::vector::quantizer::Quantization; fn main() { let dim = 129; // non-byte-aligned dimension let values = Float32Array::from(vec![1.0f32; dim]); let field = Arc::new(Field::new("item", DataType::Float32, true)); let vectors = FixedSizeListArray::try_new(field, dim as i32, Arc::new(values) as ArrayRef, None).unwrap(); // Panic is nondeterministic (depends on random rotation signs), loop to trigger reliably. for i in 0..5000 { let q = RabitQuantizer::new_with_rotation::<Float32Type>(1, dim as i32, RQRotationType::Fast); let _ = q.quantize(&vectors).unwrap(); if i % 1000 == 0 { println!("iter={i}"); } } println!("done"); }

This logic will panic at index out of bounds

good catch
will add requirement of dimension % 8 == 0, i think it's fine because real world vectors are almost always dividable by 8

this introduces a new rotation algorithm which can do rotation in `O(dlogd)` time (`O(d^2)` before), and reduces the memory footprint of rotation matrix from d^2 floats to d bits this doesn't reduce recall, and significantly improves performance: <img width="1116" height="839" alt="image" src="https://github.com/user-attachments/assets/308240b5-e38c-4e2a-aea6-c4171c28687a" /> query performance is 93% faster, and 307% faster than IVF_PQ indexing is 5.4x faster, and 11.9x faster than IVF_PQ

BubbleCal added 4 commits February 21, 2026 14:23

feat(rq): add fast random rotation option

3947987

perf(rq): optimize fast rotation path

3df09d4

Add matrix IVF_RQ optimize test

c998f84

Remove rotation_type parameters

e51d3ac

github-actions Bot added python performance labels Feb 26, 2026

BubbleCal added 2 commits February 26, 2026 19:18

Document rotation algorithms

86ae6b7

fix(ci): resolve clippy and jni rq params failures

a2163ce

github-actions Bot added the java label Feb 26, 2026

Merge branch 'main' of https://github.com/lance-format/lance into yan…

69305f1

…g/rq-fast-rotation-option

Xuanwo reviewed Feb 26, 2026

View reviewed changes

Enforce IVF RQ dim multiple

6e45ae7

Xuanwo approved these changes Feb 26, 2026

View reviewed changes

BubbleCal merged commit 4c81f5d into main Feb 27, 2026
29 checks passed

BubbleCal deleted the yang/rq-fast-rotation-option branch February 27, 2026 06:11

andrea-reale mentioned this pull request Mar 30, 2026

emilk/fix write starvation rerun-io/lance#12

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: fast rotation for RQ quantization#6024

perf: fast rotation for RQ quantization#6024
BubbleCal merged 8 commits intomainfrom
yang/rq-fast-rotation-option

BubbleCal commented Feb 26, 2026

Uh oh!

github-actions Bot commented Feb 26, 2026

Uh oh!

codecov Bot commented Feb 26, 2026 •

edited

Loading

Uh oh!

Xuanwo Feb 26, 2026

Uh oh!

BubbleCal Feb 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

BubbleCal commented Feb 26, 2026

Uh oh!

github-actions Bot commented Feb 26, 2026

PR Review: Fast Rotation for RQ Quantization

P1 Issues

Observations (not blocking)

Uh oh!

codecov Bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Xuanwo Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

BubbleCal Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented Feb 26, 2026 •

edited

Loading