perf: use CPU pool to run WAND algo #5363
Conversation
Signed-off-by: BubbleCal <bubble-cal@outlook.com>
💡 Codex Review
Here are some automated review suggestions for this pull request.
```rust
let partition_ptr = PartitionPtr::new(self);
let (candidates, local_metrics) = spawn_cpu(move || {
    let local_metrics = LocalMetricsCollector::default();
    // SAFETY: `partition_ptr` points to `self`, which outlives this task because we await it.
    let partition = unsafe { partition_ptr.deref() };
```
Keep partition alive for spawn_cpu WAND task
The WAND search is now offloaded to `spawn_cpu` via a raw `PartitionPtr` to `self`, but the background CPU task is not tied to the async future's cancellation. If the `bm25_search` future is dropped (e.g., on request cancellation) while the index is concurrently torn down, the blocking task keeps running and dereferences a pointer to a freed `InvertedPartition`, leading to a potential use-after-free/UB. Consider holding an `Arc<InvertedPartition>` inside the closure, or otherwise ensuring the partition outlives the spawned CPU job under cancellation.
```rust
let partition_ptr = PartitionPtr::new(self);
let (candidates, local_metrics) = spawn_cpu(move || {
    let local_metrics = LocalMetricsCollector::default();
    // SAFETY: `partition_ptr` points to `self`, which outlives this task because we await it.
```
Can we avoid this? The safety invariant can't be guaranteed, since the future itself could be dropped at any time.
Could you refactor this function instead? Move the postings calculation out and await it separately, since `bm25_search` itself is just a pure blocking function.
This reduces 10%~20% cold latency for full text search.