
fix: ensure I/O cancels correctly when scan is dropped#5129

Merged
westonpace merged 2 commits into lance-format:main from westonpace:fix/avoid-io-deadlock-dropped-scan
Nov 4, 2025

Conversation

Member

@westonpace westonpace commented Nov 3, 2025

Previously the scheduler took a "poison the well" approach: when the scheduler was dropped, it would poison any remaining tasks so they returned an error.

Unfortunately, this approach is not easily plugged into DataFusion, which does not have any kind of asynchronous cancellation of the stream.

Instead, DataFusion encourages an "abort all tasks" approach when the stream is cancelled. This PR migrates things to the "abort all tasks" approach. It also uses SpawnedTask::spawn in filtered_read instead of tokio::task::spawn to create abort-on-drop, fire-and-forget tasks.

In addition, this PR connects the io_buffer_size property to the filtered read. This scanner property was previously being ignored; I needed it for a unit test.

@github-actions github-actions Bot added the bug Something isn't working label Nov 3, 2025

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.


Comment on lines +8418 to +8429
let start = Instant::now();
while start.elapsed() < Duration::from_secs(10) {
    if runtime.handle().metrics().num_alive_tasks() == 0 {
        break;
    }
    std::thread::sleep(Duration::from_millis(100));
}

assert!(
    runtime.handle().metrics().num_alive_tasks() == 0,
    "Tasks should have finished within 10 seconds but there are still {} tasks running",
    runtime.handle().metrics().num_alive_tasks()
);

P0: Unstable tokio metrics API breaks build

The new test uses runtime.handle().metrics().num_alive_tasks() to wait for tasks to finish. Handle::metrics is gated behind the tokio_unstable configuration flag, and the workspace Cargo.toml only enables rt-multi-thread, macros, fs, and sync for tokio. Because tokio_unstable is not enabled, this code will not compile, causing the entire test suite to fail to build. The metrics-based polling should either be conditional on tokio_unstable or replaced with a stable mechanism for detecting task completion.


Member Author


Some metrics are unstable. The ones in use here are stable.

Contributor

@wjones127 wjones127 left a comment


Great to see this!

@codecov-commenter

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.84%. Comparing base (6d43982) to head (779cb2b).
⚠️ Report is 16 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5129      +/-   ##
==========================================
+ Coverage   81.73%   81.84%   +0.11%     
==========================================
  Files         340      341       +1     
  Lines      138875   140539    +1664     
  Branches   138875   140539    +1664     
==========================================
+ Hits       113503   115028    +1525     
- Misses      21631    21706      +75     
- Partials     3741     3805      +64     
Flag        Coverage            Δ
unittests   81.84% <100.00%>    (+0.11%) ⬆️


@westonpace westonpace merged commit 2341378 into lance-format:main Nov 4, 2025
26 of 27 checks passed
jackye1995 pushed a commit to jackye1995/lance that referenced this pull request Jan 21, 2026

Labels

bug Something isn't working


3 participants