Conversation
…rom Arrow streams, which are lazy, to implement table registration, which in the default DataFusion library is not lazy enough.
…(XarrayContext). This leads to two failed tests, but those failures may be caused by errors in the tests themselves.
The previous implementation stored a single Arrow stream that could only be consumed once, causing subsequent queries on the same table to return empty results. This broke filters and aggregations.

Changes:
- Modify the Rust PyArrowStreamPartition to accept a factory function instead of a stream object. The factory is called on each execute() to create a fresh stream, allowing multiple queries on the same table.
- Update LazyArrowStreamTable to take a factory and schema instead of consuming a stream directly.
- Update the Python read_xarray_table to create a factory function that produces fresh XarrayRecordBatchReader instances.
- Update tests to use the new factory-based API via read_xarray_table.

This enables proper lazy evaluation while supporting multiple queries on registered tables, fixing the failing filter and aggregation tests.
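The one-shot-stream failure mode and the factory fix can be illustrated with plain Python iterators. This is a minimal sketch: the class and method names here (OneShotTable, FactoryTable, scan) are illustrative stand-ins, not the actual project API.

```python
# A one-shot stream: once consumed, later "queries" see nothing.
class OneShotTable:
    def __init__(self, stream):
        self._stream = stream  # a single iterator, consumed on first scan

    def scan(self):
        return list(self._stream)

# Factory-based table: each scan calls the factory for a fresh stream.
class FactoryTable:
    def __init__(self, factory):
        self._factory = factory  # zero-arg callable producing a new iterator

    def scan(self):
        return list(self._factory())

batches = [1, 2, 3]

one_shot = OneShotTable(iter(batches))
print(one_shot.scan())  # [1, 2, 3]
print(one_shot.scan())  # [] -- stream exhausted, second query is empty

table = FactoryTable(lambda: iter(batches))
print(table.scan())  # [1, 2, 3]
print(table.scan())  # [1, 2, 3] -- fresh stream per query
```

This is the same reason the Rust side now takes a factory called on each execute() rather than holding one stream.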
XarrayRecordBatchReader already implements __arrow_c_stream__, so there's no need to wrap it in pa.RecordBatchReader.from_stream(). The Rust code can consume it directly via ArrowArrayStreamReader::from_pyarrow_bound.
&self.schema
}

fn execute(&self, _ctx: Arc<TaskContext>) -> SendableRecordBatchStream {
CC: @maximedion2 I could use your expertise and attention here. This is new since the last review.
So, I've checked a couple times today, it seems you've been changing this part of the code a few times? Are you still working on this?
The python <> rust part is confusing me a bit, I still have to wrap my head around grabbing the GIL everywhere, but generally speaking, even without that, I'm not sure I completely follow. So there are 2 concepts here, a partition and a batch. In datafusion, from what I understand, partitions run in parallel, and batches are streamed sequentially for a given partition. For example, if you have 2 folders with 10 parquet files in each, you can spin up 2 partitions, each of which will produce 10 batches. The purpose of partitions is to allow parallelism, and the purpose of batches is to allow work to progress down the execution graph while earlier steps are running, i.e. as long as you don't need to materialize all the data, your first batch can go through operations while the second batch is read, third batch is read, etc...
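The two-folders example above can be sketched in plain Python, assuming nothing about DataFusion itself: partitions run concurrently, while each partition's batches come out in order. The folder and batch names are made up for illustration.

```python
# Sketch of the model described above: partitions run in parallel,
# each partition yields its batches sequentially.
from concurrent.futures import ThreadPoolExecutor

def read_partition(folder, n_batches):
    # Within one partition, batches stream sequentially, in order.
    return [f"{folder}/batch{i}" for i in range(n_batches)]

folders = ["folder_a", "folder_b"]  # 2 folders -> 2 partitions
with ThreadPoolExecutor(max_workers=len(folders)) as pool:
    results = list(pool.map(lambda f: read_partition(f, 10), folders))

assert len(results) == 2                    # one stream per partition
assert all(len(r) == 10 for r in results)   # 10 batches each
```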
Here from what I understand you are producing a single partition, that's the purpose of PartitionStream. That partition will produce multiple batches, sequentially, so I don't think I understand why there's a mutex?
The "state" pattern you are using is what I initially did for my zarr stream, see here https://github.com/datafusion-contrib/arrow-zarr/blob/fa08cf93f369159ce09e843d68ab90fd64a3d3b3/src/zarr_store_opener/zarr_data_stream.rs#L790. It's quite complicated, the state has to hold the futures to keep track of them, etc... Kyle refactored this part to what it looks like now (see this https://github.com/datafusion-contrib/arrow-zarr/blob/e61b8df47fc3dfd1a95b305c82420276b9efab86/src/zarr_store_opener/zarr_data_stream.rs#L757-L789), on the PR we had a discussion but in the end I did like what he did, simple and cleaner approach I think.
I think, given that the batch production logic here is fairly straightforward, that there's probably a more direct way to generate the record batch stream? I'll have to look more into this, like I said I'm still not completely comfortable with the back and forth with python haha.
Hmm, looking more at this, you might not be able to directly convert your reader into a stream because it's a python class, but maybe you can still use the try_stream macro, in which you call "read_next_batch" through the python interface? Your reader already handles returning None when it's done reading, right? So that might be a short and clean approach. Not 100% sure if the calls to the python interpreter would break this, but maybe worth a try.
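The loop being suggested — call read_next_batch() repeatedly until it returns None — looks like this in Python terms (the Rust try_stream version would do the same thing inside the macro body, holding the GIL for each call). FakeReader here is a hypothetical stand-in for the real XarrayRecordBatchReader.

```python
# Python analogue of the suggested try_stream loop: repeatedly call
# read_next_batch() until it returns None. FakeReader is a stand-in
# for the real reader class.
class FakeReader:
    def __init__(self, batches):
        self._batches = list(batches)

    def read_next_batch(self):
        # Returns the next batch, or None once exhausted.
        return self._batches.pop(0) if self._batches else None

def stream_batches(reader):
    while True:
        batch = reader.read_next_batch()
        if batch is None:   # reader signals exhaustion with None
            return
        yield batch

out = list(stream_batches(FakeReader(["b0", "b1", "b2"])))
print(out)  # ['b0', 'b1', 'b2']
```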
Thanks Maxime! I really appreciate your review! Both of these ideas were excellent notes of feedback. Since you explained all this, I did some of my homework and read all the docs in the datafusion crate. Thanks for the first explanation of the parallelism model in Datafusion -- I agree that I'll likely need to create my own execution plan instead of just using a StreamPartition. In any case, the mutex wasn't needed and try_stream makes it much cleaner.
I still need to figure out how to address #106. See the "Parallelism" note at the top of the file, but I can probably address this later:
//! ## Parallel Execution Note
//!
//! When using DataFusion's parallel execution (multiple partitions), aggregation queries
//! without ORDER BY may return partial results due to how our stream interacts with
//! DataFusion's async runtime. To ensure complete results:
//! - Add ORDER BY to aggregation queries, or
//! - Use SessionConfig().with_target_partitions(1) for single-threaded execution
//!
//! TODO(#106): Implement proper parallelism and partition handling.
Ok. I have a plan to fix #106, but it requires changing our python code. I'll merge this for now and make subsequent changes.
Fixes #17, #93.