feat: Add shared array and buffer to nanoarrow.h by paleolimbot · Pull Request #864 · apache/arrow-nanoarrow

paleolimbot · 2026-04-06T06:24:49Z

For #861 (actual decoding of dictionary arrays), we really need shared arrays for this to be reasonable (e.g., to not deep copy each dictionary for each batch!).

This PR (1) moves shared buffers to the C library instead of the IPC extension, (2) implements a second kind of shared buffer, which is borrowed from an owned reference-counted array, and (3) implements a "clone" that mostly uses the second concept to explode an array into 100% reference counted buffers that we can clone.

I had hoped we could replace the R and Python versions of these but the logic is fuzzier there because once an array has been referenced we can't mess with it or any of its parents (or it will cause a crash). My mind explodes (and I get a lot of failing R tests cases) whenever I mess with that piece and so I'll try to deal with that a different day.

This still needs tests for the shared array move and clone.

This PR uses the previous steps to output arrays (including the ArrowArrayStream reader). This lets us wire it in to all the tests as well. The main follow up is that this PR currently deep copies the dictionary for every batch that arrives, negating much of the point of dictionary encoding. This is a fairly self-contained change that I'll do separately: #864 I also added dictionary index validation while I was here! It is fairly compact (compared to the other code). Closes #845.

paleolimbot added 9 commits April 5, 2026 21:04

move existing implementation to utils.c

e74d89f

add standalone tests

178a370

test lint

51f86dd

just use ArrowBuffer as the user facing type

ef0759f

shared buffers from arrays

a4f57b4

clone shared

66bb18f

use shared array thinger in r bindings

9e46940

fix issues with initial version

c84c04d

revert change

b6c2269

paleolimbot mentioned this pull request Apr 7, 2026

feat: Actually decode dictionary arrays #861

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add shared array and buffer to nanoarrow.h#864

feat: Add shared array and buffer to nanoarrow.h#864
paleolimbot wants to merge 9 commits intoapache:mainfrom
paleolimbot:shared-array-and-buffer

paleolimbot commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

paleolimbot commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant