Add sort_actor to cudf-polars + rapidsmpf #21690
rapids-bot[bot] merged 70 commits into rapidsai:main
Conversation
```python
)
if sort_ir.stable:
    nrows = df.table.num_rows()
    base = seq_num * (1 << 32)
```
This is incorrect in a multi-rank setting. I think what you're trying to do is give each row in the (concatenated) stream of local table chunks a unique id.
`seq_num * (1 << 32)` is a good offset for this because, by construction, no sequence numbers from chunks on the same rank can overlap.
But, it seems unnecessary (you have the number of rows when adding the column, so just keep track of that locally).
However, what this doesn't do is ensure that two different ranks have different sequence numbers. For that, you need to incorporate the rank in the high bits of the id, such that when you sort globally, rows from rank-1 (say) come after rows from rank-0 if their keys are equal.
Okay, thanks for explaining - I updated the logic here (hopefully in line with your suggestion).
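The suggestion above can be sketched in plain Python (illustrative only; all names here are hypothetical, not the actual implementation):

```python
# Hedged sketch: assigning globally unique, order-preserving row ids
# across ranks, as the review suggests.
def make_row_ids(rank: int, rows_seen_so_far: int, num_rows: int) -> list[int]:
    # Put the rank in the high bits so that, when sort keys tie, rows
    # from rank 0 sort before rows from rank 1, and so on.
    # The low bits carry a running local row count, which is already
    # unique per rank without a per-chunk seq_num offset.
    base = (rank << 48) | rows_seen_so_far
    return [base + i for i in range(num_rows)]
```

The shift width (48 here) is an arbitrary choice for the sketch; it only needs to leave enough low bits for the local row count.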
```python
df = DataFrame.from_table(
    out_table,
    cast(Sequence[str], column_names_list),
    dtypes_list,
    stream,
)
sort_order = [
    list(column_order)[by.index(n)]
    if n in by
    else plc.types.Order.ASCENDING
    for n in column_names_list
]
nulls = [
    list(null_order)[by.index(n)]
    if n in by
    else plc.types.NullOrder.AFTER
    for n in column_names_list
]
sorted_tbl = plc.sorting.sort(
    df.table, sort_order, nulls, stream=stream
)
out_table = plc.Table(sorted_tbl.columns()[:-1])
```
This sort is wrong, because it sorts by all the columns in the table, whereas you only want to sort by the key columns (and the disambiguating seq_id column).
You want to be using `sort_by_key`.
Makes sense - I decided to construct a `Sort` so I can use `do_evaluate`, but I can move to an explicit `sort_by_key` if you'd prefer.
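The distinction the reviewer is pointing at can be shown in plain Python (not pylibcudf): sorting by the whole row also compares payload columns, whereas a sort-by-key only inspects the key columns and, if stable, leaves tied rows in their existing order.

```python
# Two rows with equal keys (columns 0 and 1) but different payloads.
rows = [("a", 1, "z"), ("a", 1, "x")]

# Sorting by the whole row lets the payload "x"/"z" break the tie:
sorted_all = sorted(rows)

# A stable sort by key columns only keeps tied rows in input order,
# which is what sort_by_key-style semantics give you:
sorted_by_key = sorted(rows, key=lambda r: (r[0], r[1]))
```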
```python
by: list[str],
by_dtypes: list[DataType],
```
These arguments are only used to construct the schema of the "empty" table for the case that the local candidates are empty, which is then used to provide the schema for `chunk_to_frame`.
Two things:
- we only need the schema for all of those operations, so we should redo the utilities to take the schema.
- Let's construct the empty table outside if `local_candidates` is empty.
So this function becomes:
```python
async def _compute_sort_boundaries(
    context: Context,
    comm: Communicator,
    ir_context: IRExecutionContext,
    local_candidates: list[TableChunk],
    schema: Schema,
    num_partitions: int,
    column_order: list[plc.types.Order],
    null_order: list[plc.types.NullOrder],
    allgather_id: int,
) -> plc.Table:
    stream = ...
    boundaries = _get_final_sort_boundaries(
        chunk_to_frame(
            await concat_batch(local_candidates, context, schema, ir_context)
        )
    )
    if comm.nranks > 1:
        ...
        boundaries = _get_final_sort_boundaries(
            ...
        )
    return boundaries.table
```
Okay, yeah - I revised this a bit.
mroeschke left a comment
Just some non-blocking ideas/questions around sortedness:
- Are there any simplifications to be made to computing sort boundaries if the incoming data is already sorted?
- When processing the resulting message of the sorted TableChunk, is there a practical way to construct the cudf_polars DataFrame with sorted metadata, i.e. set the `is_sorted` flag on the Columns of the cudf_polars DataFrame? Maybe the message sends metadata with that sorted information?
The sort-boundary calculation requires the incoming chunks to be sorted. We guarantee this at lowering time by ensuring the child of a ShuffleSorted node is always a Sort (which is executed chunk-wise). In the future, we should have metadata like rapidsai/rapidsmpf#853 to skip those local sort operations when the data is already sorted.
I'm hoping to use rapidsai/rapidsmpf#853 to track this kind of information in the ChannelMetadata.
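Why the boundary calculation needs sorted input can be sketched in plain Python (hypothetical helper, not the actual `_get_final_sort_boundaries`): splitters are taken as evenly spaced order statistics of the local keys, which are only valid if the keys are already sorted.

```python
# Hedged sketch: pick one splitter per internal partition edge
# (num_partitions - 1 splitters total) from an already-sorted chunk.
def candidate_boundaries(sorted_keys: list[int], num_partitions: int) -> list[int]:
    n = len(sorted_keys)
    # If sorted_keys were unsorted, these positions would not be
    # meaningful order statistics and the partitioning would be skewed.
    return [sorted_keys[(i * n) // num_partitions] for i in range(1, num_partitions)]
```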
wence- left a comment
Tiny changes, but I think the core logic looks good.
/merge
Should close #21824

See test case for repro, but basically if you had

```python
df = pl.LazyFrame({"a": [1, 2, 3]})
# Create two filters, both of which will give empty results
q = pl.concat([df.filter(pl.col("a") == 0), df.filter(pl.col("a") == 4)]).sort("a")
```

then `q.collect(engine="gpu")` would give a CUDA exception because we'd try to do an out-of-bounds access on an empty table.

edit: turns out #21690 fixed this issue. This PR now only contributes a test case.

Authors:
- J Berg (https://github.com/jberg5)

Approvers:
- Matthew Roeschke (https://github.com/mroeschke)

URL: #21825
Description
Closes #20486
Depends on rapidsai/rapidsmpf#891 (or similar)
Adds `sort_actor` for the "rapidsmpf" runtime.