Skip to content

Conversation

@Dandandan
Copy link
Contributor

@Dandandan Dandandan commented Apr 17, 2021

This reproduces the test failure for the TopKExec test that @andygrove was seeing in combination with a 24-core machine.

Also FYI @alamb

(Marked as ready for review to trigger CI)

@Dandandan Dandandan marked this pull request as ready for review April 17, 2021 11:29
@github-actions
Copy link

Thanks for opening a pull request!

If this is not a minor PR. Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW

Opening JIRAs ahead of time contributes to the Openness of the Apache Arrow project.

Then could you also rename pull request title in the following format?

ARROW-${JIRA_ID}: [${COMPONENT}] ${SUMMARY}

or

MINOR: [${COMPONENT}] ${SUMMARY}

See also:

Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice

@Dandandan
Copy link
Contributor Author

@alamb @andygrove I pushed a version with a fix (I believe) for the TopKExec example. Not sure if the original version with try_fold could also be fixed to work, but this works at least. It now has a .clone on the map which is not a big deal for small k but probably would be better to avoid that.

@andygrove
Copy link
Member

Thanks @Dandandan 🚀

So if the issue was just in the test, I should change my vote on the release to a +1?

@codecov-commenter
Copy link

codecov-commenter commented Apr 17, 2021

Codecov Report

Attention: Patch coverage is 83.33333% with 2 lines in your changes missing coverage. Please review.

Project coverage is 78.92%. Comparing base (9c1e5bd) to head (7d058ea).

Files Patch % Lines
rust/datafusion/tests/user_defined_plan.rs 83.33% 2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master   #10077      +/-   ##
==========================================
+ Coverage   78.90%   78.92%   +0.01%     
==========================================
  Files         286      286              
  Lines       64717    64713       -4     
==========================================
+ Hits        51068    51075       +7     
+ Misses      13649    13638      -11     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@Dandandan
Copy link
Contributor Author

@andygrove as far as I can see the issue was in the test and faulty TopKExec implementation only, not really in repartion or the optimization of the repartion node.

@andygrove
Copy link
Member

andygrove commented Apr 17, 2021 via email

@Dandandan
Copy link
Contributor Author

I probably won't get to that anymore today. Thanks, would be great

@andygrove andygrove changed the title [Rust][DataFusion] Reproduce topkexec failure ARROW-12421: [Rust] [DataFusion] Fix topkexec failure Apr 17, 2021
@github-actions
Copy link

@andygrove andygrove closed this in 27c4fa2 Apr 17, 2021
@alamb
Copy link
Contributor

alamb commented Apr 18, 2021

Thanks @Dandandan

@andygrove as far as I can see the issue was in the test and faulty TopKExec implementation only, not really in repartion or the optimization of the repartion node.

For the record, the TopKExec is an example (not part of the DataFusion codebase)

@Dandandan
Copy link
Contributor Author

Thanks @alamb @andygrove .

It would still be valuable to see if we can improve the example, or even add it to DataFusion once we have a good implementation (without the extension stuff of course)!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants