ARROW-15067: [C++] Add tracing spans to the scanner #12328
Thanks for opening a pull request! If this is not a minor PR, could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Opening JIRAs ahead of time contributes to the Openness of the Apache Arrow project. Then could you also rename the pull request title in the following format? or See also:
This can be closed in favor of the other PR, right?
Continuing #12328 and #11964.

The tracing spans were not propagated through all the asynchronous constructs, causing some spans to become disconnected from the trace. This PR aims to address this.

Some things left to do:
- [x] Possibly add some attributes to the `read_column` span
- [x] fix parent/sibling relationships (some of the new spans should probably become a child)
- [x] Do something about all the `#ifdefs`
- [x] Wrap around a `Future`
- [x] Wrap `Executor`
- [x] Check if tracing now works properly for all of the file types, not just parquet
- [x] lidavidm mentioned some memory leaks that should be investigated
- [x] The `FragmentToBatches` span seems to be active way too long

Closes #12609 from joosthooz/arrow-15067

Lead-authored-by: Joost Hoozemans <joosthooz@msn.com>
Co-authored-by: David Li <li.davidm96@gmail.com>
Co-authored-by: Matthijs Brobbel <m1brobbel@gmail.com>
Signed-off-by: David Li <li.davidm96@gmail.com>
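
For readers unfamiliar with the problem the commit message describes, here is a minimal sketch of why a span gets disconnected when work is handed to an executor, and how wrapping the executor can restore it. This is not the code from this PR and uses neither Arrow nor OpenTelemetry APIs: `g_active_span`, `ScopedSpan`, `DeferredExecutor`, `TracedExecutor`, and `ObservedSpans` are all hypothetical names invented for illustration, and the "span" is just a string.

```cpp
#include <cassert>
#include <deque>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a tracer's "currently active span" slot.
// An empty string means "no active span".
thread_local std::string g_active_span;

// RAII guard: activates `span` for the current scope and restores the
// previously active span on destruction.
class ScopedSpan {
 public:
  explicit ScopedSpan(std::string span) : previous_(g_active_span) {
    g_active_span = std::move(span);
  }
  ~ScopedSpan() { g_active_span = std::move(previous_); }
  ScopedSpan(const ScopedSpan&) = delete;
  ScopedSpan& operator=(const ScopedSpan&) = delete;

 private:
  std::string previous_;
};

// Hypothetical executor: tasks are queued and run later, after the scope
// that submitted them may already have ended.
class DeferredExecutor {
 public:
  void Submit(std::function<void()> task) { queue_.push_back(std::move(task)); }

  void RunAll() {
    while (!queue_.empty()) {
      std::function<void()> task = std::move(queue_.front());
      queue_.pop_front();
      task();
    }
  }

 private:
  std::deque<std::function<void()>> queue_;
};

// Hypothetical wrapper: captures the span active at submission time and
// re-activates it around the task when the task finally runs.
class TracedExecutor {
 public:
  explicit TracedExecutor(DeferredExecutor* inner) : inner_(inner) {
    assert(inner != nullptr);
  }

  void Submit(std::function<void()> task) {
    inner_->Submit([span = g_active_span, task = std::move(task)] {
      ScopedSpan scope(span);
      task();
    });
  }

 private:
  DeferredExecutor* inner_;
};

// Submits two tasks from inside a "scan" span, one directly and one through
// the wrapper, and returns the span name each task observed when it ran.
std::vector<std::string> ObservedSpans() {
  DeferredExecutor executor;
  TracedExecutor traced(&executor);
  std::vector<std::string> seen;
  {
    ScopedSpan scan("scan");
    executor.Submit([&seen] { seen.push_back(g_active_span); });
    traced.Submit([&seen] { seen.push_back(g_active_span); });
  }  // the "scan" span is no longer active past this point
  executor.RunAll();
  return seen;
}
```

In this sketch the task submitted directly runs with no active span, while the task submitted through `TracedExecutor` sees `scan` again. A real tracer would capture a span context object rather than a string, and the same capture-and-restore idea would apply to continuations attached to a `Future`.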
Continuing the work of @lidavidm in #11964 in this PR. Rebased on master.