@nealrichardson nealrichardson commented Aug 31, 2022

Approach:

  • register_binding takes an additional optional argument, notes, where you can list any limitations or differences in behavior between the Arrow version and the R function
  • These notes are put in the .cache environment when the nse_funcs are built.
  • New script data-raw/docgen.R that reads arrow:::.cache$docs and writes out dplyr-funcs-docs.R containing roxygen.
  • Similarly, we pull the dplyr functions we s3_register and add them to the generated docs. Unfortunately, the notes about feature limitations aren't easily kept alongside the functions themselves because of how they're registered on load. The approach here creates a list in arrow-package.R, where the .onLoad() happens, and notes go there.
  • Docs and crossreferences are generated by roxygen2 as usual.
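
As a rough illustration of the flow described above, here is a minimal, self-contained sketch. It is not the code in this PR: `.cache` is a toy environment, `register_binding()` is a simplified stand-in that only records notes, and `render_docs()` is a hypothetical helper standing in for what `data-raw/docgen.R` does. The note text is made up for the example.

```r
# Toy stand-in for the package-level cache (assumption: the real .cache
# environment in arrow holds more than just docs).
.cache <- new.env(parent = emptyenv())
.cache$docs <- list()

# Simplified, hypothetical register_binding(): it only records the notes
# under the qualified function name. The real function also registers `fun`.
register_binding <- function(fun_name, fun, notes = character(0)) {
  .cache$docs[[fun_name]] <- notes
  invisible(fun)
}

register_binding("base::nchar", function(x) x)
register_binding(
  "stringr::str_detect",
  function(string, pattern) string,
  notes = "Example note: describe any limitation here."
)

# Hypothetical doc generator: emits roxygen comment lines, one bullet per
# binding, grouped by package, with the notes appended when present.
render_docs <- function(docs) {
  pkgs <- sub("::.*$", "", names(docs))
  out <- character(0)
  for (pkg in sort(unique(pkgs))) {
    out <- c(out, paste0("#' ## ", pkg), "#'")
    for (nm in sort(names(docs)[pkgs == pkg])) {
      notes <- paste(docs[[nm]], collapse = " ")
      line <- paste0("#' * [", nm, "()]")
      if (nzchar(notes)) line <- paste0(line, ": ", notes)
      out <- c(out, line)
    }
  }
  out
}

doc_lines <- render_docs(.cache$docs)
writeLines(doc_lines)
```

Keeping the notes next to the `register_binding()` call means the limitation is documented where the binding is defined, and the generated roxygen file never needs hand editing.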

I deferred filling in all of the function notes. See followup JIRAs on ARROW-17665.

@nealrichardson
Member Author

Current status: check fails with

  Missing link or links in documentation object 'arrow-dplyr-functions.Rd':
    ‘add_filename’ ‘cast’ ‘[stringr:str_like]{str_like}’

So I need to add docs for add_filename and cast, and add special-casing for str_like, since it has not been released yet.
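
One possible shape for that special-casing, as a sketch only: `doc_link()` and `no_link` are hypothetical names, and how the docgen script actually handles this is not shown in this thread. The idea is to emit plain code text instead of a cross-reference for functions whose help topic cannot be resolved yet.

```r
# Functions that should not become cross-references for now (assumption:
# only str_like needs this; add_filename and cast get their own docs).
no_link <- c("stringr::str_like")

# Hypothetical helper: returns a roxygen markdown link for a qualified
# function name, or plain code text when the name is in `skip`.
doc_link <- function(qualified_name, skip = no_link) {
  fun <- sub("^.*::", "", qualified_name)
  if (qualified_name %in% skip) {
    paste0("`", fun, "()`")
  } else {
    paste0("[", qualified_name, "()]")
  }
}

links <- vapply(
  c("stringr::str_detect", "stringr::str_like"),
  doc_link,
  character(1),
  USE.NAMES = FALSE
)
print(links)
```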

I will also need to rebase and add special handling for dplyr::across(), since it is not handled as an nse_func.

@nealrichardson nealrichardson marked this pull request as ready for review September 1, 2022 17:00
@nealrichardson
Member Author

@github-actions crossbow submit -p pr_number=14014 preview-docs

@nealrichardson
Member Author

This is ready for review. Feel free to wordsmith the doc page and bike-shed the Rd name (maybe acero? It should be something convenient and natural to type as ?name to view the page).

github-actions bot commented Sep 9, 2022

Revision: 0849160cae0a806b89040fdc152b6c51cd5f0f5f

Submitted crossbow builds: ursacomputing/crossbow @ actions-84482a9513

Task Status
preview-docs Github Actions

@nealrichardson nealrichardson merged commit 93626ee into apache:master Sep 16, 2022
@nealrichardson nealrichardson deleted the fun-doc branch September 16, 2022 01:39
ursabot commented Sep 16, 2022

Benchmark runs are scheduled for baseline = 2e72e0a and contender = 93626ee. 93626ee is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
Conbench compare runs links:
[Finished ⬇️0.0% ⬆️0.0%] ec2-t3-xlarge-us-east-2
[Failed ⬇️0.24% ⬆️0.03%] test-mac-arm
[Failed ⬇️0.28% ⬆️0.0%] ursa-i9-9960x
[Finished ⬇️1.31% ⬆️0.04%] ursa-thinkcentre-m75q
Buildkite builds:
[Finished] 93626eeb ec2-t3-xlarge-us-east-2
[Failed] 93626eeb test-mac-arm
[Failed] 93626eeb ursa-i9-9960x
[Finished] 93626eeb ursa-thinkcentre-m75q
[Finished] 2e72e0a8 ec2-t3-xlarge-us-east-2
[Finished] 2e72e0a8 test-mac-arm
[Failed] 2e72e0a8 ursa-i9-9960x
[Finished] 2e72e0a8 ursa-thinkcentre-m75q
Supported benchmarks:
ec2-t3-xlarge-us-east-2: Supported benchmark langs: Python, R. Runs only benchmarks with cloud = True
test-mac-arm: Supported benchmark langs: C++, Python, R
ursa-i9-9960x: Supported benchmark langs: Python, R, JavaScript
ursa-thinkcentre-m75q: Supported benchmark langs: C++, Java

zagto pushed a commit to zagto/arrow that referenced this pull request Oct 7, 2022
…pache#14014)

Authored-by: Neal Richardson <neal.p.richardson@gmail.com>
Signed-off-by: Neal Richardson <neal.p.richardson@gmail.com>