Skip to content

Add additional datasets to improve coverage#1441

Merged
rapids-bot[bot] merged 30 commits intorapidsai:branch-0.19from
jnke2016:branch-0.19-add_datasets_coverage
Mar 18, 2021
Merged

Add additional datasets to improve coverage#1441
rapids-bot[bot] merged 30 commits intorapidsai:branch-0.19from
jnke2016:branch-0.19-add_datasets_coverage

Conversation

@jnke2016
Copy link
Copy Markdown
Contributor

@jnke2016 jnke2016 commented Mar 7, 2021

add datasets to test self-loops, string vertex IDs and isolated vertices
Update README
closes #1214

jnke2016 and others added 16 commits March 7, 2021 16:22
add datasets to test self-loops, string vertex IDs and isolated vertices 
Update README 
closes rapidsai#1214
add datasets to test self-loops, string vertex IDs and isolated vertices 
Update README 
closes rapidsai#1214
add datasets to test self-loops, string vertex IDs and isolated vertices 
Update README
closes rapidsai#1214
add datasets to test self-loops, string vertex IDs and isolated vertices
Update README
closes rapidsai#1214
add datasets to test self-loops, string vertex IDs and isolated vertices
Update README
closes rapidsai#1214
@jnke2016 jnke2016 requested review from a team as code owners March 7, 2021 22:29
@jnke2016
Copy link
Copy Markdown
Contributor Author

jnke2016 commented Mar 7, 2021

One of the files (out of 6) I uploaded is above the 5242880 limit (the file is 5.3 MB). Should I just delete that file or include the file and update the download script ?

@BradReesWork BradReesWork added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Mar 9, 2021
@BradReesWork BradReesWork added this to the 0.19 milestone Mar 9, 2021
@BradReesWork
Copy link
Copy Markdown
Member

@jnke2016 this explodes the size of data in the repo which affects packaging size. Should some of these files be moved to the S3 drive and downloaded like we do for scale testing?

Is simply modifying the smaller karate dataset not enough?

@BradReesWork BradReesWork changed the title Branch 0.19 add datasets coverage Add additiomal datasets to improve coverage Mar 9, 2021
@BradReesWork BradReesWork changed the title Add additiomal datasets to improve coverage Add additional datasets to improve coverage Mar 9, 2021
Copy link
Copy Markdown
Member

@BradReesWork BradReesWork left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the large files need to be removed. you can add them to the S3 store

Comment thread datasets/README.md Outdated



## Tests coverage
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is a reference section under docs where we list all the test files used. Adding info on the files is nice

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In this PR, I updated datasets/README.md to reference and describe the new datasets. But after discussing with Rick, I instead modified the existing datasets (karate, dolphins) since they are much smaller, therefore easier to test. I will be uploading the new datasets in this PR

Copy link
Copy Markdown
Member

@BradReesWork BradReesWork left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

large datasets need to be removed

jnke2016 and others added 13 commits March 11, 2021 22:41
added datasets to test self-loops, isolated vertices, string vertex IDs, multi-edges
Updated download script accordingly
Updated README accordingly
closes rapidsai#1214
Added datasets to test self-loops, isolated vertices, string vertex IDs, multi-edges
Updated download script accordingly
Updated README accordingly
closes rapidsai#1214
Added datasets to test self-loops, isolated vertices, string vertex IDs, multi-edges
Updated download script accordingly
Updated README accordingly
closes rapidsai#1214
Added datasets to test self-loops, isolated vertices, string vertex IDs, multi-edges
Updated download script accordingly
Updated README accordingly
closes rapidsai#1214
@codecov-io
Copy link
Copy Markdown

codecov-io commented Mar 12, 2021

Codecov Report

Merging #1441 (be40a67) into branch-0.19 (4535396) will increase coverage by 0.05%.
The diff coverage is n/a.

Impacted file tree graph

@@               Coverage Diff               @@
##           branch-0.19    #1441      +/-   ##
===============================================
+ Coverage        60.72%   60.78%   +0.05%     
===============================================
  Files               70       70              
  Lines             3132     3134       +2     
===============================================
+ Hits              1902     1905       +3     
+ Misses            1230     1229       -1     
Impacted Files Coverage Δ
python/cugraph/_version.py 44.80% <0.00%> (+0.39%) ⬆️
python/cugraph/utilities/utils.py 72.44% <0.00%> (+0.78%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4535396...be40a67. Read the comment docs.

Copy link
Copy Markdown
Contributor

@rlratzel rlratzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, I just have a few minor requests.

Comment thread datasets/README.md
Comment thread datasets/README.md Outdated
Comment thread datasets/README.md Outdated
Comment thread datasets/README.md Outdated
added datasets to test self-loops, isolated vertices, string vertex IDs, multi-edges
Updated download script accordingly
Updated README accordingly
closes rapidsai#1214
@BradReesWork
Copy link
Copy Markdown
Member

@gpucibot merge

@rapids-bot rapids-bot Bot merged commit ab4b77b into rapidsai:branch-0.19 Mar 18, 2021
@BradReesWork BradReesWork linked an issue Mar 29, 2021 that may be closed by this pull request
@jnke2016 jnke2016 deleted the branch-0.19-add_datasets_coverage branch September 24, 2022 23:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

improvement Improvement / enhancement to an existing function non-breaking Non-breaking change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[ENH] Add additional datasets for increased test coverage Test with isolated vertices and self loops

4 participants