-
Notifications
You must be signed in to change notification settings - Fork 3
Add confusion matrix notebook #8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add confusion matrix notebook #8
Conversation
* finish download module changes * download notebook * rerun split data module * rerun download module * rerun train_model * rerun evaluation module * rerun interpretation module * combine datasets * combine datasets * split changes * update format * format update * format * finish split data * combine datasets, remove holdout * formatting * rerun pipelines * remove folded class * rerun pipeline * Update utils/download_utils.py Co-authored-by: Dave Bunten <ekgto445@gmail.com> * PR fixes * module docstrings Co-authored-by: Dave Bunten <ekgto445@gmail.com>
d33bs
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work! I left a few comments with this review - please don't hesitate to reach out if you have any questions.
gwaybio
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
| return fig, PR_data | ||
|
|
||
|
|
||
| def model_cm( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you rename this model_confusion_matrix() it is clearer than cm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep! Completed in #8 for simplicity.
This PR is the first of many to restructure the evaluate model module and output the evaluations in tidy long format for later figure generation.
In this PR, a separate notebook is created to display the confusion matrices for the final and shuffled baseline models' predictions for the train and test datasets. These confusion matrices are converted to tidy long format and saved in confusion matrices.