Skip to content

tiny-count: adding classifier support to sequence-based counting#287

Merged
taimontgomery merged 5 commits intomasterfrom
issue-277
Mar 6, 2023
Merged

tiny-count: adding classifier support to sequence-based counting#287
taimontgomery merged 5 commits intomasterfrom
issue-277

Conversation

@AlexTate
Copy link
Member

@AlexTate AlexTate commented Mar 4, 2023

Counts for individual reference sequences are subset by Classify as... values in sequence-based counting mode. Unit tests have been added for the ReferenceSeqs class.

In unit_tests_hts_parsing.py, I split up the gigantic test suite by target class. This is a good thing but it makes the diff wildly out of control on this PR because I didn't commit the reorganization before adding the ReferenceSeqsTests class.

Additionally, both GFFValidator and SamSqValidator classes now report when validation has finished

Closes #277

AlexTate added 5 commits March 4, 2023 11:57
…that there's a class for SAM_reader, ReferenceFeatures, and CaseInsensitiveAttrs. Also moving the function for running all test suites up to the top of the file for convenience
…dly out of control because I wrote the new test suite before making the previous commit
… format of get_matches() is consistent with that of ReferenceFeatures.get_matches(), though that doesn't factor into the current design. The procedure in add_reference_seq() closely resembles the established procedure used by ReferenceFeatures in _add_subinterval_matches()
@AlexTate AlexTate requested a review from taimontgomery March 4, 2023 21:03
@taimontgomery
Copy link
Collaborator

Reviewed documentation. Tested on ram1 data.

@taimontgomery taimontgomery merged commit d0cc0f5 into master Mar 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

tiny-count: support a sequence-based counting mode when GFF files aren't provided

2 participants