-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
When importing a sample RareLink-CDM Phenopacket, an error is shown for the way we have zygosity and the dna change type encoded:
"genomicInterpretations": [
{
"subjectOrBiosampleId": "1",
"variantInterpretation": {
"variationDescriptor": {
"id": "829137227d3443f2aa73fc94a4f639",
"geneContext": {
"valueId": "HGNC:6932",
"symbol": "MC4R"
},
"expressions": [
{
"syntax": "hgvs",
"value": "NM_005912.3:c.751A>C"
}
],
"structuralType": {
"id": "LOINC:LA6694-9",
"label": "Frameshift"
},
"allelicState": {
"id": "LOINC:LA6706-1",
"label": "Heterozygous"
}
Error for zygosity:
Cell In[5], [line 5](vscode-notebook-cell:?execution_count=5&line=5)
[3](vscode-notebook-cell:?execution_count=5&line=3) cohort_creator = configure_caching_cohort_creator(hpo)
[4](vscode-notebook-cell:?execution_count=5&line=4) pp_dir = '/Users/adam/Documents/git/gpsea-cs/irud_test/phenopackets'
----> [5](vscode-notebook-cell:?execution_count=5&line=5) cohort, qc_results = load_phenopacket_folder(pp_dir, cohort_creator)
File ~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:440, in load_phenopacket_folder(pp_directory, cohort_creator, validation_policy)
[437](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:437) pp_files = _find_phenopacket_files(pp_directory)
[439](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:439) # Map to patients
--> [440](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:440) return load_phenopacket_files(
[441](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:441) pp_files=pp_files,
[442](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:442) cohort_creator=cohort_creator,
[443](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:443) validation_policy=validation_policy,
[444](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:444) )
File ~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:464, in load_phenopacket_files(pp_files, cohort_creator, validation_policy)
[447](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:447) def load_phenopacket_files(
[448](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:448) pp_files: typing.Iterator[str],
[449](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:449) cohort_creator: CohortCreator[Phenopacket],
[450](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:450) validation_policy: typing.Literal["permissive", "lenient", "strict"] = "permissive",
[451](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:451) ) -> typing.Tuple[Cohort, PreprocessingValidationResult]:
[452](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:452) """
[453](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:453) Load phenopacket JSON files, validate the data, and assemble into a :class:`~gpsea.model.Cohort`.
[454](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:454)
...
[73](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:73) return Genotype.HEMIZYGOUS
[74](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:74) else:
---> [75](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:75) raise ValueError(f"Unknown genotype {genotype}")
ValueError: Unknown genotype Heterozygous
-> this is fixed if I simply change the "Heterozygous" to a small capitalised "heterozygous". However, the LOINC label was automatically fetched from Bioportal. Given the Phenopackets also allow LOINC, could its label also be allowed to work?
Error for DNA Change type:
ValueError Traceback (most recent call last)
Cell In[6], [line 5](vscode-notebook-cell:?execution_count=6&line=5)
[3](vscode-notebook-cell:?execution_count=6&line=3) cohort_creator = configure_caching_cohort_creator(hpo)
[4](vscode-notebook-cell:?execution_count=6&line=4) pp_dir = '/Users/adam/Documents/git/gpsea-cs/irud_test/phenopackets'
----> [5](vscode-notebook-cell:?execution_count=6&line=5) cohort, qc_results = load_phenopacket_folder(pp_dir, cohort_creator)
File ~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:440, in load_phenopacket_folder(pp_directory, cohort_creator, validation_policy)
[437](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:437) pp_files = _find_phenopacket_files(pp_directory)
[439](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:439) # Map to patients
--> [440](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:440) return load_phenopacket_files(
[441](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:441) pp_files=pp_files,
[442](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:442) cohort_creator=cohort_creator,
[443](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:443) validation_policy=validation_policy,
[444](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:444) )
File ~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:464, in load_phenopacket_files(pp_files, cohort_creator, validation_policy)
[447](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:447) def load_phenopacket_files(
[448](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:448) pp_files: typing.Iterator[str],
[449](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:449) cohort_creator: CohortCreator[Phenopacket],
[450](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:450) validation_policy: typing.Literal["permissive", "lenient", "strict"] = "permissive",
[451](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:451) ) -> typing.Tuple[Cohort, PreprocessingValidationResult]:
[452](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:452) """
[453](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:453) Load phenopacket JSON files, validate the data, and assemble into a :class:`~gpsea.model.Cohort`.
[454](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_config.py:454)
...
[819](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:819) return None
[820](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:820) else:
--> [821](https://file+.vscode-resource.vscode-cdn.net/Users/adam/Documents/git/gpsea-cs/irud_test/~/Documents/git/gpsea-cs/gpcvenv/lib/python3.12/site-packages/gpsea/preprocessing/_phenopacket.py:821) raise ValueError(f"Unknown structural type {structural_type}")
ValueError: Unknown structural type LOINC:LA6694-9
Same here, why are LOINC values not allowed, although possible in Phenopackets themselves?
Metadata
Metadata
Assignees
Labels
No labels