Skip to content

Feature/msk ucsf ingestion#9

Open
KociOrges wants to merge 23 commits into
feature/submission-envelop-uuid-for-upload-areas-morphic-redesign-utilfrom
feature/msk-ucsf-ingestion
Open

Feature/msk ucsf ingestion#9
KociOrges wants to merge 23 commits into
feature/submission-envelop-uuid-for-upload-areas-morphic-redesign-utilfrom
feature/msk-ucsf-ingestion

Conversation

@KociOrges
Copy link
Copy Markdown

Description:
This pull request improves our dataset ingestion workflow by refactoring the linking logic for different dataset contexts (MSK, UCSF, and legacy JAX). The key changes include:

MSK-Specific Enhancements
Supports many-to-one relationships between differentiated cell lines and a shared library preparation.

New Context Support:
A new --context command‑line argument is introduced (e.g., --context unperturbed_multiple) so that the ingestion process can use a different linking strategy for UCSF datasets while preserving legacy behaviour for MSK and JAX when the context is not specified.

Refactored Linking:
The previously monolithic establish_links function is split into smaller helper methods (_link_cell_lines_to_children, _process_library_preparations, and _link_sequencing_files) to improve maintainability.

Enhanced Robustness:
A new helper, request_with_retries, has been added to implement exponential backoff for HTTP requests, improving resilience against transient connection errors.

Related tickets #126, #61

@KociOrges KociOrges self-assigned this Mar 27, 2025
@KociOrges KociOrges added the enhancement New feature or request label Mar 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants