Feature/filtering qc metrics highly variable genes by Oykupnrbs · Pull Request #8 · IEEE-Ege/SingleCellWebApp

Oykupnrbs · 2025-04-06T14:09:57Z

Pull Request Description:

This pull request adds three simple and reusable functions for preprocessing single-cell RNA-seq data with Scanpy.

What’s inside:

filter_data(): Removes cells with too few genes, and genes found in too few cells.
calculate_qc_metrics(): Finds mitochondrial genes and calculates quality control values.
select_highly_variable_genes(): Picks genes that are most variable for further analysis.

Demo Code:

Creates random fake data to test the functions.
Adds example mitochondrial genes to test QC.
Runs all three functions step by step with print statements to see results.

Why this is useful:

Code is now easier to read and use in other projects.
The demo shows how everything works in a simple way.

Demo Output:

…endi.

…election - Added filter_data() to filter out low-quality cells and genes - Added calculate_qc_metrics() to compute mitochondrial gene percentages and other QC stats - Added select_highly_variable_genes() to identify informative genes for downstream analysis These reusable functions improve code clarity and support scalable single-cell preprocessing workflows.

TRextabat

that is great work Thx @Oykupnrbs @Ekin-hub-code

TRextabat · 2025-04-09T16:36:23Z

src/modules/preprocessing.py

+def calculate_qc_metrics(adata: AnnData) -> AnnData:
+    """
+    Computes quality control (QC) metrics and adds them to the dataset.
+    Checks for mitochondrial genes with both uppercase and lowercase 'mt-' prefix.
+
+    Args:
+        adata (AnnData): The filtered dataset.
+
+    Returns:
+        AnnData: Dataset with QC metrics added.
+    """
+    if adata.var_names.str.startswith("MT-").any():
+        adata.var["mt"] = adata.var_names.str.startswith("MT-")
+    elif adata.var_names.str.startswith("mt-").any():
+        adata.var["mt"] = adata.var_names.str.startswith("mt-")
+    else:
+        # In case neither is found, check in a case-insensitive way just to be sure
+        adata.var["mt"] = adata.var_names.str.upper().str.startswith("MT-")
+
+    sc.pp.calculate_qc_metrics(
+        adata, 
+        qc_vars=["mt"], 
+        percent_top=None, 
+        log1p=False, 
+        inplace=True
+    )


you could add mt and MT as argument in future maybe we want to avoid mt filtration

Oykupnrbs and others added 9 commits April 2, 2025 21:18

preprocess filtering, qc_metrics, highly_variable_genes kısmı güncell…

04ef841

…endi.

Add files via upload

b79bb4b

preproccesing2-demo.py

c24786e

Add files via upload

53d06a4

preproccesing2-demo.py

2608e60

preprocessing

aec0bb9

preprocessing demo

8b19703

Preprocessing demo

7b5dbd9

Oykupnrbs requested review from Ekin-hub-code and TRextabat April 6, 2025 14:10

Oykupnrbs added 2 commits April 8, 2025 17:52

Add preprocessing module

887a0e8

Resolved FileNotFoundError caused by missing dataset file

ca37da3

TRextabat reviewed Apr 9, 2025

View reviewed changes

TRextabat self-requested a review April 9, 2025 16:15

TRextabat approved these changes Apr 9, 2025

View reviewed changes

TRextabat merged commit 461d960 into develop Apr 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/filtering qc metrics highly variable genes#8

Feature/filtering qc metrics highly variable genes#8
TRextabat merged 11 commits intodevelopfrom
feature/filtering-qc_metrics-highly_variable_genes

Oykupnrbs commented Apr 6, 2025 •

edited

Loading

Uh oh!

TRextabat left a comment

Uh oh!

TRextabat Apr 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Oykupnrbs commented Apr 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Description:

Uh oh!

TRextabat left a comment

Choose a reason for hiding this comment

Uh oh!

TRextabat Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Oykupnrbs commented Apr 6, 2025 •

edited

Loading