Adding annotation of potential doublets to anndata obs #193

ptajvar · 2024-10-15T15:08:00Z

During the graph step, we split some components that appear to be technical multiplets using community detection. However, we set the parameters relatively conservatively to avoid breaking single cells up into multiple components. In the annotation step, we now add a less conservative parametrization to mark "potential doublets".

What is added:

is_potential_doublet as a column in adata.obs.
n_edges_to_split_doublet as a column in adata.obs: the number of edges (molecules) that would have to be removed to split the two (or more) sub-communities detected in the potential doublet.
fraction_potential_doublets in the JSON report of the annotation step (the rate of is_potential_doublet).
n_edges_to_split_potential_doublets in the JSON report of the annotation step (the sum of n_edges_to_split_doublet).
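
As a rough illustration of how the per-component columns and the report values listed above relate, here is a minimal sketch. It is not pixelator's implementation: the helper count_edges_to_split, the toy edge list, the hard-coded community labels, and the plain pandas DataFrame standing in for adata.obs are all assumptions made for the example, as is the choice of 0 for components that are not flagged.

```python
import pandas as pd


def count_edges_to_split(edges, community_of):
    """Count edges whose two endpoints lie in different sub-communities.

    Hypothetical helper for illustration only; not part of pixelator.
    """
    return sum(1 for u, v in edges if community_of[u] != community_of[v])


# Toy component: two triangles joined by a single bridging edge (c-d).
edges = [
    ("a", "b"), ("b", "c"), ("a", "c"),
    ("d", "e"), ("e", "f"), ("d", "f"),
    ("c", "d"),
]
# Assumed output of a community detection run on this component.
community_of = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1, "f": 1}

n_cut = count_edges_to_split(edges, community_of)

# Stand-in for adata.obs: one row per component. Using 0 for components
# that are not flagged is an assumption of this sketch.
obs = pd.DataFrame(
    {
        "is_potential_doublet": [True, False, False, False],
        "n_edges_to_split_doublet": [n_cut, 0, 0, 0],
    },
    index=["cmp1", "cmp2", "cmp3", "cmp4"],
)

# Report values as described in the list: a rate and a sum.
report = {
    "fraction_potential_doublets": float(obs["is_potential_doublet"].mean()),
    "n_edges_to_split_potential_doublets": int(
        obs["n_edges_to_split_doublet"].sum()
    ),
}
print(report)
```

In this toy case only the bridging edge crosses the two sub-communities, so removing that one edge separates them.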

Fixes: EXE-2025

Type of change

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

The unit tests.

PR checklist:

This comment contains a description of changes (with reason).
I have performed a self-review of my own code
My changes generate no new warnings
I have checked my code and documentation and corrected any misspellings
I have documented any significant changes to the code in CHANGELOG.md

ambarrio

LGTM. I left some comments to take into account, but I am approving now so that the merge is not delayed once they have been answered or addressed.

src/pixelator/annotate/__init__.py

ambarrio · 2024-10-21T23:28:23Z

src/pixelator/annotate/__init__.py

@@ -433,4 +437,12 @@ def anndata_metrics(adata: AnnData) -> AnnotateAnndataStatistics:
    if "doublet_size_threshold" in adata.uns:
        metrics["doublet_size_threshold"] = adata.uns["doublet_size_threshold"]

+    if "is_potential_doublet" in adata.obs:
+        metrics["fraction_potential_doublets"] = adata.obs[


If is_potential_doublet is not present here, what happens when mean() is run?

There would be a KeyError, which is why we check for it first.

But what happens if adata.obs["is_potential_doublet"] comes back empty and we call mean() on it?

We only call mean() if "is_potential_doublet" exists as a column in adata.obs. Otherwise that part is skipped and "fraction_potential_doublets" keeps its default value, which is 0 right now; following the discussion, we are going to change it to None.
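
The behaviour discussed in this thread can be checked with plain pandas. The sketch below is an illustration under assumptions, not the code in this PR: the function name is made up, a DataFrame stands in for adata.obs, and None is used as the "not computed" value as proposed in the discussion.

```python
import math

import pandas as pd


def fraction_potential_doublets(obs):
    """Return the doublet rate, or None when the column is absent.

    Hypothetical helper; mirrors the membership check discussed above.
    """
    if "is_potential_doublet" not in obs:
        return None
    return float(obs["is_potential_doublet"].mean())


with_column = pd.DataFrame({"is_potential_doublet": [True, False, False, True]})
without_column = pd.DataFrame({"some_other_column": [1, 2]})
# Column present but holding zero rows.
empty_column = pd.DataFrame({"is_potential_doublet": pd.Series([], dtype=bool)})

print(fraction_potential_doublets(with_column))
print(fraction_potential_doublets(without_column))
print(math.isnan(fraction_potential_doublets(empty_column)))
```

The guard covers a missing column. A column that exists but has no rows does not raise in pandas; mean() returns NaN in that case, so a caller that needs to tell the two situations apart would have to handle NaN separately.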

ambarrio · 2024-10-21T23:28:38Z

src/pixelator/annotate/__init__.py

+        metrics["fraction_potential_doublets"] = adata.obs[
+            "is_potential_doublet"
+        ].mean()
+        metrics["n_edges_to_split_potential_doublets"] = adata.obs[


Same question for sum().
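
For sum(), a short pandas check suggests why an explicit existence test plus a None default is useful. This is a sketch under assumptions (hypothetical function name, DataFrame standing in for adata.obs), not the code merged in this PR.

```python
import pandas as pd


def n_edges_to_split_potential_doublets(obs):
    """Return the summed edge count, or None when the column is absent.

    Hypothetical helper for illustration only.
    """
    if "n_edges_to_split_doublet" not in obs:
        return None
    return int(obs["n_edges_to_split_doublet"].sum())


with_column = pd.DataFrame({"n_edges_to_split_doublet": [3, 0, 2]})
without_column = pd.DataFrame({"some_other_column": [1, 2]})
empty_column = pd.DataFrame(
    {"n_edges_to_split_doublet": pd.Series([], dtype="int64")}
)
# Missing values are skipped by sum() by default (skipna=True).
with_missing = pd.DataFrame({"n_edges_to_split_doublet": [3.0, float("nan"), 2.0]})

print(n_edges_to_split_potential_doublets(with_column))
print(n_edges_to_split_potential_doublets(without_column))
print(n_edges_to_split_potential_doublets(empty_column))
print(n_edges_to_split_potential_doublets(with_missing))
```

Unlike mean(), sum() over an empty column yields 0 rather than NaN, so a default of 0 cannot distinguish "no edges to remove" from "metric not computed"; returning None for the missing-column case keeps the two apart.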

src/pixelator/pixeldataset/utils.py

johandahlberg

I had some small suggestions regarding the type annotations. I think the code is clear and simple. Nice job!

src/pixelator/annotate/__init__.py

src/pixelator/pixeldataset/utils.py

ptajvar added 2 commits October 21, 2024 12:33

adding annotation of potential doublets

8ad1a9b

split _compute_sub_communities from _assess_doublet

e08516d

ptajvar force-pushed the feature/exe-2025-annotate-potential-doublets branch from e79fbf4 to e08516d Compare October 21, 2024 10:33

ptajvar added 5 commits October 21, 2024 17:02

Added n_edges_to_split_potential_doublets

128f116

Reduced Leiden resolution by 0.5 in the annotation step

42b1537

updated CHANGELOG.md

7a3e5c1

fix: set default values for fraction_potential_doublets and n_edges_to_split_potential_doublets

2b03001

fix: make fraction_potential_doublets non-optional

4d38f39

ptajvar marked this pull request as ready for review October 21, 2024 15:25

ptajvar requested review from ambarrio and johandahlberg October 21, 2024 15:25

ambarrio approved these changes Oct 21, 2024

johandahlberg approved these changes Oct 22, 2024

ptajvar and others added 10 commits October 22, 2024 10:05

Update src/pixelator/pixeldataset/utils.py

13f6618

Co-authored-by: Johan Dahlberg <johan.dahlberg@pixelgen.tech>

Update src/pixelator/pixeldataset/utils.py

a427c36

Co-authored-by: Johan Dahlberg <johan.dahlberg@pixelgen.tech>

Update src/pixelator/pixeldataset/utils.py

19bd66a

Co-authored-by: Johan Dahlberg <johan.dahlberg@pixelgen.tech>

Update src/pixelator/pixeldataset/utils.py

2e46451

Co-authored-by: Johan Dahlberg <johan.dahlberg@pixelgen.tech>

fix: typo

8db21e2

added RELATIVE_ANNOTATE_RESOLUTION as a constant

98da20f

make fraction_potential_doublets and n_edges_to_split_potential_doublets optional

69041e7

set default fraction_potential_doublets and n_edges_to_split_potential_doublets to None

63b4ef1

explicitly check n_edges_to_split_doublet exists before using sum

fb156c9

fix: typecheck

7042c1f

ptajvar merged commit 7ca7168 into dev Oct 22, 2024
14 checks passed

ptajvar deleted the feature/exe-2025-annotate-potential-doublets branch October 22, 2024 12:10
