-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Percentage unmapped: 100 #167
Comments
Hi I have same problem here. |
grep "barcode_of_interest" your.fastq_R2.file | head -n 20
that should print reads containing your barcodes
and to know where to trim, count the bases from the left of the sequence until the first nt of the barcode
Hope it helps,
Miriam
…________________________________
From: Dianita Susilo Saputri ***@***.***>
Sent: Tuesday, April 12, 2022 5:28 AM
To: Hoohm/CITE-seq-Count ***@***.***>
Cc: Miriam Llorian Sopena ***@***.***>; Author ***@***.***>
Subject: Re: [Hoohm/CITE-seq-Count] Percentage unmapped: 100 (Issue #167)
External Sender: Use caution.
Hi I have same problem here.
How did you extract that sequence from fastq and determine the start trim point?
do we use R2 fastq for it?
Thank you
—
Reply to this email directly, view it on GitHub<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FHoohm%2FCITE-seq-Count%2Fissues%2F167%23issuecomment-1095992575&data=04%7C01%7C%7C43d7565d0cdf435498e608da1c3cda2e%7C4eed7807ebad415aa7a99170947f4eae%7C0%7C0%7C637853344938463050%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=jr7eSCnUof%2FZvUseRCvywNeymaSSsQXnWrUtXp%2BeI0Y%3D&reserved=0>, or unsubscribe<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAGLEBWLL5G7BURPOMPQW7QDVET3VTANCNFSM5SSOCWCQ&data=04%7C01%7C%7C43d7565d0cdf435498e608da1c3cda2e%7C4eed7807ebad415aa7a99170947f4eae%7C0%7C0%7C637853344938463050%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=lXasiHU6xKUfEw4ZorXBM6iDoYYeQIXtF0gRd6vLqVQ%3D&reserved=0>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT
|
Happy to see fellow users help each other. @dianitasusilo maybe a sliding window approach might help yes: @sopenaml Could you check out of your barcodes in R1 are overlapping? It might be a mapping between barcodes similar to totalSeqB. |
Hi Patrick, I've checked my cite-seq ab barcodes agains R1 and I don't see any matches. If I check my cell hashing barcodes, there's one that finds few (7 ) matches on R1, but the rest none. So it's not that my barcodes are overlapping with cell barcodes. Any other ideas of what the problem may be? Thanks |
Hi, |
Hi, I'm afraid I didn't get an answer/solution. My suspicion is that, at least in my case, the ADTs only label a small proportion of the cells so I was wondering if that could be the problem. I've tried cell-ranger multi instead, and I do get counts for the ADTs so I think I'm goin g to use cell ranger multi for now..
…________________________________
From: Lauren Wasson ***@***.***>
Sent: Tuesday, May 10, 2022 9:26 PM
To: Hoohm/CITE-seq-Count ***@***.***>
Cc: Miriam Llorian Sopena ***@***.***>; Mention ***@***.***>
Subject: Re: [Hoohm/CITE-seq-Count] Percentage unmapped: 100 (Issue #167)
External Sender: Use caution.
Hi,
I have the same problem, where I can grep my HTO out of read 2 but still get 100% reads unmapped. I am running 1.4.5 using Python 3.9. Do we know what the solution to this issue is?
—
Reply to this email directly, view it on GitHub<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2FHoohm%2FCITE-seq-Count%2Fissues%2F167%23issuecomment-1122827525&data=05%7C01%7C%7Cf2b4fe5b64b340682c2408da32c35098%7C4eed7807ebad415aa7a99170947f4eae%7C0%7C0%7C637878111710828541%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=w7d6%2F9pQlFZuK6oRBEwPx094TWQZ1%2BZcEn7ouV2N5FI%3D&reserved=0>, or unsubscribe<https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAGLEBWLD2X2G7VS2QJM5XOTVJLA55ANCNFSM5SSOCWCQ&data=05%7C01%7C%7Cf2b4fe5b64b340682c2408da32c35098%7C4eed7807ebad415aa7a99170947f4eae%7C0%7C0%7C637878111710828541%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=UVi3IogEEigT1CABkORGQGx%2Fkk%2BvLjVl7Fi33K0VtQ4%3D&reserved=0>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
The Francis Crick Institute Limited is a registered charity in England and Wales no. 1140062 and a company registered in England and Wales no. 06885462, with its registered office at 1 Midland Road London NW1 1AT
|
Hey @sopenaml, Here is the translation matrix. Is it a bit clearer? |
Hi everyone, I am running into an issue, where I do have about ~35% unmapped reads. Is there a way to bring that number up? Grepping the R2 file, shows that start trim needs to be --start-trim 0 Here is the is what I run to get such output:
Here is the the output I get: Thank you in advance! |
I am able to bring the number of mapped reads above 90, by setting --max-error 6 or higher. However, I do not think that is it a good solution as I get plenty of doublets and negatives Doublet 1868 Negative 35 Singlet 394 . Any idea what else I can do? |
Would you be able to send me a sample of your data so that I can run it and have a look? |
Hi Patrick,
Thank you for your reply. Yes, here is the data and the tags (hashtags
used). Please let me know if I am missing anything! I appreciate you
looking into this!
sham_hto.zip
<https://drive.google.com/file/d/1ynqVw-74I9gw6NErchiv0ix1jFSnP4kC/view?usp=drive_web>
tags (3).csv
<https://drive.google.com/file/d/1ahBW4U7tgkDH_qXLr_nVv051hncM0MiM/view?usp=drive_web>
…On Wed, Aug 10, 2022 at 4:54 AM Patrick Roelli ***@***.***> wrote:
Would you be able to send me a sample of your data so that I can run it
and have a look?
—
Reply to this email directly, view it on GitHub
<#167 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AQZZGLNKICTMW4YN72EZNITVYNU4BANCNFSM5SSOCWCQ>
.
You are receiving this because you commented.Message ID:
***@***.***>
--
Katya Stepanova
Graduate Student
Deppmann & Campbell Labs
|
I asked for access |
It seems pretty reasonable from what I see in the first sample. The unmapped.csv gives you the top sequences that are not mapping. 22% of polyG, means no sequence there or could not be read Why do you need to get higher? I want to make sure about the translation issue. Do you have a high overlap between the cells from the RNA side and the HTO? |
Hi, is the translation matrix used only with v3 chemistry? I seem to have a similar problem where grep doesnt return barcodes in my fastq R2 for which I know exist in my data after cellranger. I used the 5' v2 chemistry with gex, vdj and feature barcode libs. Thanks Leeana |
Hi, sorry for asking again, but after going through the previous answers I have not found the answer that solves my problem. I have data that contains: HTOs, ADTs, 5' gex and VDJ. I'm trying to use CITE-seq-count to get matrix counts for the HTOs and ADTs, in order to demultiplex my data. To do so I've opted for doing to separate runs for HTOs and ADTs for which I provide the
tags.csv
or cellsurface.barcodes.csv
All my abs are TotalSeq-C, and upon grep the tags I can see that they start in position 11: so I've added
--start-trim 10
When I run the script with the
tags.csv
file I get 96% mapped but when I do with thecellsurface.barcodes.csv
, I get 100% unmapped despite I can grep the tags in R2. Would you know why the ADT tags are not mapped? Since the libraries should contain both cell surface barcodes and HTOs I would expect the mapping to be split between both or am I wrong? Can anyone help please? Thank you very much.Miriam
my running commands
The text was updated successfully, but these errors were encountered: