Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding filtering of Low Frequency and Subclonal Mutations #68

Closed
Rohit-Satyam opened this issue Nov 10, 2022 · 4 comments
Closed

Regarding filtering of Low Frequency and Subclonal Mutations #68

Rohit-Satyam opened this issue Nov 10, 2022 · 4 comments
Labels
bug Something isn't working

Comments

@Rohit-Satyam
Copy link

Rohit-Satyam commented Nov 10, 2022

What happened?

I tried to assign Variant Allele Frequency on the wf-artic VCF files using vafator and I realize that there were few Subclonal and Low frequency Variants ( VAF < 20 % were considered LOW_FREQUENCY and variants with a VAF >= 20 % and < 80 % are considered SUBCLONAL). I wish to understand if they are retained or filtered before consensus FASTA assembly generation?

If not filter, do you suggest to filter them before submitting to GISAID. If yes, would you include this feature to filter out anything lower than 0.8 (or 80%).

Operating System

ubuntu 20.04

Workflow Execution

Command line

Workflow Execution - EPI2ME Labs Versions

Workflow Version

0.3.18

@Rohit-Satyam Rohit-Satyam added the bug Something isn't working label Nov 10, 2022
@Rohit-Satyam
Copy link
Author

@cjw85 @mattdmem @sarahjeeeze Do you guys have any thoughts on filtering of the variants for generating consensus fasta?

@Rohit-Satyam
Copy link
Author

Referencing a related issue here

@mattdmem
Copy link
Contributor

Currently we implement the field bioinformatics package from the Artic Network. We'll take this feedback into account for future versions of the workflow.

@corneliusroemer
Copy link
Contributor

@Rohit-Satyam I'd think you should report the subclonal sites as ambiguous if they are above a threshold of coverage - otherwise make them N. You should definitely not make them reference (reversions to reference are a very common artefact and screw up phylogenetics in a bad way).

I'm not sure what you mean by filter. Filtering them out entirely might cause the site to be output as reference which would be bad.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Development

No branches or pull requests

3 participants