Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some MuTect files are improperly joined #77

Closed
lbeltrame opened this issue Aug 1, 2013 · 3 comments
Closed

Some MuTect files are improperly joined #77

lbeltrame opened this issue Aug 1, 2013 · 3 comments

Comments

@lbeltrame
Copy link
Contributor

When testing the pipeline after fixing the cluster problem, I noticed that the GATK errored out much later on with this:

The provided VCF file is malformed at approximately line number 4: The VCF specification does not allow for whitespace in the INFO field

Looking at the VCF file, I noticed the following:

##fileformat=VCFv4.1
## No variants; no reads aligned in region
#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO
chr10   89685461        .       G       T       .       REJECT  AC=0;AF=0.00;AN=1;DP=456        GT:AD:DP:FA     0:454,1:456:2.198e-03

As you can see, it's an empty VCF that's however been "filled" somehow. The reason is that MuTect manages to write data before erroring out (with that famous Java 7 error we discussed), therefore generating a wrong VCF file.

I'm not sure there's an easy solution for this, given that the GATK is now requiring Java 7 and MuTect does not work yet properly with it.

On the plus side, this gives me more motivation to implement support for other paired callers. ;)

@lbeltrame
Copy link
Contributor Author

Part of the sorting error is being fixed in broadinstitute/mutect#2 (although perhaps a different way will be implemented in the end).

@chapmanb
Copy link
Member

chapmanb commented Aug 1, 2013

Luca;
Thanks for the report. This is a bug in how I do the concatenation of files. The final concatenated files gets the header of the first file, which will be problematic if the first file is empty. I checked in a fix to avoid this so hopefully it'll work cleanly now. Let me know if you run into any other issues.

@lbeltrame
Copy link
Contributor Author

Thanks Brad, it works correctly now (run ended successfully).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants