Skip to content

a program to estimate the difference between data from reads in a SAM file to its reference genome

Notifications You must be signed in to change notification settings

guilhermesena1/seqdiff

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

seqdiff

This program implements Heng Li's formulas in A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

The goal of this program is to take as input a set of mapped reads and a reference diploid genome, and estimate the fraction of nucleotides that mutated, where each nucleotide can take a continuous number between 0 and 1, but likely 0, 0.5 or 1.

The important property of seqdiff is that it can be used for DNA methylation data, which allows Ts in reads to map to either Cs or Ts in the reference genome. With this, one can use WGBS data to estimate how much a dataset differs from the genome to which it was mapped.

Compile the program by running

make
make install

About

a program to estimate the difference between data from reads in a SAM file to its reference genome

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published