Skip to content

Telomeasure estimates average telomere length from whole genome sequencing (WGS) BAM files.

License

Notifications You must be signed in to change notification settings

nygenome/telomeasure

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Telomeasure

This repository is deprecated

- WARNING: Repo has been MOVED! Now has a docker image and a WDL workflow

Telomeasure calculates estimated telomere length. The output includes three estimates. The telomere_length is calculated using the number of telomeres indicated by arm level coverage and with the total alignment length after removing the percent of alignments that would be from duplicate reads based on duplication rate. The tumor_telomere_length_46 assumes 46 telomeres. The telomere_length_non_dedup is calculated without subtracting a fraction of telomere alignments based on duplications rate.

Telomeasure pipeline workflow

In most somatic cells telomere length decreases with each cycle of replication. Replicating tumor cells can escape replicative senescence triggered by critically short telomeres through either the expression of telomerase (expression of TERT) [1,2] or by homologous recombination-mediated alternative lengthening of telomeres (ALT) [3]. To investigate telomere dynamics in normal and their matching tumor cells we developed Telomeasure.

Telomeasure estimates average telomere length from whole genome sequencing (WGS) BAM files. Chromosome arm-level coverage is used to infer the number of telomeres and alignment to mock-human telomeres to estimate telomeric DNA content. Reads with one telomeric repeat (TTAGGG) are aligned with the fast and exhaustive GEM mapper [4]. Because GEM can return gapped alignments imperfect telomeric repeats can be detected. Mapping position of the telomeric read is also reported for use in identifying genomic rearrangements that relocate telomeres.

The total length of the alignment to the synthetic telomere is used with the estimated number to estimate average telomere length. Initially, base coverage is estimated for the genome (coverageautosomes) after filtering duplicate reads. Coverage is compared to chromosome arm coverage (coveragechrom_arm) to estimate the number of telomeres (telomerecount) (equation 1). At the same time coverage is estimated for autosomal sequence with similar GC content to telomeres (50%) (GC-matched coverageautosomes). Putative telomeric sequence in WGS reads is identified and the length (telo-aligned length) is calculated using the GC-matched coverageautosomes. Finally, average telomere length (average telo-length) is estimated by dividing telo-aligned length by telomerecount this is also divide by GC-matched coverageautosomes (equation 2).

Telomeasure steps

Citations

  1. Kim NW, Piatyszek MA, Prowse KR, Harley CB, West MD, Ho PL, Coviello GM, Wright WE, Weinrich SL, Shay JW, Specific association of human telomerase activity with immortal cells and cancer. Science, 1994;266(5193).
  2. Shay JW and Bacchetti S. A survey of telomerase activity in human cancer. Eur J Cancer. 1997;33(5).
  3. Henson JD and Reddel RR. Assaying and investigating Alternative Lengthening of Telomeres activity in human cells and cancers. FEBS Lett. 2010. 584(17).
  4. Marco-Sola S, Sammeth M, Guigó R, Ribeca P. The GEM mapper: fast, accurate and versatile alignment by filtration. Nature Methods. 2012;9(12):1185–1188.

License

This project is licensed under BSD License 2.0

Contact Jennifer Shelton jshelton@nygenome.org, New York Genome Center

Sept 16, 2020