Skip to content
This repository has been archived by the owner on Mar 4, 2020. It is now read-only.

Latest commit

 

History

History
49 lines (25 loc) · 1.97 KB

dna_single_sample.md

File metadata and controls

49 lines (25 loc) · 1.97 KB

megSAP - DNA analysis (single sample)

Basics

Single sample DNA analysis is performed using the analyze.php script.
Please have a look at the help using:

> php megSAP/src/Pipelines/analyze.php --help

The main parameters that you have to provide are:

  • folder - The sample folder, which contains the the FASTQ files as produced by bcl2fastq2.
  • name - The sample name, which must be a prefix of the FASTQ files.
  • steps - Analysis steps to perform. Please use ma,vc,an to perform mapping, variant calling and variant annotation.
  • system - The processing system INI file.

Poster

A poster about megSAP which describes the all steps of the single-sample analysis pipeline can be found here.

Running an analysis

The analysis pipeline assumes that that all data to analyze resides in a sample folder as produced by Illumina's bcl2fastq tool. If that is the case, the whole analysis is performed with one command, for example like this:

php megSAP/src/Pipelines/analyze.php -folder Sample_NA12878_01 -name NA12878_01 -system hpHBOCv5.ini -steps ma,vc,an

In the example above, the configuration of the pipeline is done using the hpHBOCv5.ini file, which contains all necessary information (see processing system INI file).

Example data with can be analyzed using the command above can be downloaded from here.

Output

After the analysis, these files are created in the output folder:

  1. mapped reads in BAM format
  2. a variant list in VCF format
  3. a variant list in GSvar format
  4. QC data in qcML format, which can be opened with a web browser

back to the start page