Skip to content
Dreycey Albin edited this page May 16, 2019 · 1 revision

CANU (genome assembly)

Installation

git clone https://github.com/marbl/canu.git cd canu/src make -j 3

Basic commands

canu [-correct | -trim | -assemble | -trim-assemble]
[-s ]
-p
-d
genomeSize=[g|m|k]
[other-options]
[-pacbio-raw | -pacbio-corrected | -nanopore-raw | -nanopore-corrected] *fastq

  • can correct, read, and trim separately:

  • correct

canu -correct \
  -p ecoli -d ecoli \
  genomeSize=4.8m \
  -pacbio-raw  pacbio.fastq
  • read
canu -trim \
  -p ecoli -d ecoli \
  genomeSize=4.8m \
  -pacbio-corrected ecoli/ecoli.correctedReads.fasta.gz
  • assemble
canu -assemble \
  -p ecoli -d ecoli-erate-0.039 \
  genomeSize=4.8m \
  correctedErrorRate=0.039 \
  -pacbio-corrected ecoli/ecoli.trimmedReads.fasta.gz

INPUT

GENERAL INPUT INTO PROGRAM (files and commands)

OUTPUT

GENERAL OUTPUT FROM PROGRAM (files and commands)

Example_1- assemble a low coverage genome

  • stolen from the reference manual
curl -L -o yeast.20x.fastq.gz http://gembox.cbcb.umd.edu/mhap/raw/yeast_filtered.20x.fastq.gz
gzip -d yeast.20x.fastq.gz;

Input files

nanopore reads

Output files

Genome assembly files

Download nanopore data

wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR764/ERR764952/ERR764952_1.fastq.gz;
wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR764/ERR764952/ERR764952_2.fastq.gz

convert nanopore fastq files to fasta

cat ERR764952_1.fastq | paste - - - - | sed 's/^@/>/g'| cut -f1-2 | tr '\t' '\n' > NANOPORE_INPUT.fa

commands

./Darwin-amd64/bin/canu  -p albinout -d pairedout  genomeSize=4.8m  -nanopore-raw Nanopore_TB.fa

Bioinformatics tools

These are a growing collection of manuals for commonly used bioinformatics tools.

How to use

Just go to the page for the tool you are trying to use, and scroll through the page to download and install. That simple. The goal is to add extra documentation for using these tools, in addition to what is already supplied by the manual pages for the programs.

Clone this wiki locally