Skip to content

Bio-protocol/POInT-WGT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

License: GPL v3

Modeling ancient whole-genome triplication with POInT

POInT stands for the Polyploid Orthology Inference Tool. Using POInT, we modeled the whole-genome triplication (WGT) events shared by the members of the tribe Brassiceae (Hao et al., 2021). Here we will show the steps for different WGT model comparisons.

Installation

The latest version of the POInT software is available on GitHub or on the lab website. For detailed installation guide, please go to the software installation page. To download and compile the software:

wget http://conantlab.org/POInT/POInT.tar
tar xvf POInT.tar
cd POInT
# To compile the OpenMP parallel version
./configure.pl -omp
make

Create a symbolic link in the home directory bin:

cd ~/bin
ln -s /where_POInT_was_downloaded/POInT/POInT

Test to see if the software was successfully compiled:

cd
POInT

You should see the following:

	Using 16 threads for this run
Usage: POInT -g:<genome file> -g:<genome file> -o:<ortholog file> -m:<Model file>  (-r:<Root model file>) (-t:treefile) (-p:<posteriortrackprobs file>) (-c:<conditional probabilities file> (-no_opt) (-s:<start>:<end>) (-zerolengthfixed) (-x:#TreestoSave)

Input data

The input files for this case study are available on Figshare and here.

WGT models

From Figure 2 in Hao et al. (2021).

  1. WGT Null model scenario: Null model with no biased fractionation.
POInT -d:3 -g:Brassica_rapa_POInT_geneorders.txt -g:Brassica_oleracea_POInT_geneorders.txt -g:Sinapis_alba_POInT_geneorders.txt -g:Crambe_hispanica_v3_POInT_geneorders.txt -o:FourSpp_M2Opt3.txt -m:WGT_Null_model.txt -t: BrBoSaCh_WGT_3rate_G1Dom_M2Opt3_Top3.tre
  1. WGT 1Dom model scenario: MF1 and MF2 subgenomes are more fractionated than the LF subgenome. But the fractionation rates for MF1 and MF2 are the same.
POInT -d:3 -g:Brassica_rapa_POInT_geneorders.txt -g:Brassica_oleracea_POInT_geneorders.txt -g:Sinapis_alba_POInT_geneorders.txt -g:Crambe_hispanica_v3_POInT_geneorders.txt -o:FourSpp_M2Opt3.txt -m:WGT_2rate_G1Dom_model.txt -t: BrBoSaCh_WGT_3rate_G1Dom_M2Opt3_Top3.tre
  1. WGT 1DomG3 model scenario: MF2 is more fractionated than MF1, and MF1 is more fractionated than LF.
POInT -d:3 -g:Brassica_rapa_POInT_geneorders.txt -g:Brassica_oleracea_POInT_geneorders.txt -g:Sinapis_alba_POInT_geneorders.txt -g:Crambe_hispanica_v3_POInT_geneorders.txt -o:FourSpp_M2Opt3.txt -m:WGT_3rate_G1Dom_model.txt -t: BrBoSaCh_WGT_3rate_G1Dom_M2Opt3_Top3.tre
  1. Root-spec. WGT 1DomG3 is similar to model 3, but with two sets of parameters, one for the root branch and the other for the remainder of the branches in the phylogenetic tree, modeling the scenario of shifted fractionation rates from root branch to later branches.
POInT -d:3 -g:Brassica_rapa_POInT_geneorders.txt -g:Brassica_oleracea_POInT_geneorders.txt -g:Sinapis_alba_POInT_geneorders.txt -g:Crambe_hispanica_v3_POInT_geneorders.txt -o:FourSpp_M2Opt3.txt -m:WGT_3rate_G1Dom_brnspec_model.txt -t: BrBoSaCh_WGT_3rate_G1Dom_M2Opt3_Top3.tre
  1. WGT 1DomG3 + Root is modeling the two-step hexaploidy scenario (Cheng et al., 2012; Tang et al., 2012), in which the MF1 and MF2 merged first following an initial round of gene loss, and the LF subgenome joined later, with subsequent more gene loss.
POInT -d:3 -g:Brassica_rapa_POInT_geneorders.txt -g:Brassica_oleracea_POInT_geneorders.txt -g:Sinapis_alba_POInT_geneorders.txt -g:Crambe_hispanica_v3_POInT_geneorders.txt -o:FourSpp_M2Opt3.txt -m: WGT_3rate_G1Dom_model.txt -r:WGT_RootModel.txt -t: BrBoSaCh_WGT_3rate_G1Dom_M2Opt3_Top3.tre

Expected results

The likelihoods for different WGT models were compared using likelihood ratio tests shown in the above figure. The significant likelihood increase from model 1 to model 2, and from model 2 to model 3 showed strong evidence of biased fractionation in the three subgenomes. We also found strong support of the two-step hexaploidy model, suggesting that the first two subgenomes, MF1 and Mf2 merged first, and the third subgenome LF joined after a certain amount of time. All of the completed ancient polyploidy analyses can be visualized using an online tool called the POInTbrowse. Users will be able to browse the Gene retention/loss pattens in different subgenomes with simple clicks.

License

POInT is a free and open source software, licensed under GPLv3.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published