These scripts are extracted from TADbit, a computational framework to analyze and model the chromatin fiber in three dimensions, to illustrate the use of IMP to model a genomic region with 5 Topologically Associating Domains (TADs) from the fly genome.
-
data
: contains the input data and configuration files. -
scripts
01_model_and_analyze.py
the main IMP script modeling to model the regionIMPOptimizer.py
contains classes and scripts to find the optimal parameters in the modelling for IMP
-
outputs
will contain the results of the script as a folder structure with:- .log file
- .tsv optimal parameters' file for IMP
- .xyz file for each model
- .cmm UCSF Chimera file for each model
- .cmd UCSF Chimera file to display all the models at the same time
- .json file to visualize and analyze the model using TADkit. TADkit creates interactive 3D representations of chromatin conformations.
-
test
file to run the test
To run the modeling script, just change into its directory and run it from the command line, e.g.
cd scripts
python 01_model_and_analyze.py --cfg ../data/chr4.cfg --ncpus 12
A prebuild optimal imp parameters it's already included in the data directory to speed up the test which should take around 2 hours in a single cpu. If you wish to test the optimization and the modeling just delete the configuration file or change the extension to something else than .tsv. The whole computation should take around 40 hours in a single cpu.
The results will be produced in the outputs directory as described above.
Author(s): François Serra, Davide Baù, David Castillo, Guillaume Filion, Marc A. Marti-Renom
Date: October 1st, 2016
Testable: Yes.
Parallelizeable: Yes
Citation:
- Serra, F., Baù, D., Filion, G., & Marti-Renom, M. A. (2016). Structural features of the fly chromatin colors revealed by automatic three-dimensional modeling. bioRxiv. doi:10.1101/036764