Skip to content

Performs enrichment plots of GWAS p-value lists in DHSs (like Fig. 5 of Maurano, Humbert, et al. Science 2012)

Notifications You must be signed in to change notification settings

mauranolab/GWAS_plots

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 

Repository files navigation

#GWAS_plots Performs enrichment plots of GWAS p-value lists in DHSs (like Fig. 5 of Maurano, Humbert, et al. Science 2012)

#Requirements:

#Example usage

cd src
perl ./gwasVsRegions.pl -p pvalues.hg19.YOURSTUDY.bed5 -s ../hg19/namedFDR5pctHotspots.starch -r ../results_hotspots_nocoding

#Input format The GWAS P-value file is a tab-delimited file provided by the user. It includes the dbSNP ID in column 4, and the P-value in column 5; only the latter is actually used.

#Details The two key scripts are in src. The pipeline is divided into a Perl script which processes the overlap, and then a second script which does the plotting in R:

  1. gwasVsRegions.pl runs the overlap of the p-value file with the DHS master list

    The intermediate results go into results_hotspots_nocoding/YOURSTUDY. There is one text file per cell type, though you can see at the end of a Perl script I put all of these together into a single file before plotting.

    namedFDR5pctHotspots.starch is a starch archive (see BEDOPS) containing the DHS master list.

    Samples listed in excluded_samples.txt will be ignored

    (This script was written by Eric Haugen, UW)

  2. GWASPlot.R is the plotting script.
    It is invoked automatically at the end of the overlap script.

    You can see that right now only the x-axis upper limit and of the number of cell types to label are parameterized on the command line. If you look inside, you'll see that the legendSamples list maps samples to group names and colors using regexp.

    This could be easily parallelized by chromosome.

    Key variables for each plot need to be optimized by the user:

    • which samples to display on the plot
    • what the labels to give each sample and how to color them (only the top N samples get labels)
    • The sample groups and labels (i.e. endothelia, muscle, fetal, etc.)
    • adjusting the range of the X and Y axes
    • plot title

About

Performs enrichment plots of GWAS p-value lists in DHSs (like Fig. 5 of Maurano, Humbert, et al. Science 2012)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published