Skip to content

Web application for the Manuscript "From Comorbidities to Gene Expression Fingerprints and Back"

License

Notifications You must be signed in to change notification settings

bsc-life/rgenexcom

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rgenexcom (RNA-seq Gene Expression Comorbidities)

Web application for the manuscript Patient stratification reveals the molecular basis of disease co-occurrences

Table of Contents

  1. Web Application
  2. Paper
  3. Documentation
  4. Tutorial

Web Application

rgenexcom is a web application that allows for both detailed studies on the molecular mechanisms underlying specific comorbidities and large-scale studies on the molecular basis of comorbidities.

Link to rgenexcom: http://disease-perception.bsc.es/rgenexcom/

image

Paper

The manuscript is already available in MedRxiv at https://doi.org/10.1101/2021.07.22.21260979.

Abstract

Epidemiological evidence shows that some diseases tend to co-occur; more exactly, certain groups of patients with a given disease are at a higher risk of developing a specific secondary condition. Here we develop a new approach to generate a disease network that uses the accumulating RNA-seq data on human diseases to significantly match an unprecedented proportion of known comorbidities, providing plausible biological models for such co-occurrences and effectively mirroring the underlying structure of complex disease relationships. Furthermore, 64% of the known disease pairs can be explained by analyzing groups of patients with similar expression profiles, highlighting the importance of patient stratification in the study of comorbidities. These results solidly support the existence of molecular mechanisms behind many of the known comorbidities, with most captured co-occurrences implicating the immune system. Additionally, we identified new and potentially underdiagnosed comorbidities, providing molecular insights that could inform targeted therapeutic strategies. We provide a functional and comprehensive web application to explore diseases, disease co-occurrences and their underlying molecular processes at different resolution levels at http://disease-perception.bsc.es/rgenexcom/.

Description of the web application

To enhance the visualization and exploration of the generated networks, we developed a web application that dynamically displays both the DSN and SSN (Chang et al., 2016). This interactive tool allows users to filter the networks based on the type of interactions (positive or negative) and set minimum and maximum thresholds for edge weights. Users can also derive network backbones using metric and ultra-metric closures from Simas et al. Community detection algorithms such as greedy modularity optimization (Pons and Latapy, 2006) or random walks (Clauset et al., 2004) can be applied to the filtered networks and interactions involving specific nodes, genes and pathways from Reactome, KEG, and GO can be selectively filtered. Furthermore, the application facilitates easy inspection and comparison of molecular mechanisms behind diseases and their interactions, enabling users to gain valuable insights into the complexity behind and between diseases at a molecular level.

GitHub

The code of the paper is available at the repository https://github.com/beatrizurda/Urda-Garcia_Patient_Stratification

Documentation

Disease Similarity Network (DSN)

First, we implemented an RNA-seq pipeline to perform Differential Expression Analysis for each disease. Next, we computed the spearmans' correlation between the diseases' gene expression profiles. We kept significant interactions after multiple testing correction (FDR < 0.05).

The obtained DSN contains positive and negative interactions, representing diseases with significantly similar and dissimilar gene expression profiles, respectively. Next, we evaluated the overlap of the positive interactions in the DSN with the epidemiological network from Hidalgo et al. Positive interactions in the DSN described in this epidemiological network are represented with red dashed lines.

Stratified Similarity Network (SSN)

We stratified each disease into subgroups of patients with similar expression profiles (meta-patients) by applying the k-medoids clustering algorithm to its normalized and batch effect corrected gene expression matrix. Next, we performed Differential Expression Analyses for each meta-patient.

To analyze the disease subtype-associated comorbidities, we built the Stratified Similarity Network (SSN) connecting meta-patients and diseases based on the similarities of their gene expression profiles (following the same methodology described for the DSN). The resulting Stratified Similarity Network (SSN) contains three types of interactions: (1) the previously described disease-disease interactions, (2) interactions connecting different meta-patients and (3) interactions connecting meta-patients to diseases.

Molecular mechanism behind diseases

It shows the Reactome pathways significantly dysregulated in human diseases by pathway category. For each disease, Reactome pathways significantly over and underexpressed were identified using the GSEA method (FDR <= 0.05). Ward2 algorithm was applied to cluster diseases based on the Euclidean distance of their binarized Normalized Effect Size (1s, and -1s for over and underxpressed pathways). The heatmap shows the dysregulated Reactome pathways (rows) in the diseases (columns), where over and underexpressed pathways are blue and red colored respectively.

Molecular mechanism behind disease interactions

Over and underexpressed pathways behind epidemiological and not epidemiological interactions for each disease category pair. Percentage of epidemiological versus non epidemiological interactions that share overexpressed or underexpressed pathways. Each point represents a Reactome pathway category. The size of the points corresponds to the mean number of shared pathways in the epidemiological interactions. The color corresponds to the ratio of the mean number of shared pathways in epidemiological versus non epidemiological interactions.

Get genes and pathways

In this section you can access the differentially expressed genes and pathways in a given phenotype (disease or meta-patient) and commonly dysregulated in phenotype pairs or groups.

If you select one phenotype, you will get the table of dysregulated genes and pathways for that phenotype. You can filter the tables by selecting only the features that are significantly altered or by selecting only the over or underexpressed features. If you only select two or more phenotypes, you will get the genes or pathways that are significantly altered in all those phenotypes. Again, you can select only the over or the underexpressed features.

Tutorial

Download the tutorial of the web application Download PDF

About

Web application for the Manuscript "From Comorbidities to Gene Expression Fingerprints and Back"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages