Enrichment Network Analysis and Visualization (ENViz) Cytoscape plugin for integrative statistical analysis and visualization of multiple sample matched.

Slides:



Advertisements
Similar presentations
Martijn van Iersel PathVisio & WikiPathways BiGCaT Bioinformatics Maastricht University.
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of a different aspect of the cellular.
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
How we assist knowledge collection Serving the monks Chris Evelo Dept of Bioinformatics – BiGCaT Maastricht University.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Where we are and where we are going From biology to data and back again Chris Evelo Department of Bioinformatics - BiGCaT Maastricht University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
BioPathways SIG, July Networks in Biology Molecular interaction and similarity networks are vital for understanding gene function.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Daehee Hwang Leroy Hood Institute for Systems Biology.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Introduction The goal of translational bioinformatics is to enable the transformation of increasingly voluminous genomic and biological data into diagnostics.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Quantitative PCR Analysis of DNA, RNAs, and Proteins in the Same Single Cell A. Ståhlberg, C. Thomsen, D. Ruff, and P. Åman December 2012
Radiogenomics in glioblastoma multiforme
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Copyright OpenHelix. No use or reproduction without express written consent1.
Networks and Interactions Boo Virk v1.0.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Visualization and analysis of microarray and gene ontology data with treemaps Eric H Baehrecke, Niem Dang, Ketan Babaria and Ben Shneiderman Presenter:
CellFateScout step- by-step tutorial for a case study Version 0.94.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Network-Ontology Visualization and Analysis (AVALON) Chao Zhang Computer Science Department 1.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
SUPPLEMENTAL FIGURES AND TABLES. Supplementary Table 1: List of new and improved features in GSEA-P version 2 Java software. Examples and screenshots.
Copyright OpenHelix. No use or reproduction without express written consent1.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
GO enrichment and GOrilla
Introduction and Applications of Microarray Databases Chen-hsiung Chan Department of Computer Science and Information Engineering National Taiwan University.
Welcome to Gramene’s RiceCyc (Pathways) Tutorial RiceCyc allows biochemical pathways to be analyzed and visualized. This tutorial has been developed for.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Canadian Bioinformatics Workshops
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
GPML Plugin for Cytoscape Thomas Kelder Maastricht University
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
Multi-scale network biology model & the model library 多尺度网络生物学模型 -- 兼论模型库的建立与应用 Jianghui Xiong 熊江辉
Thomas Hartung, Andre Kleensang & team
a Cytoscape plugin to assess enrichment of
Volume 19, Issue 5, Pages (May 2017)
Pathway Visualization
The Omics Dashboard Suzanne Paley Pathway Tools Workshop 2018
Nicole Anguiano and Anu Varshneya
Schedule for the Afternoon
Volume 5, Issue 4, Pages e5 (October 2017)
Volume 23, Issue 4, Pages (April 2018)
A Call for Systematic Research on Solute Carriers
Anastasia Baryshnikova  Cell Systems 
Phenotype validation: a novel heat shock function for the copper-responsive regulator CopR. Phenotype validation: a novel heat shock function for the copper-responsive.
The Omics Dashboard.
Volume 4, Issue 3, Pages e3 (March 2017)
Volume 29, Issue 5, Pages (May 2016)
Volume 19, Issue 5, Pages (May 2017)
Knowledge-Guided Sample Clustering
The CREBBP-modulated network is enriched in signaling pathways upregulated in the light zone (LZ). The CREBBP-modulated network is enriched in signaling.
Volume 28, Issue 3, Pages e7 (July 2019)
Characteristic gene expression patterns distinguish LCH cells from other immune cells present in LCH lesions. Characteristic gene expression patterns distinguish.
Presentation transcript:

Enrichment Network Analysis and Visualization (ENViz) Cytoscape plugin for integrative statistical analysis and visualization of multiple sample matched data sets Anya Tsalenko Agilent Laboratories December 14, 2012

Why ENViz? Many high throughput data sets measured in the same set of samples: - ‘omics’ - proteomics - metabolomics Rich databases with systematic annotations: - GO - pathways - drug targets How do we analyze this data together to get deeper biological insights into studied phenotype?

Genomic Workbench Integrated AnalysisNetwork Biology Integrated Biology Informatics Primary Analysis LC/MS GC/MS Microarrays Target Enrichment NMR Microfluidics Proteins Metabolites DNA / RNA miRNA GeneSpring MassHunter Workstation Genome Browser Public Data Hypothesis, experiment, model Integrated Biology

Example: breast cancer study “miRNA-mRNA integrated analysis reveals roles for miRNAs in primary breast tumors” Enerly et al, PLoS One 2011 Cancer dataset from Anne-Lise Børresen-Dale Lab in Norwegian Radium Hospital, Oslo 100 breast tumor samples with various characteristics Matched miRNA and mRNA data, Agilent microarrays

Correlation of miRNA and mRNA expression, miR-150 Sorted expression of miRNA -150 Genes sorted by correlation to miR-150 across 100 breast cancer samples

Enrichment analysis of ranked list of genes correlated to miR-150 GO terms enrichment analysis in the top of the list of genes ordered by correlation to miR-150 based on minimum Hypergeometric Statistics (Eden et al, PLoS CB 2007) mHG p-value<E-147 Analysis and visualization in GOrilla software

Biological validation Association between miR-19a and the cell-cycle module was substantiated as an association to proliferation. Further validated using high- throughput transfection assays where transfection of miR-19a to MCF7 cell lines resulted in increased proliferation. GO enrichment for genes correlated to miR-19a

Generic 3 matrices enrichment analysis Two different types of measurements in the same set of samples:  mRNA and miRNA expression (or other non-coding RNAs)  mRNA expression and quantitative clinical phenotypes  mRNA expression and metabolites levels  mRNA expression and copy number Roy Navon Enrichments Correlations Analysis is based on statistical enrichment of annotation elements in lists ranked by correlation Enrichment can be calculated based on any annotation such as GO, pathway, disease ontology or other custom primary data categories Primary Data genes samples Pivot Data samples miRNAs/other Annotation Pathways/GO/other

ENViz: what it is Enrichment Network Visualization (ENViz): a Cytoscape plugin for integrative statistical analysis and visualization of multiple sample matched data sets

Use the main control panel to: Input primary data, pivot, and annotation files Run analysis Set thresholds that control the size of the enrichment network to visualize Run the visualization Separate sub-panels can be collapsed or expanded by clicking on their handles (collapsible subpanels, Bader Lab, U Toronto) Interactive Legend: graphical overview of the workflow. click on labeled boxes for file prompt. drag and drop a file reference onto a labeled box. Control Panel

Enrichment Network Enrichment network built from mRNA and miRNA data from Enerly et al, using WikiPathway annotation. Results are represented as bi-partite graph: nodes = pathways (yellow->red) and miRNAs (grey). Edge represents enrichment of pathway node in the set of genes whose expression correlate the expression pattern of miRNA node, red = positive correlation, blue = negative correlation

Enrichment Network Zoom: Zoom in to see details around selected nodes and edges See zoomed-in network in the context of the whole network on the bottom left

Pathway visualization in WikiPathways Click on selected edge loads and shows corresponding WikiPathway All gene nodes in the mRNA processing pathway that map to primary data elements are color coded (blue -> red) for correlation score between the primary data element (mRNA) and the pivot data element for the clicked edge (hsa-miR-92a) thick borders and high opacity show genes above correlation threshold that were included in the gene set used for enrichment analysis.

Tiling Pathway views Double-click on a Pathway Node to loads multiple WikiPathways, each one colored by correlation with the specific pivot datum for an Edge, connected to the Node, up to a user- configurable limit Network views are tiled in a small multiples view that accentuates contrasts between correlations for different pivot data.

Gene Ontology enrichment and visualization Enrichment network built from Enerly et al. mRNA and miRNA data, and Gene Ontology annotation. left = bi-partite graph for GO terms (yellow -> red scale) and miRNA (grey) edge is enrichment of the GO term in the set of genes most correlated with the miRNA. right = GO summary network for GO terms in the left enrichment network. Each GO nodes color-coded by cumulative enrichment score for its set of pivot nodes. parent terms are added, to complete the GO hierarchy view.

miR oriented GO Terms Double-click on an pivot node in the enrichment network to show GO terms in the GO Summary network that have significant enrichment values for selected pivot. GO Summary network on the right is color-coded by enrichment of genes correlated to miR-150

Summary: key features of ENViz Enrichment of annotation elements among primary data most correlated to secondary(pivot) data across a set of samples for each pivot and each annotation node Representation of results as bi-partite graph (network) Pathway and GO enrichment analysis with customized visualization Zoom-in into results in the context of WikiPathways Interactive and intuitive data loading and analysis Power of network analysis in Cytoscape

Next steps Beta-release for collaborators Working on performance, completeness, robustness for Cytoscape plugin release Extend support for other organisms beyond Homo sapiens, Mus Musculus, mycobacterium tuberculosis Extend the range of database id mappings Possible future features: heatmap view, sample grouping, more built-in annotation types (TFs, disease ontologies)

Acknowledgements Agilent Team –Allan Kuchinsky, Roy Navon, Zohar Yakhini, Michael Creech Technion – Israel Steinfeld Collaborators –Norwegian Radium Hospital, Oslo: Espen Enerly, Kristine Kleivi, Vessela N. Kristensen, Anne-Lise Børresen-Dale –UCSF/Gladstone: Alex Pico, Nathan Salomonis, Kristina Hanspers, Bruce Conklin, Scooter Morris –Maastricht University: Thomas Kelder, Martijn van Iersel, Chris Evelo –Cytoscape core developers and PIs: Trey Ideker, Chris Sander, Gary Bader, Benno Schwikowski, Mike Smoot, Peng Liang, Kei Ono, Leroy Hood, Ben Gross, Ethan Cerami