DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Network analysis Sushmita Roy BMI/CS 576
Markov Networks Alan Ritter.
Probabilistic analog of clustering: mixture models
Network biology Wang Jie Shanghai Institutes of Biological Sciences.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Exact Inference in Bayes Nets
CGeMM – University of Louisville Mining gene-gene interactions from microarray data - Coefficient of Determination Marcel Brun – CGeMM - UofL.
The multi-layered organization of information in living systems
Dynamic Bayesian Networks (DBNs)
GENIE – GEne Network Inference with Ensemble of trees Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling,
Mutual Information Mathematical Biology Seminar
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Global topological properties of biological networks.
How to reconstruct a large genetic network from n gene perturbations in fewer than easy steps Andreas Wagner 2001 Ren-Hua Chung Department of Computer.
Simulation and Application on learning gene causal relationships Xin Zhang.
6. Gene Regulatory Networks
Copyright N. Friedman, M. Ninio. I. Pe’er, and T. Pupko. 2001RECOMB, April 2001 Structural EM for Phylogentic Inference Nir Friedman Computer Science &
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Finding associated genes in large collections of microarrays.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayes Net Perspectives on Causation and Causal Inference
Dependency networks Sushmita Roy BMI/CS 576 Nov 26 th, 2013.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
Gene expression & Clustering (Chapter 10)
Aracne Jorge Viveros Summer 2006 Workshop June 29 th, 2006.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Reverse Engineering of Genetic Networks (Final presentation)
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland.
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Markov Random Fields Probabilistic Models for Images
10 December, 2008 CIMCA2008 (Vienna) 1 Statistical Inferences by Gaussian Markov Random Fields on Complex Networks Kazuyuki Tanaka, Takafumi Usui, Muneki.
Probabilistic modelling in computational biology Dirk Husmeier Biomathematics & Statistics Scotland.
Ritesh Krishna Department Of Computer Science WPCCS July 1, 2008.
Inferring gene regulatory networks with non-stationary dynamic Bayesian networks Dirk Husmeier Frank Dondelinger Sophie Lebre Biomathematics & Statistics.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
ARCANE Anusha Garimella
Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland.
Dependency networks Sushmita Roy BMI/CS 576 Nov 25 th, 2014.
Lecture 2: Statistical learning primer for biologists
Exact Inference in Bayes Nets. Notation U: set of nodes in a graph X i : random variable associated with node i π i : parents of node i Joint probability:
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Finding associated genes in large collections of microarrays
Distance based phylogenetics
Biological networks CS 5263 Bioinformatics.
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Building and Analyzing Genome-Wide Gene Disruption Networks
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
1 Department of Engineering, 2 Department of Mathematics,
Pattern Recognition and Image Analysis
Markov Random Fields Presented by: Vladan Radosavljevic.
Volume 23, Issue 3, Pages (March 2016)
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Markov Networks.
Presentation transcript:

DREAM4 Puzzle – inferring network structure from microarray data Qiong Cheng

Outline Gene Network Gene Regulatory Systems and Related Work FunGen: Reconstructing Biological Networks Using Conditional Correlation Analysis ARACNE: Algorithm for Reconstructing Accurate Cellular Network

Gene Network Directed network –nodes : genes –edges : regulation –including loops –Scale-free: Degree distribution: –power law P(k) ~ k -λ

Genetic Network Generation Schematic Jong Modeling and simulation of genetic regulatory systems: a literature review. J. Comput Biol 2002;9(1):67-103

Random Network Model ER model –each pair of nodes connected by an edge with probability p –Independence of the edges –poisson degree distribution (e.g. P(k) ~ e -k for k) BA model –Scale-free distribution ( P(k) ~ k -x ) –Process: new nodes prefer attached to already high degree nodes

Random Network Model Module extraction from source random scale- free network (used by DREAM3) –Hierarchical scale-free network –Extraction: Random seed node + iteratively adding neighbor nodes with highest modularity Q Marbach D, Schaffter T, Mattiussi C, and Floreano D (2009) Generating Realistic in silico Gene Networks for Performance Assessment of Reverse Engineering Methods. J Comput Biol, 16(2):229–239

Microarray Data Distributions Benford’s law ( in base 10): P(D)=log 10 (1+D -1 ) Zipf’s law: microarray data log-normal distribution as a potential distribution for normalization of the bulk of the corrected spot intensities Noise Source: “Make Sense Of Microarray Data Distributions”

Reverse Engineering Clustering + … Correlation measures + … Optimization method –Bayesian network (conditional independence via DAG) –Markov chains –Dynamic Bayesian network –Expectation maximization (max likelihood) –GA –Neuron network Simulation –Piecewise-linear differential equations –Stochastic equations –Stochastic/hybrid petri-net –Boolean network Regression techniques

FunGen : Reconstructing Biological Networks Using Conditional Correlation Analysis Synthetic network Network dynamics Simulation protocol - perturbation Conditional correlation –Correlation is symetric –Matrix is non-symetric –May lead to indirect connection False positive ( indirect connection ) + false negative ( noise ) – error = FP/(FP+TN) + FN/(FN+TP) Reduce false positive –Choose optimal ρ_opt –Triangle reduction construction

ARACNE: Algorithm for Reconstructing Accurate Cellular Network Assume two-way interaction: pairwise potential determines all statistical dependencies + uniform marginal distributions Mutual information (MI) = measure of relatedness Independency Data processing inequality: if genes g 1 and g 3 interact through g 2 then ARACNE starts with network so for every edge look at gene triplets and remove edge with smallest MI Ignore the direction of the edges Reconstruct tree-network topologies exactly –higher-order potential interactions will not be accounted for (ARACNE’s algorithm will open 3-gene loops). –A two-gene interaction will be detected iff there are no alternate paths.

ARACNE – Example & Evaluation  Synthetic networks: ER, BA Performance to be assessed via Precision-Recall curves (PRCs)  Example:

(Demo) Sample input data file Input_file_name.exp N = 3 # genes M = 2 # microarrays Input file has N+1=4 lines each lines has M+2 (2M+2) fields AffyIDHG_U95Av2SudHL6.CHPST486.CHP G1G G2G G3G header line annotation name Microarray chip names (value,p-value)-chip1 Source from ARACNE slides

(Demo, cont’d) Sample output data file input_data_file_name[non-default_param_vals].adj # lines = N = # genes G1: G2: G3: G4: G5: G6: G7: G8: G9: G10: AffyIDID# Associated gene ID# MI value Source from ARACNE slides