Detecting robust time-delayed regulation in Mycobacterium tuberculosis Iti Chaturvedi and Jagath C Rajapakse INCOB 2009.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

A Tutorial on Learning with Bayesian Networks
INTRODUCTION TO MACHINE LEARNING Bayesian Estimation.
BIOINFORMATICS GENE DISCOVERY BIOINFORMATICS AND GENE DISCOVERY Iosif Vaisman 1998 UNIVERSITY OF NORTH CAROLINA AT CHAPEL HILL Bioinformatics Tutorials.
Hidden Markov Models (1)  Brief review of discrete time finite Markov Chain  Hidden Markov Model  Examples of HMM in Bioinformatics  Estimations Basic.
HMM II: Parameter Estimation. Reminder: Hidden Markov Model Markov Chain transition probabilities: p(S i+1 = t|S i = s) = a st Emission probabilities:
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
Introduction of Probabilistic Reasoning and Bayesian Networks
 CpG is a pair of nucleotides C and G, appearing successively, in this order, along one DNA strand.  CpG islands are particular short subsequences in.
Profiles for Sequences
Hidden Markov Models Theory By Johan Walters (SR 2003)
GS 540 week 6. HMM basics Given a sequence, and state parameters: – Each possible path through the states has a certain probability of emitting the sequence.
HMM for CpG Islands Parameter Estimation For HMM Maximum Likelihood and the Information Inequality Lecture #7 Background Readings: Chapter 3.3 in the.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Hidden Markov Models I Biology 162 Computational Genetics Todd Vision 14 Sep 2004.
1. Elements of the Genetic Algorithm  Genome: A finite dynamical system model as a set of d polynomials over  2 (finite field of 2 elements)  Fitness.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
1 Genetic Algorithms. CS The Traditional Approach Ask an expert Adapt existing designs Trial and error.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
Gene Regulatory Networks - the Boolean Approach Andrey Zhdanov Based on the papers by Tatsuya Akutsu et al and others.
Efficient Estimation of Emission Probabilities in profile HMM By Virpi Ahola et al Reviewed By Alok Datar.
6. Gene Regulatory Networks
Class 5 Hidden Markov models. Markov chains Read Durbin, chapters 1 and 3 Time is divided into discrete intervals, t i At time t, system is in one of.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
. Parameter Estimation For HMM Lecture #7 Background Readings: Chapter 3.3 in the text book, Biological Sequence Analysis, Durbin et al.,  Shlomo.
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl
國立陽明大學生資學程 陳虹瑋. Genetic Algorithm Background Fitness function ……. population selection Cross over mutation Fitness values Random cross over.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Genetic Algorithm.
Isolated-Word Speech Recognition Using Hidden Markov Models
BLAST What it does and what it means Steven Slater Adapted from pt.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
. Parameter Estimation For HMM Lecture #7 Background Readings: Chapter 3.3 in the text book, Biological Sequence Analysis, Durbin et al., 2001.
Intro. ANN & Fuzzy Systems Lecture 36 GENETIC ALGORITHM (1)
Fundamentals of Hidden Markov Model Mehmet Yunus Dönmez.
ResponseNet revealing signaling and regulatory networks linking genetic and transcriptomic screening data CSE Fall.
Motif finding with Gibbs sampling CS 466 Saurabh Sinha.
Hidden Markov Models Usman Roshan CS 675 Machine Learning.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Learning With Bayesian Networks Markus Kalisch ETH Zürich.
Slides for “Data Mining” by I. H. Witten and E. Frank.
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
Genetic Algorithms. 2 Overview Introduction To Genetic Algorithms (GAs) GA Operators and Parameters Genetic Algorithms To Solve The Traveling Salesman.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Dependency networks Sushmita Roy BMI/CS 576 Nov 25 th, 2014.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
The Standard Genetic Algorithm Start with a “population” of “individuals” Rank these individuals according to their “fitness” Select pairs of individuals.
A Cooperative Coevolutionary Genetic Algorithm for Learning Bayesian Network Structures Arthur Carvalho
Hidden Markov Model Parameter Estimation BMI/CS 576 Colin Dewey Fall 2015.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
1 Comparative Study of two Genetic Algorithms Based Task Allocation Models in Distributed Computing System Oğuzhan TAŞ 2005.
Genetic Algorithm(GA)
Hidden Markov Models BMI/CS 576
Learning gene regulatory networks in Arabidopsis thaliana
Bud Mishra Professor of Computer Science and Mathematics 12 ¦ 3 ¦ 2001
Compact Query Term Selection Using Topically Related Text
Christopher A. Penfold Vicky Buchanan-Wollaston Katherine J. Denby And
Hidden Markov Models Part 2: Algorithms
N-Gram Model Formulas Word sequences Chain rule of probability
Regulation Analysis using Restricted Boltzmann Machines
Parametric Methods Berlin Chen, 2005 References:
Presentation transcript:

Detecting robust time-delayed regulation in Mycobacterium tuberculosis Iti Chaturvedi and Jagath C Rajapakse INCOB 2009

Gene Regulatory Networks (GRN) GRN represents the regulatory effects (causal effects) among genes involved in a particular pathway. Signal transduction may is transient – dynamic delayed regulations seem to exist. Distributed nature causes intense cross-talk M. tuberculosis is a bacteria causing TB in man and has a very slow growth rate in vitro. The DNA repair pathway is activated when a damage to DNA occurs. System consists of LexA and RecA and upto 40 other genes that are regulated by these two proteins. linB Rv2719c lexA fadd21 recA dnaB fadd23 ruvC dnaE2 agenda

Methods of Building GRN Bayesian networks (BN): graphical models with regulations denoted by conditional probabilities (Heckerman et al 1995) Dynamic BN (DBN): Transition network over time can model cyclic events (Friedman, N. et al 1998) Higher-order (HDBN): Extended transition network for longer delays (Zheng et al 2006) Skip-chain BN : Can model very long delays of arbitary length using feature functions. (Galley, M. 2006) a i are the parents of gene i

Optimization Using GA Crossover involves swapping several rows between two parents. Each individual is an o-nary interaction matrix where for no interaction and for o-order interaction

GA Algorithm Initialize N individuals with 0.7 similarity using Mutual Information Rank all individuals using fitness function. Elite individual E is sent to next generation. Select two parents using roulette strategy for mating. Swap last few rows of the selected parents to generate two new children. Try another crossover point if population similarity is < 0.7 Invert a random cell to cause mutation If d ga < 1 for 20 generations or number of generations greater than Q No Yes E is optimal network Initialize Selection Mating Crossover Mutate Terminate

Skip-chain Model The likelihood of a gene expression x i is given by a weighted sum of linear and skip edge scores : Linear-chain feature functions represent local dependencies of o-order. The skip-chain features represent long range dependencies in a GRN using a HMM agenda

Viterbi Forward Path The skip-edge score is given as a the normalized MAP interaction: We can use maximum likelihood to estimate state transition and emission probabilities : where denotes number of occurrences for The most probable path is given by MAP estimate using dynamic programming :

Priors for Networks Most higher-order Markov models are sensitive to change in pathways and associated data. –Gibbs prior is used to model target network prior. –Interaction potentials, denotes an interaction in target network and no interaction –Here a small and large will reflect prior more and vice versa. We use adaption to reduce over-fitting due to sparse feature specific data. Adaption model can combine the reliable DBN with a volatile feature specific HMM for long delays. where

Dirichlet Prior over Parameters We extend the MLE to a Bayesian learning. Dirichlet is a conjugate prior for multinomial distribution. We can maximize probability as (MAP) : Using the linear feature as a Dirichlet conjugate prior for the skip feature of a gene we get : Lastly, the interpolated probability of gene based on linear and skip-edges is : where

Experiments : M. tuberculosis Here we looked at the response of bacteria to drug-induced stress. Treatment with Mitocyin C caused DNA damage and hence led to the upregulation of associated repair genes. Eight time points are available at NCBI Gene Expression Omnibus (GSE1642-GPL1396 series) 0.33hr, 0.75hr,1.5hr, 2hr, 4hr, 6hr, 8hr and 12hr after DNA damage. Data was discretized into 0 for down and 1 for up regulation. The corresponding skip probabilities were calculated as described in methods. Upto seven time points of delays were allowed. Firstly, we used 9 genes previously specified. In order to get an expanded dataset, the original dataset was subjected to ICA and the components closest to 9 genes were identified. This gave us a second dataset of 32 genes. agenda

Table 1. Predicted by DBN, HDBN, and skip-chain without priors Higher-order edges(hrs) # GenesModel:oML1(0.75)2(1.5)3(2)4(4)5(6) 9 DBN: HDBN: SKIP-CHAIN:1/ (3) 32 DBN: HDBN: SKIP-CHAIN:2/ (41)(4) Time delays It can be seen that the ML of the underlying skip-chain prediction is much higher than the DBN or HDBN, confirming that the network fits data well.

Time delays : Priors Higher-order edges(hrs) # GenesModel:oML 1(0.75)2(1.5)3(2)4(4)5(6) 9 SKIP-CHAIN: (3) SKIP-CHAIN(Gibbs): (2) SKIP-CHAIN(Dirichlet): (11)(1) SKIP-CHAIN(Gibbs and Dirichlet): (4)(5) 32 SKIP-CHAIN: (41)(4) SKIP-CHAIN(Gibbs): (40)(3) SKIP-CHAIN(Dirichlet): (37)(4) SKIP-CHAIN(Gibbs and Dirichlet): (41)(4) Table 2. Predicted by skip-chain models with priors: Gibbs prior for structures and Dirichlet prior for parameters Using priors further increased likelihood and gave many new time- delayed interactions. The combined use of both Dirichlet and Gibbs priors is optimal.

Networks 9 genes Fig 3 : Time-delayed interactions in predicted network of 9 genes (a) DBN network, (b) HDBN network, (c) Skip- chain network, (d) Skip-chain network with Gibbs prior, (e) Skip-chain network with Dirichlet prior, (f) Skip-chain network with Gibbs and Dirichlet prior A small number of transcription factors (TF) regulate the rest of the repair system. At the same time the in-degree is low, as each gene is regulated by just one TF.

Networks 32 genes Fig 4 : Time-delayed interactions in predicted network of 32 genes (a) DBN network, (b) HDBN network, (c) Skip- chain network, (d) Skip-chain network with Gibbs prior, (e) Skip-chain network with Dirichlet prior, (f) Skip-chain network with Gibbs and Dirichlet prior The second dataset of 32 genes indicated that our method is good for identifying core genes. Use of priors gave better networks with fewer hubs.

Conclusion An organism responds to changes in its environment by altering the level of expression of critical genes. Skip-chain models address the difficulties of a HDBN by easily incorporating long time-delayed regulations. Numerous time-delays are identified between the same pair of genes. The forward Viterbi path determines the best long-distant time delay between two genes. Using priors gave us higher likelihood and improved the over-fitting in building the regulatory networks. The work can be extended to skip-chain conditional random fields and fusion with protein interaction networks.

Thank You backup