Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Brief introduction on Logistic Regression
CGeMM – University of Louisville Mining gene-gene interactions from microarray data - Coefficient of Determination Marcel Brun – CGeMM - UofL.
Experimental Design, Response Surface Analysis, and Optimization
Ch11 Curve Fitting Dr. Deshi Ye
04/02/2006RECOMB 2006 Detecting MicroRNA Targets by Linking Sequence, MicroRNA and Gene Expression Data Joint work with Quaid Morris (2) and Brendan Frey.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Mutual Information Mathematical Biology Seminar
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Chapter 1 Introduction The solutions of engineering problems can be obtained using analytical methods or numerical methods. Analytical differentiation.
Least Square Regression
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Exploring Network Inference Models Math-in-Industry Camp & Workshop: Michael Grigsby: Cal Poly, Pomona Mustafa Kesir: Northeastern University Nancy Rodriguez:
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
Development of Empirical Models From Process Data
Lecture 5 Curve fitting by iterative approaches MARINE QB III MARINE QB III Modelling Aquatic Rates In Natural Ecosystems BIOL471 © 2001 School of Biological.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Assigning Numbers to the Arrows Parameterizing a Gene Regulation Network by using Accurate Expression Kinetics.
Modeling the Gene Expression of Saccharomyces cerevisiae Δcin5 Under Cold Shock Conditions Kevin McKay Laura Terada Department of Biology Loyola Marymount.
Classification and Prediction: Regression Analysis
Software Refactoring and Usability Enhancement for GRNmap, a Gene Regulatory Network Modeling Application Mathematical Model Equation 2. Equation 3. Future.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
CpE- 310B Engineering Computation and Simulation Dr. Manal Al-Bzoor
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
R. Kass/W03P416/Lecture 7 1 Lecture 7 Some Advanced Topics using Propagation of Errors and Least Squares Fitting Error on the mean (review from Lecture.
Comparison of methods for reconstruction of models for gene expression regulation A.A. Shadrin 1, *, I.N. Kiselev, 1 F.A. Kolpakov 2,1 1 Technological.
MECN 3500 Inter - Bayamon Lecture 9 Numerical Methods for Engineering MECN 3500 Professor: Dr. Omar E. Meza Castillo
A COMPREHENSIVE GENE REGULATORY NETWORK FOR THE DIAUXIC SHIFT IN SACCHAROMYCES CEREVISIAE GEISTLINGER, L., CSABA, G., DIRMEIER, S., KÜFFNER, R., AND ZIMMER,
Summary Various mathematical models describing gene regulatory networks as well as algorithms for network reconstruction from experimental data have been.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
DISCRIMINATIVE TRAINING OF LANGUAGE MODELS FOR SPEECH RECOGNITION Hong-Kwang Jeff Kuo, Eric Fosler-Lussier, Hui Jiang, Chin-Hui Lee ICASSP 2002 Min-Hsuan.
Analysis of the yeast transcriptional regulatory network.
Changes in Gene Regulation in Δ Zap1 Strain of Saccharomyces cerevisiae due to Cold Shock Jim McDonald and Paul Magnano.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
GRNmap and GRNsight June 24, Systems Biology Workflow DNA microarray data: wet lab-generated or published Generate gene regulatory network Modeling.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
© Copyright McGraw-Hill 2000
Steady-state Analysis of Gene Regulatory Networks via G-networks Intelligent Systems & Networks Group Dept. Electrical and Electronic Engineering Haseong.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Introduction to biological molecular networks
Curve Fitting Pertemuan 10 Matakuliah: S0262-Analisis Numerik Tahun: 2010.
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
. Finding Motifs in Promoter Regions Libi Hertzberg Or Zuk.
Chapter 2-OPTIMIZATION G.Anuradha. Contents Derivative-based Optimization –Descent Methods –The Method of Steepest Descent –Classical Newton’s Method.
Inferring gene regulatory networks from multiple microarray datasets (Wang 2006) Tiffany Ko ELE571 Spring 2009.
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
Outline S. cerevisiae, a eukaryote known for cold-shock adaption, used in cold-shock experiments Deletion strand HMO1 and the comparison of microarray.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Computacion Inteligente Least-Square Methods for System Identification.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Chapter 15 Inference for Regression. How is this similar to what we have done in the past few chapters?  We have been using statistics to estimate parameters.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Journal club Jun , Zhen.
Chapter 7. Classification and Prediction
Bulgarian Academy of Sciences
Departments of Biology and Mathematics
CSE 4705 Artificial Intelligence
System Structures Identification
1 Department of Engineering, 2 Department of Mathematics,
Introduction to Instrumentation Engineering
Regression Models - Introduction
1 Department of Engineering, 2 Department of Mathematics,
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
1 Department of Engineering, 2 Department of Mathematics,
Analyzing Time Series Gene Expression Data
SEG5010 Presentation Zhou Lanjun.
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Presentation transcript:

Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T., and Vohradsky, J. (2006) Nucleic Acids Research 35: Jeffrey Crosson 1 Kara M. Dismuke 2 Natalie E. Williams 3 1 Department of Engineering, 2 Department of Mathematics, 3 Department of Biology BIOL398-04/MATH388 March 24, 2015

Outline ●Introduction ●Constructing and Testing the Model o Dynamics of Model o Computational algorithm o Dataset selection o Inference of regulators o Comparison with linear model ●Discussion ●Summary

Outline ●Introduction ●Constructing and Testing the Model o Dynamics of Model o Computational algorithm o Dataset selection o Inference of regulators o Comparison with linear model ●Discussion ●Summary

Introduction ●Regulation of gene expression controls cellular processes ●Microarray data tracks dynamics of gene expression ●Nonlinear differential equation to predict and model gene expression ●Many methods in identifying relationships between genes and their regulators

Outline ●Introduction ●Constructing and Testing the Model o Dynamics of Model o Computational algorithm o Dataset selection o Inference of regulators o Comparison with linear model ●Discussion ●Summary

Dynamic Model of Transcriptional Control ●A result of previous work on the dynamic simulation of genetic networks ●Assumes the recursive action of regulators on the target gene over time ●Assumes the regulatory effect on the expression can be expressed as a combinatorial action of its regulators ●g = regulatory effect for a certain gene ●j = 1, 2, … m ●m = number of regulators for a gene ●w = regulatory weights ●y = expression levels ●b = transcription initiation delay ● ⍴ = sigmoid function of regulatory effect 1 2a 2b

Dynamic Model of Transcriptional Control (Continued) ●z = target gene expression level ●dz/dt = rate of expression ●k1 = maximal rate of expression ●k2 = rate constant of degradation ●a values = computed from the experimental gene expression profile using least squares minimization procedure ●E = error function ●The polynomial fit is an approximation of the true expression profile ●Gene profiles that minimize the mean square error function are sought for ●The result allows the parameters in the differential equation to be estimated

Computational Algorithm ●The aim is to find a set of potential regulators of a certain target gene by estimating its expression profile ●It searches from a group of transcriptional regulators using least squares minimization, the differential equation, and the error function o The differential equation is solved numerically o The parameters w, b, k1, and k2 are optimized with a least squares minimization loop ●The missing data points and fluctuation in gene expression profiles is compensated for by approximating the regulator gene profiles by a polynomial of degree n o The degree n is chosen by the number of of data points in the profile and the level of fluctuations

Computational Algorithm (Continued) 1.Fit regulator gene profiles with a polynomial of degree n 2.Select a target gene 3.Select a candidate regulatory gene from the pool of possible regulators 4.Apply least squares minimization procedure to the target and regulator genes using the differential equation with the error function 5.Repeat step 3 for all possible regulators 6.Select regulators that best satisfy the selection criterion 7.Repeat step 2 for all target genes 5 4 6

Dataset Selection ●The eukaryotic cell cycle dataset published by Spellman and others was chosen to evaluate the performance of the model ●The dataset records changes in gne expressions using microarrays at 18 points in time over two cell cycle periods ●800 genes were identified whose expression was associated with the cell cycle, but the real number of regulators controlling the cell cycle is much smaller ●Therefore 184 potential regulator genes were selected for the identification of yeast cell cycle regulators o By combining data from previous published papers and the YEASTRACT database ●40 target genes were selected o The same ones in the paper by Chen and others

Inference of Regulators ●Data in form of log base 2 of ratio between RNA amount and value of standard (same for all time points) ●Least squares minimization for each target gene for all potential regulators ●Approximation of unknown real profile = least squares best fit of polynomial of degree n to target gene expression profile z p o Estimation of overall error o Deviation from experimental data

●Find regulator profile using: o model (Eqn 4) o minimizing E (Eqn 6) ●Assumption: fit of model to target regulator profile is at least as good as fit given by Eqn 8 o in other words: deviation E (Eqn 6) must be ≤ deviation E 1 (Eqn 9) ●Choose regulators where E ≤ E 1 ●Determine which regulators fit target gene profile better than the others (call “best regulators”) ●Correct Identification: regulator identified was also regulator in YEASTRACT o YEASTRACT: current knowledge, but still incomplete Inference of Regulators

Table 1: Summary of identification of regulators for 40 selected yeast cell cycle regulated genes

Figure 1: Regulators that are repressors have the “opposite” curve as the target genes and reconstructed target curve

Figure 2: Regulators that are activators have a similar curve as the target genes and the reconstructed target curve

Figure 3: The amount of runs used to correctly identify at least one regulator was lower for the nonlinear model. ●A: Nonlinear model ●B: Linear model ●Distribution of order of correctly identified regulators in the sorted list Histogram of distribution of the order of correctly identified regulators in the sorted list of potential regulators.

Outline ●Introduction ●Constructing and Testing the Model o Dynamics of Model o Computational algorithm o Dataset selection o Inference of regulators o Comparison with linear model ●Discussion ●Summary

Discussion: The nonlinear model was able to pair target gene expression with its regulator ●Nonlinear algorithm selected the most probable regulator and provided information about how well it controls the target gene ●Drawbacks: o The model does not test indirect controls of target genes; o Regulators are selected from a pool independently, usually through sequence analysis; o Does not consider that individual target genes may regulate other target genes; and, o Transcriptional regulation also is controlled by proteins which cannot be recorded by microarrays

Discussion: The nonlinear algorithm can lead to further explorations in modeling gene regulatory networks. ●This model focuses on analyzing the influence of all possible regulators of a given target gene and recover basic transcriptional regulations ●Combinatorial control and larger networks can be created by the addition of smaller medium-scale gene regulatory networks ●In the future, the speed of the algorithm will improve and the algorithm may have additional extensions that could allow it to consider other factors in constructing these networks

Outline ●Introduction ●Constructing and Testing the Model o Dynamics of Model o Computational algorithm o Dataset selection o Inference of regulators o Comparison with linear model ●Discussion ●Summary

Summary ●The dynamics of the model were given by: ●The least squares best fit was then compared to the deviations from the experimental data, which resulted in: o Table 1 identified regulators for the target genes o Figures 1 & 2 had the best expression profiles of target/regulator pairs o Figure 3 shows the distribution of the order of correctly identified regulators in the sorted list ●The model is capable of correctly identifying regulators, modeling expression profiles, and predicting if regulators repress or activate