Combinatorially Complex Equilibrium Model Selection Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University.

Slides:



Advertisements
Similar presentations
Biochemistry 2/e - Garrett & Grisham Copyright © 1999 by Harcourt Brace & Company Enzyme Kinetics.
Advertisements

Combinatorially Complex Equilibrium Model Selection Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University.
Kinetics: Reaction Order Reaction Order: the number of reactant molecules that need to come together to generate a product. A unimolecular S  P reaction.
DNTP Supply Gene Expression Patterns in Microarrays after P53 Loss dNTP Supply Gene Expression Patterns in Microarrays after P53 Loss Tom Radivoyevitch.
ENZYMES: KINETICS, INHIBITION, REGULATION
Enzyme Kinetics, Inhibition, and Control
The Search For Mitochondrial Ribonucleotide Reductase Daniel Bai Dr. Christopher Mathews Department of Biochemistry and Biophysics HHMI.
Mathematical Models of dNTP Supply Tom Radivoyevitch, PhD Assistant Professor Epidemiology and Biostatistics CCCC Developmental Therapeutics Program.
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
Marc Riedel Synthesizing Stochasticity in Biochemical Systems Electrical & Computer Engineering Jehoshua (Shuki) Bruck Caltech joint work with Brian Fett.
Mathematical Models of dNTP Supply Tom Radivoyevitch, PhD Assistant Professor Epidemiology and Biostatistics CCCC Developmental Therapeutics Program.
Extracting Essential Features of Biological Networks Natalie Arkus, Michael P. Brenner School of Engineering and Applied Sciences Harvard University.
DNTP Imbalance in Mitochondria Alexandra Frolova Dr. Christopher K. Mathews Laboratory Biochemistry and Biophysics.
Tomas Radivoyevitch, PhD Assistant Professor Epidemiology and Biostatistics Case Western Reserve University
General Features of Enzymes Most biological reactions are catalyzed by enzymes Most enzymes are proteins Highly specific (in reaction & reactants) Involvement.
Objective To learn how to use mathematical models and computer simulations to synthesize various forms of cancer relevant data to yield experimentally.
Medical Biochemistry, Lecture 24
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Chapter 6 Metabolism: Energy and Enzymes. Metabolism The totality of an organism's chemical reactions, consisting of catabolic and anabolic pathways Catabolic.
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
Enzyme Kinetics: Study the rate of enzyme catalyzed reactions. - Models for enzyme kinetics - Michaelis-Menten kinetics - Inhibition kinetics - Effect.
Chapter 12 Enzyme Kinetics, Inhibition, and Control Chapter 12 Enzyme Kinetics, Inhibition, and Control Revised 4/08/2014 Biochemistry I Dr. Loren Williams.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Inhibited Enzyme Kinetics Inhibitors may bind to enzyme and reduce their activity. Enzyme inhibition may be reversible or irreversible. For reversible.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
1 A Combinatorial Toolbox for Protein Sequence Design and Landscape Analysis in the Grand Canonical Model Ming-Yang Kao Department of Computer Science.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
Community Detection by Modularity Optimization Jooyoung Lee
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
6 Energy, Enzymes, and Metabolism. 6 Energy and Energy Conversions To physicists, energy represents the capacity to do work. To biochemists, energy represents.
Presentation Schedule. Homework 8 Compare the tumor-immune model using Von Bertalanffy growth to the one presented in class using a qualitative analysis…
Metabolic pathway alteration, regulation and control (5) -- Simulation of metabolic network Xi Wang 02/07/2013 Spring 2013 BsysE 595 Biosystems Engineering.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Combined Experimental and Computational Modeling Studies at the Example of ErbB Family Birgit Schoeberl.
23.6 Enzymes Three principal features of enzyme-catalyzed reactions: 1. For a given initial concentration of substrate, [S] 0, the initial rate of product.
Enzymes II: Enzyme Kinetics
SURVEY OF BIOCHEMISTRY Enzyme Kinetics and Inhibition
Combinatorially Complex Equilibrium Model Selection Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University.
P53 Missense Mutation Cancer. Outline Disease related to p53 Role and regulation pathway Structure of p53 Missense mutation and consequences Experiment’s.
Biologically-Based Estimates of Radiation- Induced CML Risk Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University.
Rules for deriving rate laws for simple systems 1.Write reactions involved in forming P from S 2. Write the conservation equation expressing the distribution.
LECTURE 4: Principles of Enzyme Catalysis Reading: Berg, Tymoczko & Stryer: Chapter 8 ENZYME An ENZYME is a biomolecular catalyst that accelerates the.
Marc D. Riedel Associate Professor, ECE University of Minnesota EE 5393: Circuits, Computation and Biology ORAND.
Paul D. Adams University of Arkansas Mary K. Campbell Shawn O. Farrell Chapter Six The Behavior of Proteins:
Prof. R. Shanthini 23 Sept 2011 Enzyme kinetics and associated reactor design: Determination of the kinetic parameters of enzyme-induced reactions CP504.
The Practical Side of Nucleotide Metabolism November 29, 2001.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
INTRODUCTION TO Machine Learning 3rd Edition
CHAPTER 17 O PTIMAL D ESIGN FOR E XPERIMENTAL I NPUTS Organization of chapter in ISSO –Background Motivation Finite sample and asymptotic (continuous)
Cell metabolism. Metabolism encompasses the integrated and controlled pathways of enzyme catalysed reactions within a cell Metabolism The word “metabolism”
Nucleotide metabolism
Modeling Combinatorially Complex Ribonucleotide Reductase Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University.
Enzyme Kinetics.
Investigation of the enzymatic processes depending on the type of reaction.
Biochemistry 412 Enzyme Kinetics II April 1st, 2005.
BENG/CHEM/Pharm/MATH 276 HHMI Interfaces Lab 2: Numerical Analysis for Multi-Scale Biology Modeling Cell Biochemical and Biophysical Networks Britton Boras.
5’-IMP 5’-XMP 5’-GMP GDP GTP dGDP dGTP 5’-AMP ADP ATP dADP dATP.
Metabolic pathways. What do we mean by metabolism? Metabolism is the collective term for the thousands of biochemical _________ that occur within a living.
Lesson 5 Enzymes. Catalyst: something that increases the rate of reactions Enzymes are biological catalysts Often ends with –ase Most enzymes are proteins.
Pyruvate Dehydrogenase Kinase 4: A potential drug target in non insulin dependent diabetes mellitus Thandeka Khoza.
1 Department of Engineering, 2 Department of Mathematics,
Computational Diagnostics
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
M.Todd Washington, Louise Prakash, Satya Prakash  Cell 
Volume 112, Issue 3, Pages (February 2003)
Model selection and fitting
Presentation transcript:

Combinatorially Complex Equilibrium Model Selection Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University Website:

Combinatorially Complex Equilibrium Model Selection (overview/title breakdown) Combinatorially Complex Equilibria: Combinatorially Complex Equilibria: few reactants => many possible complexes Generate the possible models + Select or average the best Generate the possible models + Select or average the best Selection/Averaging: Akaike Information Criterion (AIC) Selection/Averaging: Akaike Information Criterion (AIC) AIC decreases with P and then increases AIC decreases with P and then increases Billions of models, but only thousands near AIC upturn Billions of models, but only thousands near AIC upturn Challenge: Generate 1P, 2P, 3P model space chunks sequentially Challenge: Generate 1P, 2P, 3P model space chunks sequentially 8GB RAM => max of ~5000 models at a time 8GB RAM => max of ~5000 models at a time Models = Hypotheses Models = Hypotheses Spurs: complete K = ∞  [Complex]=0 Spurs: complete K = ∞  [Complex]=0 Grids: binary K equal each other Grids: binary K equal each other R package ccems (CRAN) implements the methods R package ccems (CRAN) implements the methods

Center Relevance Structures constrain minimal simplicity and maximal complexity of possible complexes and models in the model space Structures constrain minimal simplicity and maximal complexity of possible complexes and models in the model space Move prior knowledge of structures into enzyme models Move prior knowledge of structures into enzyme models System Biology models integrate enzyme models System Biology models integrate enzyme models Protein-Protein Protein-Ligand Interaction readouts: Protein-Protein Protein-Ligand Interaction readouts: Enzyme activities and DLS mass (i.e. average measures) Enzyme activities and DLS mass (i.e. average measures) Mass distributions of complexes (e.g. SEC and GEMMA) Mass distributions of complexes (e.g. SEC and GEMMA) Validate omic-predicted interactions and model them Validate omic-predicted interactions and model them PEPCC (Harry Gill): Systematically express, purify and characterize interaction suspects (e.g. of core wnt signaling proteins) PEPCC (Harry Gill): Systematically express, purify and characterize interaction suspects (e.g. of core wnt signaling proteins)

Ultimate Goal Present Future Safer flying airplanes with autopilots Safer flying airplanes with autopilots Ultimate Goal: individualized, state feedback based clinical trials Ultimate Goal: individualized, state feedback based clinical trials Better understanding => better control Better understanding => better control Conceptual models help trial designs today Conceptual models help trial designs today Computer models of airplanes help train pilots and autopilots Computer models of airplanes help train pilots and autopilots Radivoyevitch et al. (2006) BMC Cancer 6:104

dNTP Supply System Figure 1. dNTP supply. Many anticancer agents act on or through this system to kill cells. The most central enzyme of this system is RNR. UDP CDP GDP ADP dTTP dCTP dGTP dATP dT dC dG dA DNA dUMP dU TS DCTD dCK DNA polymerase TK1 cytosol mitochondria dT dC dG dA TK2 dGK dTMP dCMP dGMP dAMP dTTP dCTP dGTP dATP 5NT NT2 cytosol nucleus dUDP dUTP dUTPase dN dCK flux activation inhibition ATP or dATP RNR dCK

R1 R2 R1 R2 UDP, CDP, GDP, ADP bind to catalytic site ATP, dATP, dTTP, dGTP bind to selectivity site dATP inhibits at activity site, ATP activates at activity site? 5 catalytic site states x 5 s-site states x 3 a-site states x 2 h-site states = 150 states  (150) 6 different hexamer complexes => 2^(150) 6 models ~ 1 followed by a trillion zeros ATP activates at hexamerization site?? Ribonucleotide Reductase (RNR) R2 RNR is Combinatorially Complex

Michaelis-Menten Model RNR: no NDP and no R2 dimer => k cat of complex is zero, else different R1-R2-NDP complexes can have different k cat values. E + S ES

solid line = Eqs. (1-2) dotted = Eq. (3) Data from Scott, C. P., Kashlan, O. B., Lear, J. D., and Cooperman, B. S. (2001) Biochemistry 40(6), R=R1 r=R2 2 G=GDP t=dTTP Substitute this in here to get a quadratic in [S] which solves as Bigger systems of higher polynomials cannot be solved algebraically => use ODEs (above) Michaelis-Menten Model [S] vs. [S T ] [S] vs. [S T ] (3)

E ES EI ESI E ES EI ESI E ES EI E ES EI ESI E EI ESI E ES ESI E EI ESI E ES ESI = = E EI E ESI E ES E Competitive inhibition uncompetitive inhibition if k cat_ESI =0 E | ES EI | ESI noncompetitive inhibition Example of K=K’ Model = = Enzyme, Substrate and Inhibitor

Total number of spur graph models is 16+4=20 Radivoyevitch, (2008) BMC Systems Biology 2:15 Rt Spur Graph Models R RR RRtt RRt Rt R RRtt RRt Rt R RR RRtt Rt R RR RRt Rt R RR RRtt RRt R RRtt Rt R RRt Rt R RR Rt IJJJ JJIJ JJJIJIJJ IJIJIJJIJJII JJJJ R RRtt RRt R RR RRtt R RR RRt R Rt R RRtt R RRt JIIJ IIJJ JIJI R RR R IJII IIIJIIJI JIII IIII R Rt R RRtt R RRt R RR I0II III0 II0I0III R = R1 t = dTTP for dTTP induced R1 dimerization (RR, Rt, RRt, RRtt)

R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t = = = = | | | Rt Grid Graph Models R t RR t Rt R t RRt t RRtt K R_R K R_t K RRt_t K RR_t = = = = = = = = = = = = = = = = = = = R t RR t Rt R t RRt t Rt RRtt K R_R K Rt_R K Rt_Rt K R_t | | | | | | | | | | | | | | | ? ? HIFF HDFFHDDD =

AIC c = N*log(SSE/N)+2P+2P(P+1)/(N-P-1) Scott, C. P., Kashlan, O. B., Lear, J. D., and Cooperman, B. S. (2001) Biochemistry 40(6), Radivoyevitch, (2008) BMC Systems Biology 2:15 Application to Data HDFF = = R RRtt IIIJ III0m ModelParameterInitial Value Optimal Value Confidence Interval 1 III0mm (79.838, ) SSE AIC IIIJR2t21.000^32.725^3(2.014^3, 3.682^3) SSE AIC HDFFR2t (0, ) R1t0_t (0.003, ) R2t0_t (0.000, ) SSE AIC

= 28 parameters => 2 28 =2.5x10 8 spur graph models via K j =∞ hypotheses 28 models with 1 parameter, 428 models with 2, 3278 models with 3, with 4 R = R1 X = ATP Yeast R1 structure. Dealwis Lab, PNAS 102, , 2006 ATP-induced R1 Hexamerization Kashlan et al. Biochemistry :462

Kolmogorov-Smirnov Test p < of top 30 did not include an h-site term; 28/30 ≠ 503/2081 with p < This suggests no h-site. Top 13 included R6X8 or R6X9, save one, single edge model R6X7 This suggests that less than 3 a-sites are occupied in hexamer Models with SSE < 2 min (SSE) Data from Kashlan et al. Biochemistry :462

Conclusions (so far) 1.The dataset does not support the existence of an h-site 2.The dataset suggests that ~1/2 of the a-site are not occupied by ATP R1 a a [ATP]=~1000[dATP] So system prefers to have 3 a-sites empty and ready for dATP Inhibition versus activation is partly due to differences in pockets a a a a UDP CDP GDP ADP dTTP dCTP dGTP dATP dT dC dG dA DNA dUMP dU TS DCTD dCK DNA polymerase TK1 cytosol mitochondria dT dC dG dA TK2 dGK dTMP dCMP dGMP dAMP dTTP dCTP dGTP dATP 5NT NT2 cytosol nucleus dUDP dUTP dUTPase dN dCK flux activation inhibition ATP or dATP RNR dCK

X 8 K Models x 8 k Models => 64 Models Tetrameric Enzyme Models (e.g. Thymidine Kinase 1) If [S] ≈ [S T ]

Complete K vs. Binary K

K = (0.85, 0.69, 0.65, 0.51) µM k = (3.3, 3.9, 4.1, 4.1) sec -1 k max = 4.1/sec S 50 = 0.6 µM h = 1.1 DFFF.DDDD DDLL.DDDD DDDM.DDDD DDDD.DFFF DDDD.DDLL DDDD.DDDM

8-parameter model fits to the datasets

K = (10 -6, 10 -3, 1, 10 3 ) A) k = (100, 10, 40, 10) B) k = (10, 10, 40, 100) With parameters initially at ½ their true values, A identified itself, B had difficulties (its 1 st two K estimates were off 2-5-fold). Horizontal lines are k 1 /4, 2k 2 /4, 3k 3 /4 and 4k 4 /4 Vertical lines are K 1 /4, 2K 2 /3, 3K 3 /2 and 4K 4

Combinatorially Complex Equilibrium Model Selection (ccems, CRAN 2009) Systems Biology Markup Language interface to R (SBMLR, BIOC 2004) Model networks of enzymes Model individual enzymes SUMMARY R1 R2 R1 R2

Figure 8. T. Thorsen et al. (S. R. Quake Lab) Science 2002 Figure 9. J. Melin and S. R. Quake Annu. Rev. Biophys. Biomol. Struct :213–31 Background: Quake Lab Microfluidics Figure 9 shows how a peristaltic pump is implemented by three valves that cycle through the control codes 101, 100, 110, 010, 011, 001, where 0 and 1 represent open and closed valves; note that the 0 in this sequence is forced to the right as the sequence progresses.

Adaptive Experimental Designs Find best next 10 measurement conditions given models of data collected. Need automated analyses in feedback loop of automatic controls of microfluidic chips

setpoint KpKp KiKi ∫ Σ hot plate water temperature Simple example of a control system for a single-input single-output (SISO) system

Emphasis is on the stochastic component of the model. Is there something in the black box or are the input wires disconnected from the output wires such that only thermal noise is being measured? Do we have enough data? Model components: (Deterministic = signal) + (Stochastic = noise) Statistics Engineering Emphasis is on the deterministic component of the model We already know what is in the box, since we built it. The goal is to understand it well enough to be able to control it. Predict the best multi-agent drug dose time course schedules Increasing amounts of data/knowledge Why Systems Biology

dNTPs + Analogs DNA + Drug-DNA Damage Driven or S-phase Driven dNTP demand is either DNA repair Salvage De novo MMR - Cancer Treatment Strategy IUdR

Conclusions For systems biology to succeed: For systems biology to succeed: –move biological research toward systems which are best understood –specialize modelers to become experts in biological literatures (SB is not a service)

Acknowledgements Case Comprehensive Cancer Center Case Comprehensive Cancer Center NIH (K25 CA104791) NIH (K25 CA104791) Chris Dealwis (CWRU Pharmacology) Chris Dealwis (CWRU Pharmacology) Anders Hofer (Umea) Anders Hofer (Umea) Thank you Thank you

Indirect Approach pro-B Cell Childhood ALL T: TEL-AML1 with HR T: TEL-AML1 with HR t : TEL-AML1 with CCR t : TEL-AML1 with CCR t : other outcome t : other outcome B: BCR-ABL with CCR B: BCR-ABL with CCR b: BCR-ABL with HR b: BCR-ABL with HR b: censored, missing, or other outcome b: censored, missing, or other outcome Ross et al: Blood 2003, 102: Yeoh et al: Cancer Cell 2002, 1: Radivoyevitch et al., BMC Cancer 6, 104 (2006)

THF CH 2 THF CH 3 THF CHOTHF DHF CHODHF HCHO GAR FGAR AICAR FAICAR dUMP dTMP NADP + NADPH NADP + NADPH NADP + NADPH MetHcys Ser Gly GART ATIC TS ATP ADP 11R 2R HCOOH MTHFD MTHFR MTR DHFR SHMT FTS FDS Morrison PF, Allegra CJ: Folate cycle kinetics in human breast cancer cells. JBiolChem 1989, 264:

library(ccems) # Thymidine Kinase Example topology = list( heads=c("E1S0"), # E1S0 = substrate free E sites=list( c=list( # c for catalytic site t=c("E1S1","E1S2","E1S3","E1S4") ) # t for tetramer ) ) # TK1 is 25kDa = 25mg/umole, so 1 mg =.04 umoles g = mkg(topology, activity=T,TCC=F) dd=subset(TK1,(year==2000),select=c(E,S,v)) dd=transform(dd, ET=ET/4,v=ET*v/(.04*60))# now uM/sec tops=ems(dd,g,maxTotalPs=8,kIC=10,topN=96)#9 min on 1 cpu library(ccems) # Ribonucleotide Reductase Example topology <- list( heads=c("R1X0","R2X2","R4X4","R6X6"), sites=list( # s-sites are already filled only in (j>1)-mers a=list( #a-site thread m=c("R1X1"), # monomer 1 d=c("R2X3","R2X4"), # dimer 2 t=c("R4X5","R4X6","R4X7","R4X8"), # tetramer 3 h=c("R6X7","R6X8","R6X9","R6X10", "R6X11", "R6X12") # hexamer 4 ), # tails of a-site threads are heads of h-site threads h=list( # h-site m=c("R1X2"), # monomer 5 d=c("R2X5", "R2X6"), # dimer 6 t=c("R4X9", "R4X10","R4X11", "R4X12"), # tetramer 7 h=c("R6X13", "R6X14", "R6X15","R6X16", "R6X17", "R6X18")# hexamer 8 ) g=mkg(topology,TCC=TRUE) dd=subset(RNR,(year==2002)&(fg==1)&(X>0),select=c(R,X,m,year)) cpusPerHost=c("localhost" = 4,"compute-0-0"=4,"compute-0-1"=4,"compute-0-2"=4) top10=ems(dd,g,cpusPerHost=cpusPerHost, maxTotalPs=3, ptype="SOCK",KIC=100) K = e ∆G/RT => K = (0, ∞) maps to ∆G = (-∞, ∞) Model weights e ∆AIC /∑e ∆AIC

Comments on Methods

Conjecture Greater X/R ratio dominates at high Ligand concentrations In this limit the system wants to partition As much ATP into a bound form as possible Bias in 1 st four residuals => Do not weight errors