Presentation on theme: "Introduction to Steady State Metabolic Modeling Concepts Flux Balance Analysis Applications Predicting knockout phenotypes Quantitative Flux Prediction."— Presentation transcript:
Introduction to Steady State Metabolic Modeling Concepts Flux Balance Analysis Applications Predicting knockout phenotypes Quantitative Flux Prediction Lab Practical Flux Balance Analysis of E. coli central metabolism Predicting knockout phenotypes Modeling M. tuberculosis Mycolic Acid Biosynthesis
Why Model Metabolism? Predict the effects of drugs on metabolism –e.g. what genes should be disrupted to prevent mycolic acid synthesis Many infectious disease processes involve microbial metabolic changes –e.g. switch from sugar to fatty acid metabolism in TB in macrophages
Genome Wide View of Metabolism Streptococcus pneumoniae Explore capabilities of global network How do we go from a pretty picture to a model we can manipulate?
uptake Steady State Assumptions Dynamics are transient At appropriate time- scales and conditions, metabolism is in steady state A B uptakeconversionsecretion Two key implications 1.Fluxes are roughly constant 2.Internal metabolite concentrations are constant t Conversion t Steady state
Metabolic Flux Output fluxes Input fluxes Volume of pool of water = metabolite concentration Slide Credit: Jeremy Zucker
Reaction Stoichiometries Are Universal The conversion of glucose to glucose 6-phosphate always follows this stoichiometry : 1ATP + 1glucose = 1ADP + 1glucose 6-phosphate This is chemistry not biology. Biology => the enzymes catalyzing the reaction Enzymes influence rates and kinetics Activation energy Substrate affinity Rate constants Not required for steady state modeling!
Metabolic Flux Analysis Use universal reaction stoichiometries to predict network metabolic capabilities at steady state
Stoichiometry As Vectors We can denote the stoichiometry of a reaction by a vector of coefficients One coefficient per metabolite –Positive if metabolite is produced –Negative if metabolite is consumed Example: Metabolites: [ A B C D ] T Reactions: 2A + B -> C C -> D Stoichiometry Vectors: [ -2 -1 1 0 ] T [ 0 0 -1 1 ] T
A (Very) Simple System We have introduced two new things Reversible reactions – are represented by two reactions that proceed in each direction (e.g. v4, v5) Exchange reactions – allow for fluxes from/into an infinite pool outside the system (e.g. vin and vout). These are frequently the only fluxes experimentally measured. A B D C vin v1 v3 v2 v5 vout v4 ABCDABCD v1 v2 v3 v4 v5 vin vout 0 1 0 0 10001000 1 0 1 0 1 0 1 Exchange Reactions
Calculating changes in concentration ABCDABCD v1 v2 v3 v4 v5 vin vout 0 1 0 0 10001000 1 0 1 0 1 0 1 A B D C vin v1 v3 v2 v5 vout v4 What happens if v in is 1 “unit” per second 1 0 0 0 0 0 0 0 v1 0 v2 0 v3 0 v4 0 v5 1 vin 0 vout = 10001000 dA/dt dB/dt dC/dt dD/dt A grows by 1 “unit” per “second” We can calculate this with S Given these fluxes These are the changes in metabolite concentration
The Stoichiometric Matrix V is a vector of fluxes through each reaction Then S*V is a vector describing the change in concentration of each metabolite per unit time R1 R2 R3 R4 R5 R6 R7 R8 R9 R10 0 -2 0 1 0 1 0 0 0 1 0 ABCDEFGHIABCDEFGHI =
Some advantages of S Chemistry not Biology: the stoichiometry of a given reaction is preserved across organisms, while the reaction rates may not be preserved Does NOT depend on kinetics or reaction rates Depends on gene annotations and mapping from gene to reactions Depends on information we frequently already have
Genes to Reactions Expasy enzyme database Indexed by EC number EC numbers can be assigned to genes by –Blast to known genes –PFAM domains
Online Metabolic Databases Pathlogic/BioCyc Kegg There are several online databases with curated and/or automated EC number assignments for sequenced genomes
From Genomes to the S Matrix ABCDEFGHIABCDEFGHI R1 R2 R3 R4 R5 R6 R7 R8 R9 R10 0 -2 0 1 0 1 0 0 0 1 0 Columns encode reactions Relationships btw genes and rxns -1 gene 1 rxn -1 gene 1+ rxns -1+ genes 1 rxn The same reaction can be included as multiple roles (paralogs) Gene AGene B Gene C Examples Gene DGene E Gene E’ Enzyme AEnzyme B/CEnyzme DEnzyme E Enzyme E’ Same rxn
What Can We Use S For? From S we can investigate the metabolic capabilities of the system. We can determine what combination of fluxes (flux configurations) are possible at steady state
Flux Configuration, V Imagine we have another simple system: v1 v2 v3 V Flux Configuration AB v1 CD v3 v2 flux v1 flux v2 flux v3 1 2 3 4 1 123 Rate of change of C to D per unit time 113113 V Flux Configuration 113 We want to know what region of this space contains feasible fluxes given our constraints
The Steady State Constraint We have But also recall that at steady state, metabolite concentrations are constant: dx/dt=0
Positive Flux Constraint *recall that reversible reactions are represented by two unidirectional fluxes * All steady state flux vectors, V, must satisfy these constraints What region do these V live in? The solution through convex analysis flux v1 flux v2 flux v3 ?
The Flux Cone Every steady state flux vector is inside this cone Edges of the cone are circumscribed by Extreme Pathways v p1 p1 p2 p2 p3 p3 p4 p4 flux v1 flux v2 flux v3 Solution is a convex flux cone At steady state, the organism “lives in” here
Extreme Pathways Extreme pathways are “fundamental modes” of the metabolic system at steady state They are steady state flux vectors All other steady state flux vectors are non-negative linear combinations v p1 p1 p2 p2 p3 p3 p4 p4 flux v1 flux v2 flux v3
Example Extreme Patways A B D C vin v1 v3 v2 v4 vout ABCDABCD v1 v2 v3 v4 vin vout 0 1 0 0 10001000 1 0 1 0 1 b1 110011110011 001111001111 b2 v1 v2 v3 v4 vin vout All steady state fluxes configurations are combinations of these extreme pathways A B D C vin v1 v3 v2 v4 vout b1 b2 1 1 1 1 1 1 1 1
Capping the Solution Space Cone is open ended, but no reaction can have infinite flux Often one can estimate constraints on transfer fluxes –Max glucose uptake measured at maximum growth rate –Max oxygen uptake based on diffusivity equation Flux constraints result in constraints on extreme pathways p1 p1 p2 p2 p3 p3 p4 p4 flux v1 flux v2 flux v3 p1 p1 p2 p2 p3 p3 p4 p4 max flux v3 flux v1 flux v2 flux v3
The Constrained Flux Cone p1 p1 p2 p2 p3 p3 p4 p4 flux v1 flux v2 flux v3 Contains all achievable flux distributions given the constraints: –Stoichiometry –Reversibility –Max and Min Fluxes Only requires: –Annotation –Stoichiometry –Small number of flux constraints (small relative to number of reactions)
Selecting One Flux Distribution p1 p1 p2 p2 p3p3 p4 p4 flux v1 flux v2 flux v3 At any one point in time, organisms have a single flux configuration How do we select one flux configuration? We will assume organisms are trying to maximize a “fitness” function that is a function of fluxes, F(v)?
Optimizing A Fitness Function p1 p1 p2 p2 p3p3 p4 p4 NADH->NADPH (v1) AMP->ADP (v2) ADP->ATP (v3) Imagine we are trying to optimize ATP production Then a reasonable choice for the fitness function is F(V)=v3 Goal: find a flux in the cone that maximizes v3 7 5 If we choose F(v) to be a linear function of V: The optimizing flux will always lie on vertex or edge of the cone Linear Programming
Flux Balance Analysis p1 p1 p2 p2 p3p3 p4 p4 flux v1 flux v2 flux v3 Start with stoichiometric matrix and constraints: Choose a linear function of fluxes to optimize: Use linear programming we can find a feasible steady state flux configuration that maximizes the F (all i) (some j)
Optimizing E. coli Growth Z = 41.257v ATP - 3.547v NADH + 18.225v NADPH + 0.205v G6P + 0.0709v F6P +0.8977v R5P + 0.361v E4P + 0.129v T3P + 1.496v 3PG + 0.5191v PEP +2.8328v PYR + 3.7478v AcCoA + 1.7867v OAA + 1.0789v AKG Metabolite(mmol) ATP41.257 NADH-3.547 NADPH18.225 G6P0.205 F6P0.0709 R5P0.8977 E4P0.361 T3P0.129 3PG1.496 PEP0.5191 PYR2.8328 AcCoA3.7478 OAA1.7867 AKG1.0789 For one gram of E. coli biomass, you need this ratio of metabolites Assuming a matched balanced set of metabolite fluxes, you can formulate this objective function
FBA Overview Stoichiometric Matrix Gene annotation Enzyme and reaction catalog Feasible Space S*v=0 Add constraints: v i >0 i >v i > i Optimal Flux Growth objective Z=c*v Solve with linear programming Flux solution
in silico Deletion Analysis Can we predict gene knockout phenotype based on their simulated effects on metabolism? Q: Why, given other computational methods exist? (e.g. protein/protein interaction map connectivity) A: Other methods do not directly consider metabolic flux or specific metabolic conditions
in silico Deletion Analysis A B D C vin v1 v3 v2 v5 vout v4 ABCDABCD v1 v2 v3 v4 v5 vin vout 0 1 0 0 10001000 1 0 1 0 1 0 1 “wild-type” “mutant” A B D C vin v1 v3 v2 v5 vout v4 ABCDABCD v1 v2 v3 v4 v5 vin vout 0 1 0 0 10001000 1 0 1 0 1 0 1 Gene knockouts modeled by removing a reaction
Mutations Restrict Feasible Space Removing genes removes certain extreme pathways Feasible space is constrained If original optimal flux is outside new space, new optimal flux is predicted Calculate new optimal flux No change in optimal flux Difference of F(V) (i.e growth) between optima is a measure of the KO phenotype
Mutant Phenotypes in E. coli PNAS| May 9, 2000 | vol. 97 | no. 10 Model of E. coli central metabolism 436 metabolites 720 reactions Simulated mutants in glycolysis, pentose phosphate, TCA, electron transport Edward & Palsson (2000) PNAS
E. coli KO simulation results If Zmutant/Z =0, mutant is no growth (-) growth (+) otherwise Compare to experiment (in vivo / prediction) 86% agree Used FBA to predict optimal growth of mutants (Z mutant ) versus non-mutant (Z) Simulated growth on glucose, galactose, succinate, and acetate “lethal” “reduced growth” Edward & Palsson (2000) PNAS in vivo in silico +- +362 -932 Condition specific prediction
What do the errors tell us? Errors indicate gaps in model or knowledge Authors discuss 7 errors in prediction –fba mutants inhibit stable RNA synthesis (not modeled by FBA) –tpi mutants produce toxic intermediate (not modeled by FBA) –5 cases due to possible regulatory mechanisms (aceEF, eno, pfk, ppc) Edward & Palsson (2000) PNAS
Quantitative Flux Prediction Measure some fluxes, say A & B –Controlled uptake rates Predict optimal flux of A given B as input to model Can models quantitatively predict fluxes and/or growth rate?