Dynameomics: Protein Mechanics, Folding and Unfolding through Large Scale All-Atom Molecular Dynamics Simulations INCITE 6 David A. C. Beck Valerie Daggett.

Slides:



Advertisements
Similar presentations
10/21/20091 Protein Explorer: A Petaflops Special-Purpose Computer System for Molecular Dynamics Simulations Makoto Taiji, Tetsu Narumi, Yousuke Ohno,
Advertisements

Computational methods in molecular biophysics (examples of solving real biological problems) EXAMPLE I: THE PROTEIN FOLDING PROBLEM Alexey Onufriev, Virginia.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
The Protein Folding Problem David van der Spoel Dept. of Cell & Mol. Biology Uppsala, Sweden
The Calculation of Enthalpy and Entropy Differences??? (Housekeeping Details for the Calculation of Free Energy Differences) first edition: p
Molecular Simulation. Molecular Simluation Introduction: Introduction: Prerequisition: Prerequisition: A powerful computer, fast graphics card, A powerful.
Evaluation of Fast Electrostatics Algorithms Alice N. Ko and Jesús A. Izaguirre with Thierry Matthey Department of Computer Science and Engineering University.
StreamMD Molecular Dynamics Eric Darve. MD of water molecules Cutoff is used to truncate electrostatic potential Gridding technique: water molecules are.
Molecular Dynamics Molecular dynamics Some random notes on molecular dynamics simulations Seminar based on work by Bert de Groot and many anonymous Googelable.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Molecular Mechanics, Molecular Dynamics, and Docking Michael Strong, PhD National Jewish Health 11/23/2010.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Coarse grained simulations of p7 folding
The Geometry of Biomolecular Solvation 1. Hydrophobicity Patrice Koehl Computer Science and Genome Center
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Molecular Mechanics, Molecular Dynamics, and Docking
Ana Damjanovic (JHU, NIH) JHU: Petar Maksimovic Bertrand Garcia-Moreno NIH: Tim Miller Bernard Brooks OSG: Torre Wenaus and team.
‘Tis not folly to dream: Using Molecular Dynamics to Solve Problems in Chemistry Christopher Adam Hixson and Ralph A. Wheeler Dept. of Chemistry and Biochemistry,
Algorithms and Software for Large-Scale Simulation of Reactive Systems _______________________________ Ananth Grama Coordinated Systems Lab Purdue University.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Molecular Dynamics Valerie Daggett Bioengineering Department University of Washington...everything that living things do can be understood in terms of.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Bioinformatics: Practical Application of Simulation and Data Mining Protein Folding I Prof. Corey O’Hern Department of Mechanical Engineering & Materials.
-A cell is an organization of millions of molecules -Proper communication between these molecules is essential to the normal functioning of the cell -To.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
BlueGene/L Facts Platform Characteristics 512-node prototype 64 rack BlueGene/L Machine Peak Performance 1.0 / 2.0 TFlops/s 180 / 360 TFlops/s Total Memory.
E-science grid facility for Europe and Latin America E2GRIS1 André A. S. T. Ribeiro – UFRJ (Brazil) Itacuruça (Brazil), 2-15 November 2008.
Department of Mechanical Engineering
Chem. 860 Molecular Simulations with Biophysical Applications Qiang Cui Department of Chemistry and Theoretical Chemistry Institute University of Wisconsin,
Computational Chemistry Molecular Mechanics/Dynamics F = Ma Quantum Chemistry Schr Ö dinger Equation H  = E 
Folding of Distinct Sequences into Similar Functional Topologies Hossein Mohammadiarani and Harish Vashisth (Advisor) Department of Chemical Engineering,
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling II Prof. Corey O’Hern Department of Mechanical Engineering Department.
10/3/2003 Molecular and Cellular Modeling 10/3/2003 Introduction Objective: to construct a comprehensive simulation software system for the computational.
Molecular Dynamics simulations
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Molecular simulation methods Ab-initio methods (Few approximations but slow) DFT CPMD Electron and nuclei treated explicitly. Classical atomistic methods.
Altman et al. JACS 2008, Presented By Swati Jain.
Molecular Simulation of Reactive Systems. _______________________________ Sagar Pandit, Hasan Aktulga, Ananth Grama Coordinated Systems Lab Purdue University.
SMA5233 Particle Methods and Molecular Dynamics Lecture 5: Applications in Biomolecular Simulation and Drug Design A/P Chen Yu Zong Tel:
Algorithms and Software for Large-Scale Simulation of Reactive Systems _______________________________ Metin Aktulga, Sagar Pandit, Alejandro Strachan,
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Center for Biological Physics* Arjan van der Vaart – studying the conformational changes of proteins as they interact with other molecules Banu Ozkan –
Review Session BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Molecular Dynamics Arjan van der Vaart PSF346 Center for Biological Physics Department of Chemistry and Biochemistry Arizona State.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
A Pattern Language for Parallel Programming Beverly Sanders University of Florida.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Molecular dynamics (MD) simulations  A deterministic method based on the solution of Newton’s equation of motion F i = m i a i for the ith particle; the.
Protein Structures from A Statistical Perspective Jinfeng Zhang Department of Statistics Florida State University.
Parallel Molecular Dynamics A case study : Programming for performance Laxmikant Kale
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Dr. Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Flexibility and Interoperability in a Parallel MD code Robert Brunner, Laxmikant Kale, Jim Phillips University of Illinois at Urbana-Champaign.
Computational Structure Prediction
March 21, 2008 Christopher Bruns
Protein Structure Prediction and Protein Homology modeling
Introduction to Molecular Simulation
Protein dynamics Folding/unfolding dynamics
Algorithms and Software for Large-Scale Simulation of Reactive Systems
Enzyme Kinetics & Protein Folding 9/7/2004
Molecular simulation methods
Algorithms and Software for Large-Scale Simulation of Reactive Systems
Experimental Overview
Presentation transcript:

Dynameomics: Protein Mechanics, Folding and Unfolding through Large Scale All-Atom Molecular Dynamics Simulations INCITE 6 David A. C. Beck Valerie Daggett Research Group Department of Medicinal Chemistry University of Washington, Seattle November 15 th, 2005

Proteins Proteins are life’s machines, tools and structures –Many jobs, many shapes, many sizes

Proteins Proteins are life’s machines, tools and structures –Nature reuses designs for similar jobs 1hdd 1enh 1f43 1ftt 1bw5 1du6 1cqt

Proteins Proteins are hetero-polymers of specific sequence –There are 20 common polymeric units (amino acids) Composed of a variety of basic chemical moieties –Chain lengths range from 40 amino acids on up M K L V D Y A G E

Proteins Proteins are hetero-polymers that adopt a unique fold M K L V D Y A G E

Proteins Protein folding as a reaction Products Reactants Transition state Free Energy Bad Good

Proteins Protein folding … Native Denatured / Partially Unfolded Transition state Free Energy Bad Good

Proteins Folded proteins Native Denatured / Partially Unfolded Transition state Free Energy Folded, active, functional, biologically relevant state (ensemble of conformers) Bad Good

Proteins Folded proteins Native Denatured / Partially Unfolded Transition state Free Energy Static, 3D coordinates of some proteins’ atoms are available from x-ray crystallography & NMR Bad Good

Proteins Folded proteins Native Denatured / Partially Unfolded Transition state Free Energy Static, 3D coordinates of some proteins’ atoms are available from PDB Bad Good

Proteins Folded proteins are complex and dynamic molecules Native Denatured / Partially Unfolded Transition state Free Energy Bad Good

Proteins Folded proteins are complex and dynamic molecules Native Denatured / Partially Unfolded Transition state Free Energy Bad Good

Molecular Dynamics MD provides atomic resolution of native dynamics PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics MD provides atomic resolution of native dynamics PDB ID: 3chy, E. coli CheY 1.66 Å X-ray crystallography

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, hydrogens added

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, waters added (i.e. solvated)

Molecular Dynamics MD provides atomic resolution of native dynamics 3chy, waters and hydrogens hidden

Molecular Dynamics MD provides atomic resolution of native dynamics native state simulation of 3chy at 298 Kelvin, waters and hydrogens hidden

Proteins Folding & unfolding at atomic resolution Native Denatured / Partially Unfolded Transition state Free Energy Disordered, non-functional, heterogeneous ensemble of conformers Bad Good

Proteins Protein folding, why we care how it happens Native Denatured / Partially Unfolded Transition state Free Energy mutation Many diseases are related to protein folding and / or misfolding in response to genetic mutation.

Proteins Protein folding, why we care how it happens Native Denatured / Partially Unfolded Transition state Free Energy mutation We need to comprehend folding to build nano-scale biomachines (that could produce energy, etc…)

Proteins Protein folding takes > 10 μs (often much longer) Native Denatured / Partially Unfolded Transition state Free Energy Bad Good

Proteins Protein folding is the reverse of protein unfolding Native Denatured / Partially Unfolded Transition state Free Energy Bad Good

Proteins Protein unfolding is relatively invariant to temperature Native Denatured / Partially Unfolded Transition state Free Energy Temperature Bad Good

Molecular Dynamics MD provides atomic resolution of folding / unfolding unfolding simulation (reversed) of 3chy at 498 Kelvin, waters & hydrogens hidden

Molecular Dynamics 1 Classically evolves an atomic system with time –Potential function (a.k.a force field) Describes the energies of interaction between atom centers –Integration algorithm Time dependent evolution of atomic coordinates in response to potential energy –Statistical sampling ensemble Fixed thermodynamic variables, i.e. NVE Number of atoms, box Volume, total Energy 1.Beck, D.A.C. Daggett, V. Methods (2004) 31:

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101:

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101:

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101: b0b0

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101: θ0θ0

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) Levitt M. et al. J. Phys. Chem. B (1997) 101: Φ0Φ0

Molecular Dynamics Potential function for MD 1,2 U = Bond + Angle + Dihedral + van der Waals + Electrostatic 1.Levitt M. Hirshberg M. Sharon R. Daggett V. Comp. Phys. Comm. (1995) 91: Levitt M. et al. J. Phys. Chem. B (1997) 101:

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic To a large degree, protein structure is dependent on non-bonded atomic interactions

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic + -

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic +

Molecular Dynamics Non-bonded components of potential function U nb = van der Waals + Electrostatic NOTE: Sum over all pairs of N atoms, or pairs N is often between 5x10 5 to 5x10 6 For 5x10 5 that is 1.25x10 11 pairs THAT IS A LOT OF POSSIBLE PAIRS!

Molecular Dynamics Time dependent integration of classical equations of motion

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration

Molecular Dynamics Time dependent integration Evaluate forces and perform integration for every atom Each picosecond of simulation time requires 500 iterations of cycle E.g. w/ 50,000 atoms, each ps ( s) involves 25,000,000 evaluations

Molecular Dynamics Scalable, parallel MD & analysis software: in lucem Molecular Mechanics 1 ilmm 1.Beck, Alonso, Daggett, (2004) University of Washington, Seattle

Molecular Dynamics ilmm is written in C (ANSI / POSIX) 64 bit math POSIX threads / MPI Software design philosophy: –Kernel Compiles user’s molecular mechanics programs Schedules execution across processor and machines –Modules, e.g. Molecular Dynamics Analysis CPU POSIX threads (multiprocessor machines) CPU Message Passing Interface (multiple machines) + VERY high bandwidth

Molecular Dynamics ilmm is written in C (ANSI / POSIX) 64 bit math POSIX threads / MPI Software design philosophy: –Kernel Compiles user’s molecular mechanics programs Schedules execution across processor and machines –Modules, e.g. Molecular Dynamics Analysis CPU POSIX threads (multiprocessor machines) CPU Message Passing Interface (multiple machines) + VERY high bandwidth

Dynameomics Simulate representative protein from all folds

Dynameomics Simulate representative protein from all folds –Nature reuses designs for similar jobs 1hdd 1enh 1f43 1ftt 1bw5 1du6 1cqt

Dynameomics Simulate representative protein from all folds 1. Day R., Beck D. A. C., Armen R., Daggett V. Protein Science (2003) 10: fold population coverage 150 folds represent ~ 75% of known protein structures fold 1

Dynameomics Simulate representative protein from all folds –Native (folded) dynamics 20 nanosecond simulation at 298 Kelvin –Folding / unfolding pathway 3 x 2 ns simulations at 498 K 2 x 20 ns simulations at 498 K –Each target requires 6 simulations = MANY CPU HOURS

Dynameomics NERSC DOE INCITE award –2,000,000 + hours –906 simulations of 151 protein folds on Seaborg –One to two simulations per node (8 – 16 CPUs / simulation) –Opportunity to tune ilmm for maximum performance

Dynameomics Load balancing –Even distribution of non-bonded pairs to processors ~20% faster

Dynameomics Parallel efficiency –Threaded computations on 16 CPU IBM Nighthawk p, number of processors t(p), run-time using p processors parallel efficiency,

Dynameomics Simulate representative from top 151 folds –151 folds represent about 75% of known proteins ~ 11 μs of combined sim. time from 906 sims! ~ 2 terabytes of data (w/ 40 to 60% compression!) ~ 75 / 151 have been analyzed Validated against experiment where possible

Dynameomics Now what? –Simulate the top 1130 folds (>90%) More CPU time –Share simulation data from top 151 folds w/ world: Coordinates, analyses, available via WWW MicrosoftSQL database w/ On-Line Analytical Processing (OLAP) End-user queries of coordinate data, analyses, etc. –Data mining More CPU time, clever statistical algorithms, etc.

Acknowledgements DOE / NERSC’s INCITE (David Skinner, et al) NIH Microsoft, Inc. Structures rendered using Chimera, Molscript, Raster3D & PyMOL