BioSimGRID: A GRID Database of Biomolecular Simulations Mark S.P. Sansom

Slides:



Advertisements
Similar presentations
Fighting Malaria With The Grid. Computing on The Grid The Internet allows users to share information across vast geographical distances. Using similar.
Advertisements

Simulazione di Biomolecole: metodi e applicazioni giorgio colombo
Bioinformatics and Molecular Modeling studies of Membrane Proteins Shiva Amiri.
Computer simulations in drug design, and the GRID Dr Jonathan W Essex University of Southampton.
BioCoRE and GEMS: Cyber Infrastructure for Cyber Chemistry Jesús A. Izaguirre Computer Science & Engineering University of Notre Dame with Kirby Vandivort.
Ion Solvation Thermodynamics from Simulation with a Polarizable Force Field Gaurav Chopra 07 February 2005 CS 379 A Alan GrossfeildPengyu Ren Jay W. Ponder.
Molecular Dynamics, Monte Carlo and Docking Lecture 21 Introduction to Bioinformatics MNW2.
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
An Integrated Approach to Protein-Protein Docking
Coarse grained simulations of p7 folding
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Molecular Dynamics of AChBP: Water in the Binding Pocket Shiva Amiri Biophysical Society Annual Meeting, February,
Bioinformatics and molecular modelling studies of membrane proteins Shiva Amiri Professor Mark S.P. Sansom June 1, 2004.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Developing Reusable Software Infrastructure – Middleware – for Multiscale Modeling Wilfred W. Li, Ph.D. National Biomedical Computation Resource Center.
Ana Damjanovic (JHU, NIH) JHU: Petar Maksimovic Bertrand Garcia-Moreno NIH: Tim Miller Bernard Brooks OSG: Torre Wenaus and team.
BIOC3010: Bioinformatics - Revision lecture Dr. Andrew C.R. Martin
BioSimGRID and BioSimGRID ’lite’ - Towards a worldwide repository for biomolecular simulation Philip C Biggin
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Jeremy C. Smith, University of Heidelberg Introduction to Protein Simulations and Drug Design R P.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
Application of e-infrastructure to real research.
y-sa/2.0/. Integrating the Data Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st.
Expanding the Accessibility and Impact of Language Technologies for Supporting Education (TFlex): Edinburgh Effort Dr. Myroslava Dzikovska, Prof. Johanna.
Biomolecular Modelling and Simulation Julia M Goodfellow, Birkbeck College, University of London.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE Molecular Science in NPACI Russ B. Altman NPACI Molecular Science Thrust Stanford Medical.
Investigating Protein Conformational Change on a Distributed Computing Cluster Christopher Woods Jeremy Frey Jonathan Essex University.
H v 1: How A Voltage-sensor May Form A Channel Younes Mokrab Biophysical Society 53 rd Annual Meeting 2 Mar 2009.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Molecular Dynamics Simulation
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
An FPGA Implementation of the Ewald Direct Space and Lennard-Jones Compute Engines By: David Chui Supervisor: Professor P. Chow.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Proteins in Bionanotechnology Computational Studies Andrew Hung, Oliver Beckstein, Robert D’Rozario, Sylvanna S.W. Ho and Mark S.P. Sansom Laboratory of.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
1/20 Study of Highly Accurate and Fast Protein-Ligand Docking Method Based on Molecular Dynamics Reporter: Yu Lun Kuo
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
 The generated models are used in various coarse-grain and other molecular modelling studies.  Coarse-grain analysis includes: Gaussian Network Models.
A Pattern Language for Parallel Programming Beverly Sanders University of Florida.
Molecular dynamics simulations of toxin binding to ion channels Quantitative description protein –ligand interactions is a fundamental problem in molecular.
BioSimGrid PI:: Mark Sansom (Oxford) Southampton :: Jon Essex, Stuart Murdock Oxford :: Mark Sansom, Kaihsu Tai Birkbeck College.
Dynameomics: Protein Mechanics, Folding and Unfolding through Large Scale All-Atom Molecular Dynamics Simulations INCITE 6 David A. C. Beck Valerie Daggett.
Molecular Modelling of the Nicotinic Acetylcholine Receptor and Related Proteins Shiva Amiri Professor Mark S. P. Sansom and Dr. Philip C. Biggin D. Phil.
Protein structure prediction Computer-aided pharmaceutical design: Modeling receptor flexibility Applications to molecular simulation Work on this paper.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
Molecular Modelling Studies of the Nicotinic Acetylcholine Receptor
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Centre for Computational Science, University College London
Computer simulations in drug design, and the GRID
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Understanding Biomolecular Systems
Computational Analysis
Conformational Change in an MFS Protein: MD Simulations of LacY
An Integrated Approach to Protein-Protein Docking
Bioinformatics and molecular modeling studies of membrane proteins
Interactions of Pleckstrin Homology Domains with Membranes: Adding Back the Bilayer via High-Throughput Molecular Dynamics  Eiji Yamamoto, Antreas C.
Shiva Amiri Professor Mark S. P. Sansom and Dr. Philip C. Biggin
G. Fiorin, A. Pastore, P. Carloni, M. Parrinello  Biophysical Journal 
Volume 87, Issue 6, Pages (December 2004)
Sundeep S. Deol, Peter J. Bond, Carmen Domene, Mark S.P. Sansom 
OmpT: Molecular Dynamics Simulations of an Outer Membrane Enzyme
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

BioSimGRID: A GRID Database of Biomolecular Simulations Mark S.P. Sansom

Overview u Introduction to biomolecular simulations u u Why? Case study – added value from comparisons u How? Progress towards a prototype of BioSimGRID u The future? Towards computational systems biology

MD Simulations: from Structure to Dynamics u Molecular simulations as a tool for protein structure analysis u MD – Newtonian simulation of molecular dynamics using an empirical forcefield u Why? - Proteins move X-ray structure: average structure at 100 K in crystal MD simulations: dynamics at 300 K in water (& membrane) u Challenge: to relate structural dynamics to biological function

Molecular Dynamics u Describe the forces on all atoms: bonded (bonds, angles, dihedrals) non-bonded (van der Waals, electrostatics) u Describe the initial atom positions: u Integrate: F = ma (a few million times…) u Result: positions and energies of all atoms during a few nanoseconds u Applications: liquids … peptides … proteins … membranes u Membrane + protein + water = ca. 50,000 atoms u Need for comparative analysis of simulations – GRID data and collaboration u Need for efficient parallelisation – clusters and/or HPC

Current Paradigm for MD Simulations u Target selection: literature based; interesting protein/problem u System preparation: highly interactive; slow; idiosyncratic u Simulation: diversity of protocols u Analysis: highly interactive; slow; idiosyncratic u Dissemination: traditional – papers, posters, talks u Archival: ‘archive’ data … and then mislay the tape!

Integrating Simulations and Structural Biology of Proteins Novel structure (RCSB) Sequence alignment Biomedically relevant homologue(s) Homology model(s) MD simulations Biomolecular simulation database Comparative analysis Evaluation/refinement of model Biological and pharmacological simulation & modelling e.g. drug discovery bacterial K channel mammalian K channel dynamics in membrane drug docking calculations Interaction site dynamics bioinformatics & structural biology BioSimGRID drug discovery

Comparative Simulations: Drug Receptors u Why? – increase significance of results u Sampling – long simulations and multiple simulations u Sampling via biology – exploiting evolution u Biology emerges from comparisons… u e.g. mammalian receptor vs. bacterial binding protein u Rat GluR2 EC fragment u Major receptor in mammalian brains – drug target u MD simulations with/without bound ligands u Analyse inter-domain motions glutamate S1 S2

GluR2 – Flexibility & Gating… u Flexibility depends on ligand occupancy & species u Gating mechanism – decrease in flexibility on channel activation u But … incomplete sampling u Need: longer simulations & comparative simulations empty Kainate Glutamate >> > “OFF” “ON” time (ns) RMSD (Å) 0 empty +Kai +Glu 2.0

GlnBP – A Bacterial Binding Protein u GlnBP – bacterial 2-domain periplasmic binding protein u Similar fold to mammalian GluR2 u X-ray shows ligand binding induces domain closure u MD shows ligand binding reduces inter-domain motions - cf. GluR2 simulations + Gln empty Gln bound X-ray structures MD Simulation empty Gln bound

Main Initial Tasks u To establish a distributed database environment u To develop Grid/Web services using GT3/OGSA infrastructure u To develop software tools for interrogation and data-mining u To develop generic analysis tools u Annotation of simulation data with biological and structural data from other databases York Nottingham Birmingham Oxford RAL Southampton London collaborating groups

Oxford –database management system (Bing Wu) –(meta)data curatorship & integration (Kaihsu Tai) Southampton –application programming interface & data retrieval (Muan Hong Ng) –generic analysis tools (Stuart Murdock) Dividing up the Tasks

table trajectory: one entry for each trajectory table coordinate: {x, y, z} one entry for each atom in each residue in each frame in each trajectory table atom: one entry for each atom in each residue in each trajectory table residue: one entry for each residue in each trajectory table frame: one entry for each frame in each trajectory dictionary tables metadata tables Database Design: Simplified

Database Design: A More Complete Version

Simulation Metadata u Difficult to extract from published literature u This is a prototype: a needs analysis with users/depositors must be conducted u Annotation/links to other biological databases essential id molecules author depositors affiliations publications method src_stru ref_stru prog ver hardware num_of_proc timestep num_of_frame ens_type thermostat solvent forcefield ele_stat equ_prot hyd_atom unit_shape … metadata

Database Editor & SQL Query Capability

BioSimGRID Prototype Target date for prototype: July 2003

Deliverables to Date… Database schema Sample database (with test trajectories) Prototype shared between 2 sites Analysis tools – preliminary versions Interface to database for data retrieval Python hosting environment

Roadmap u Dec 2002 – project started u July 2003 – (internal) prototype u September 2003 – working prototype (All Hands meeting) u November 2003 – test ‘real world’ applications u December 2003 – multi-site prototype u 2004 – multi-site deposition of data u 2005 – open up to additional groups for deposition/testing

Future Directions u HTMD – simulations coupled to structural genomics Diamond light source u Computational system biology – virtual outer membrane HPCx u Multiscale biomolecular simulations – from QM/MM to meso- scale modelling GRID-enabled simulations u Combine all of these with BioSimGRID…

Structural Genomics & HTMD u Overall vision – simulation as an integral component of structural genomics u Needs capacity computation – GRID? u MD database (distributed) – BioSimGRID synchrotron MD database novel biology… compute GRID

Towards a Virtual Outer Membrane (vOM) u First step towards computational systems biology – a suitable system u Bacterial OMs – 5 or 6 proteins = 90% of protein content u Structures or good homology models of proteins are available u Complex lipid – outer leaflet is lipopolysaccharide (LPS) u Minimum system size ca. 2.5x10 6 atoms; simulation times ca. 50 ns cf. current FhuA – 80,000 atoms & 10 ns – need HPCx

Multiscale Biomolecular Simulations u Membrane bound enzymes – major drug targets (cf. ibruprofen, anti-depressants, endocannabinoids) u Complex multi-scale problem: QM/MM; ligand binding; membrane/protein fluctuations; diffusive motion of substrates/drugs in multiple phases u Need for GRID-based integrated simulations

Oxford Dr Phil Biggin Dr Carmen Domene Dr Alessandro Grottesi Dr Andrew Hung Dr Daniele Bemporad Dr Shozeb Haider Dr Kaihsu Tai Dr Bing Wu George Patargias Oliver Beckstein Yalini Pathy Pete Bond Jonathan Cuthbertson Sundeep Deol Jeff Campbell Loredana Vaccaro Jennifer Johnston Katherine Cox Robert d’Rozario John Holyoake Andrew Pang BBSRCDTI The Wellcome TrustGSK EC (TMR) OeSC (EPSRC & DTI) EPSRC OSC (JIF) MRC BioSimGRID Leo Caves (York) Simon Cox (Southampton) Jon Essex (Southampton) Paul Jeffreys (Oxford) Charles Laughton (Nottingham) David Moss (Birkbeck) Oliver Smart (Birmingham) Southampton Dr Stuart Murdock Dr Muan Hong Ng Dr Richard Maurer Dr Hans Fangohr Steve Johnston