Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.

Slides:



Advertisements
Similar presentations
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Advertisements

Bioinformatics Vol. 21 no (Pages ) Reporter: Yu Lun Kuo (D )
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
How to approximate complex physical and thermodynamic interactions? Employ rigid or flexible structures for ligand and receptor (Side-chains or Back-bone.
A New Analytical Method for Computing Solvent-Accessible Surface Area of Macromolecules.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
A Versatile Depalletizer of Boxes Based on Range Imagery Dimitrios Katsoulas*, Lothar Bergen*, Lambis Tassakos** *University of Freiburg **Inos Automation-software.
Molecular Docking G. Schaftenaar Docking Challenge Identification of the ligand’s correct binding geometry in the binding site ( Binding Mode ) Observation:
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
Improved prediction of protein-protein binding sites using a support vector machine ( James Bradford, et al (2004)) Tapan Patel CISC841 Trypsin (and inhibitor.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Docking Algorithm Scheme Part 1: Molecular shape representation Part 2: Matching of critical features Part 3: Filtering and scoring of candidate transformations.
Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.
Agenda A brief introduction The MASS algorithm The pairwise case Extension to the multiple case Experimental results.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
Seminar in BioInformatics A Method for Biomolecular Structural Recognition and Docking Allowing Conformational Flexibility (1998) Bilha Sandak, Ruth Nussinov.
Docking of Protein Molecules
FLEX* - REVIEW.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Object Recognition Using Geometric Hashing
QSD – Quadratic Shape Descriptors Surface Matching and Molecular Docking Using Quadratic Shape Descriptors Goldman BB, Wipke WT. Quadratic Shape Descriptors.
Object Recognition. Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding.
Unbound Docking of Rigid Molecules. Problem Definition Given two molecules find their correct association: +=
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Protein-protein and Protein- ligand Docking The geometric filtering.
RAPID: Randomized Pharmacophore Identification for Drug Design PW Finn, LE Kavraki, JC Latombe, R Motwani, C Shelton, S Venkatasubramanian, A Yao Presented.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Module 2: Structure Based Ph4 Design
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Molecular Docking S. Shahriar Arab. Overview of the lecture Introduction to molecular docking: Definition Types Some techniques Programs “Protein-Protein.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Similarity Methods C371 Fall 2004.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
y-sa/2.0/. Integrating the Data Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st.
Structural biology should be computable! Protein structures determined by amino acid sequences Protein structures and complexes correspond to global free.
In molecular switching, the recognition of an external signal such as ligand binding by one protein is coupled to the catalytic activity of a second protein.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
A two-state homology model of the hERG K + channel: application to ligand binding Ramkumar Rajamani, Brett Tongue, Jian Li, Charles H. Reynolds J & J PRD.
Altman et al. JACS 2008, Presented By Swati Jain.
Hierarchical Database Screenings for HIV-1 Reverse Transcriptase Using a Pharmacophore Model, Rigid Docking, Solvation Docking, and MM-PB/SA Junmei Wang,
Protein Design with Backbone Optimization Brian Kuhlman University of North Carolina at Chapel Hill.
Selecting Diverse Sets of Compounds C371 Fall 2004.
R L R L L L R R L L R R L L water DOCKING SIMULATIONS.
Modeling Protein Flexibility with Spatial and Energetic Constraints Yi-Chieh Wu 1, Amarda Shehu 2, Lydia Kavraki 2,3  Provided an approach to generating.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Topics in bioinformatics CS697 Spring 2011 Class 12 – Mar Molecular distance measurements Molecular transformations.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Protein structure prediction Computer-aided pharmaceutical design: Modeling receptor flexibility Applications to molecular simulation Work on this paper.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Simplified picture of the principles used for multiple copy simultaneous search (MCSS) and for computational combinatorial ligand design (CCLD). Simplified.
Rong Chen Boston University
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Geometric Hashing: An Overview
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004

Overview of the lecture Introduction to molecular docking: Definition Types Some techniques Programs Algorithm for Protein-Protein docking based in paper: “Protein-Protein Docking with Simultaneous Optimization of Rigid-body Displacement and Side-chain Conformations” Jeffrey J. Gray, Stewart Moughon, Chu Wang, Ora Schueler-Furman Brian Kuhlman, Carol A. Rohl and David Baker J. Mol. Biol. (2003) 331,

What is Docking? Docking attempts to find the “best” matching between two molecules

… a more serious definition… Given two biological molecules determine: -Whether the two molecules “interact” -If so, what is the orientation that maximizes the “interaction” while minimizing the total “energy” of the complex Goal: To be able to search a database of molecular structures and retrieve all molecules that can interact with the query structure

Why is docking important? It is of extreme relevance in cellular biology, where function is accomplished by proteins interacting with themselves and with other molecular components It is the key to rational drug design: The results of docking can be used to find inhibitors for specific target proteins and thus to design new drugs. It is gaining importance as the number of proteins whose structure is known increases

Example: HIV-1 Protease Active Site (Aspartyl groups)

Example: HIV-1 Protease

Why is this difficult? Both molecules are flexible and may alter each other’s structure as they interact: Hundreds to thousands of degrees of freedom (DOF) Total possible conformations are astronomical

Types of Docking studies Protein-Protein Docking Both molecules usually considered rigid 6 degrees of freedom First apply steric constraints to limit search space and the examine energetics of possible binding conformations Protein-Ligand Docking Flexible ligand, rigid-receptor Search space much larger Either reduce flexible ligand to rigid fragments connected by one or several hinges, or search the conformational space using monte-carlo methods or molecular dynamics

Some techniques Surface representation, that efficiently represents the docking surface and identifies the regions of interest (cavities and protrusions) Connolly surface Lenhoff technique Kuntz et al. Clustered-Spheres Alpha shapes Surface matching that matches surfaces to optimize a binding score: Geometric Hashing

Surface Representation Each atomic sphere is given the van der Waals radius of the atom Rolling a Probe Sphere over the Van der Waals Surface leads to the Solvent Reentrant Surface or Connolly surface

Lenhoff technique Computes a “complementary” surface for the receptor instead of the Connolly surface, i.e. computes possible positions for the atom centers of the ligand Atom centers of the ligand van der Waals surface

Kuntz et al. Clustered-Spheres Uses clustered-spheres to identify cavities on the receptor and protrusions on the ligand Compute a sphere for every pair of surface points, i and j, with the sphere center on the normal from point i Regions where many spheres overlap are either cavities (on the receptor) or protrusions (on the ligand) i j

Alpha Shapes Formalizes the idea of “shape” In 2D an “edge” between two points is “alpha-exposed” if there exists a circle of radius alpha such that the two points lie on the surface of the circle and the circle contains no other points from the point set

Alpha Shapes: Example Alpha=infinity Alpha=3.0 Å

Surface Matching Find the transformation (rotation + translation) that will maximize the number of matching surface points from the receptor and the ligand First satisfy steric constraints… Find the best fit of the receptor and ligand using only geometrical constraints … then use energy calculations to refine the docking Selet the fit that has the minimum energy

Geometric Hashing Building the Hash Table: –For each triplet of points from the ligand, generate a unique system of reference –Store the position and orientation of all remaining points in this coordinate system in the Hash Table Searching in the Hash Table –For each triplet of points from the receptor, generate a unique system of reference –Search the coordinates for each remaining point in the receptor and find the appropriate hash table bin: For every entry there, vote for the basis

Geometric Hashing –Determine those entries that received more than a threshold of votes, such entry corresponds to a potential match –For each potential match recover the transformation T that results in the best least-squares match between all corresponding triplets –Transform the features of the model according to the recovered transformation T and verify it. If the verification fails, choose a different receptor triplet and repeat the searching.

Docking Programs More information in: The programs are: DOCK (I. D. Kuntz, UCSF) AutoDOCK (Arthur Olson, The Scripps Research Institute) RosettaDOCK (Baker, Washington Univ., Gray, Johns Hopkins Univ.)

DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptorStep 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

DOCK: Example HIV-1 protease is the target receptor - Aspartyl groups are its active side

DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptorStep 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

DOCK: Example HIV-1 protease is the target receptor - Aspartyl groups are its active side

DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atomsStep 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

DOCK: Example HIV-1 protease is the target receptor - Aspartyl groups are its active side

DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligandStep 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

DOCK: Example 45 Three scoring schemes: Shape scoring, Electrostatic scoring and Force-field scoring Image 5 is a comparison of the top scoring orientation of the molecule thioketal with the orientation found in the crystal structure

DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientationStep 5 Scoring: Find the top scoring orientation

DOCK: Example 45 Three scoring schemes: Shape scoring, Electrostatic scoring and Force-field scoring Image 5 is a comparison of the top scoring orientation of the molecule thioketal with the orientation found in the crystal structure

Other Docking programs AutoDock –AutoDock was designed to dock flexible ligands into receptor binding sites –The strongest feature of AutoDock is the range of powerful optimization algorithms available RosettaDOCK –It models physical forces and creates a very large number of decoys –It uses degeneracy after clustering as a final criterion in decoy selection

CAPRI Challenge (2002) At least one docking partner presented in its unbound form Participants permitted 5 attempts for each target The 7 CAPRI Docking Targets

CAPRI Challenge Participants & Algorithms

Results: CAPRI Challenge This were the results for the different predictors and targets:

A Protein-Protein Docking Algorithm (Gray & Baker) Our goal is to try to predict protein-protein complexes from the coordinates of the unbound monomer components. The method is divided in two steps: A low-resolution Monte Carlo search and a final optimization using Monte Carlo minimization. Up to 10 5 independent simulations are carried out, and the resulting “decoys” are ranked using an energy function. The top-ranking decoys are clustered to select the final predictions.

Docking protocol

Docking protocol: Step 1 RANDOM START POSITION Creation of a decoy begins with a random orientation of each partner and a translation of one partner along the line of protein centers to create a glancing contact between the proteins

Docking protocol

Docking protocol: Step 2 LOW-RESOLUTION MONTE CARLO SEARCH One partner is translated and rotated around the surface of the other through 500 Monte Carlo move attempts We use a low-resolution representation: N, C , C, O for the backbone and a “centroid” for the side-chain The score is based in the correctness of each decoy: A reward contacting residues, a penalty for overlapping residues, an alignment score, residue environment and residue-residue interacions terms

Docking protocol

Docking protocol: Step 3 HIGH-RESOLUTION REFINEMENT Explicit side-chains are added to the protein backbones using a rotameter packing algorithm, thus changing the energy surface An explicit minimization finds the nearest local minimum accessible via rigid body translation and rotation Start and Finish positions are compared by the Metropolis criterion

Docking protocol

Docking protocol: Step 3 Before each cycle, the position of one protein is perturbed by random translations and by random rotations To simultaneously optimize the side-chain conformations and the rigid body position, the side-chain packing and the minimization operations are repeated 50 times

Docking protocol: Step 3 COMPUTATIONAL EFFICIENCY 1.The packing algorithm usually varies the conformation of only one residue at a time: A combinatorial rotamer optimization is performed only once every eight cycles 2.A filter is employed periodically to detect inferior decoys and and reject them without further refinement

Docking protocol

Docking protocol: Step 4 CLUSTERING & PREDICTIONS The search procedure is repeated to create approximately 10 5 decoys per target The 200 best-scoring decoys are then clustered on the basis of the root-mean-squared distance (rmsd) using a hierarchical clustering algorithm The clusters with the most members are selected as the final predictions and ranked according to cluster sizes

Docking protocol: Results

Conclusions The so-called computational molecular docking problem is far from being solved. There are two major bottle- necks: 1.The algorithms can handle only a limited extent of backbone flexibility 2.The availability of selective and efficient scoring functions … and Thanks!!Questions??