Presentation is loading. Please wait.

Presentation is loading. Please wait.

Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.

Similar presentations


Presentation on theme: "Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004."— Presentation transcript:

1 Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004

2 Overview of the lecture Introduction to molecular docking: Definition Types Some techniques Programs Algorithm for Protein-Protein docking based in paper: “Protein-Protein Docking with Simultaneous Optimization of Rigid-body Displacement and Side-chain Conformations” Jeffrey J. Gray, Stewart Moughon, Chu Wang, Ora Schueler-Furman Brian Kuhlman, Carol A. Rohl and David Baker J. Mol. Biol. (2003) 331, 281-299

3 What is Docking? Docking attempts to find the “best” matching between two molecules

4 … a more serious definition… Given two biological molecules determine: -Whether the two molecules “interact” -If so, what is the orientation that maximizes the “interaction” while minimizing the total “energy” of the complex Goal: To be able to search a database of molecular structures and retrieve all molecules that can interact with the query structure

5 Why is docking important? It is of extreme relevance in cellular biology, where function is accomplished by proteins interacting with themselves and with other molecular components It is the key to rational drug design: The results of docking can be used to find inhibitors for specific target proteins and thus to design new drugs. It is gaining importance as the number of proteins whose structure is known increases

6 Example: HIV-1 Protease Active Site (Aspartyl groups)

7 Example: HIV-1 Protease

8 Why is this difficult? Both molecules are flexible and may alter each other’s structure as they interact: Hundreds to thousands of degrees of freedom (DOF) Total possible conformations are astronomical

9 Types of Docking studies Protein-Protein Docking Both molecules usually considered rigid 6 degrees of freedom First apply steric constraints to limit search space and the examine energetics of possible binding conformations Protein-Ligand Docking Flexible ligand, rigid-receptor Search space much larger Either reduce flexible ligand to rigid fragments connected by one or several hinges, or search the conformational space using monte-carlo methods or molecular dynamics

10 Some techniques Surface representation, that efficiently represents the docking surface and identifies the regions of interest (cavities and protrusions) Connolly surface Lenhoff technique Kuntz et al. Clustered-Spheres Alpha shapes Surface matching that matches surfaces to optimize a binding score: Geometric Hashing

11 Surface Representation Each atomic sphere is given the van der Waals radius of the atom Rolling a Probe Sphere over the Van der Waals Surface leads to the Solvent Reentrant Surface or Connolly surface

12 Lenhoff technique Computes a “complementary” surface for the receptor instead of the Connolly surface, i.e. computes possible positions for the atom centers of the ligand Atom centers of the ligand van der Waals surface

13 Kuntz et al. Clustered-Spheres Uses clustered-spheres to identify cavities on the receptor and protrusions on the ligand Compute a sphere for every pair of surface points, i and j, with the sphere center on the normal from point i Regions where many spheres overlap are either cavities (on the receptor) or protrusions (on the ligand) i j

14 Alpha Shapes Formalizes the idea of “shape” In 2D an “edge” between two points is “alpha-exposed” if there exists a circle of radius alpha such that the two points lie on the surface of the circle and the circle contains no other points from the point set

15 Alpha Shapes: Example Alpha=infinity Alpha=3.0 Å

16 Surface Matching Find the transformation (rotation + translation) that will maximize the number of matching surface points from the receptor and the ligand First satisfy steric constraints… Find the best fit of the receptor and ligand using only geometrical constraints … then use energy calculations to refine the docking Selet the fit that has the minimum energy

17 Geometric Hashing Building the Hash Table: –For each triplet of points from the ligand, generate a unique system of reference –Store the position and orientation of all remaining points in this coordinate system in the Hash Table Searching in the Hash Table –For each triplet of points from the receptor, generate a unique system of reference –Search the coordinates for each remaining point in the receptor and find the appropriate hash table bin: For every entry there, vote for the basis

18 Geometric Hashing –Determine those entries that received more than a threshold of votes, such entry corresponds to a potential match –For each potential match recover the transformation T that results in the best least-squares match between all corresponding triplets –Transform the features of the model according to the recovered transformation T and verify it. If the verification fails, choose a different receptor triplet and repeat the searching.

19 Docking Programs More information in: http://www.bmm.icnet.uk/~smithgr/soft.html The programs are: DOCK (I. D. Kuntz, UCSF) AutoDOCK (Arthur Olson, The Scripps Research Institute) RosettaDOCK (Baker, Washington Univ., Gray, Johns Hopkins Univ.)

20 DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptorStep 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

21 DOCK: Example 12 3 - HIV-1 protease is the target receptor - Aspartyl groups are its active side

22 DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptorStep 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

23 DOCK: Example 12 3 - HIV-1 protease is the target receptor - Aspartyl groups are its active side

24 DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atomsStep 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

25 DOCK: Example 12 3 - HIV-1 protease is the target receptor - Aspartyl groups are its active side

26 DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligandStep 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

27 DOCK: Example 45 Three scoring schemes: Shape scoring, Electrostatic scoring and Force-field scoring Image 5 is a comparison of the top scoring orientation of the molecule thioketal with the orientation found in the crystal structure

28 DOCK DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientationStep 5 Scoring: Find the top scoring orientation

29 DOCK: Example 45 Three scoring schemes: Shape scoring, Electrostatic scoring and Force-field scoring Image 5 is a comparison of the top scoring orientation of the molecule thioketal with the orientation found in the crystal structure

30 Other Docking programs AutoDock –AutoDock was designed to dock flexible ligands into receptor binding sites –The strongest feature of AutoDock is the range of powerful optimization algorithms available RosettaDOCK –It models physical forces and creates a very large number of decoys –It uses degeneracy after clustering as a final criterion in decoy selection

31 CAPRI Challenge (2002) At least one docking partner presented in its unbound form Participants permitted 5 attempts for each target The 7 CAPRI Docking Targets

32 CAPRI Challenge Participants & Algorithms

33 Results: CAPRI Challenge This were the results for the different predictors and targets:

34 A Protein-Protein Docking Algorithm (Gray & Baker) Our goal is to try to predict protein-protein complexes from the coordinates of the unbound monomer components. The method is divided in two steps: A low-resolution Monte Carlo search and a final optimization using Monte Carlo minimization. Up to 10 5 independent simulations are carried out, and the resulting “decoys” are ranked using an energy function. The top-ranking decoys are clustered to select the final predictions.

35 Docking protocol

36 Docking protocol: Step 1 RANDOM START POSITION Creation of a decoy begins with a random orientation of each partner and a translation of one partner along the line of protein centers to create a glancing contact between the proteins

37 Docking protocol

38 Docking protocol: Step 2 LOW-RESOLUTION MONTE CARLO SEARCH One partner is translated and rotated around the surface of the other through 500 Monte Carlo move attempts We use a low-resolution representation: N, C , C, O for the backbone and a “centroid” for the side-chain The score is based in the correctness of each decoy: A reward contacting residues, a penalty for overlapping residues, an alignment score, residue environment and residue-residue interacions terms

39 Docking protocol

40 Docking protocol: Step 3 HIGH-RESOLUTION REFINEMENT Explicit side-chains are added to the protein backbones using a rotameter packing algorithm, thus changing the energy surface An explicit minimization finds the nearest local minimum accessible via rigid body translation and rotation Start and Finish positions are compared by the Metropolis criterion

41 Docking protocol

42 Docking protocol: Step 3 Before each cycle, the position of one protein is perturbed by random translations and by random rotations To simultaneously optimize the side-chain conformations and the rigid body position, the side-chain packing and the minimization operations are repeated 50 times

43 Docking protocol: Step 3 COMPUTATIONAL EFFICIENCY 1.The packing algorithm usually varies the conformation of only one residue at a time: A combinatorial rotamer optimization is performed only once every eight cycles 2.A filter is employed periodically to detect inferior decoys and and reject them without further refinement

44 Docking protocol

45 Docking protocol: Step 4 CLUSTERING & PREDICTIONS The search procedure is repeated to create approximately 10 5 decoys per target The 200 best-scoring decoys are then clustered on the basis of the root-mean-squared distance (rmsd) using a hierarchical clustering algorithm The clusters with the most members are selected as the final predictions and ranked according to cluster sizes

46 Docking protocol: Results

47 Conclusions The so-called computational molecular docking problem is far from being solved. There are two major bottle- necks: 1.The algorithms can handle only a limited extent of backbone flexibility 2.The availability of selective and efficient scoring functions … and Thanks!!Questions??


Download ppt "Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004."

Similar presentations


Ads by Google