Using Motion Planning to Study Ligand Binding and Protein Folding Nancy Amato,Guang Song and Burchan Bayazit Department of Computer Science Texas A&M University.

Slides:



Advertisements
Similar presentations
Complete Motion Planning
Advertisements

Probabilistic Roadmaps. The complexity of the robot’s free space is overwhelming.
By Lydia E. Kavraki, Petr Svestka, Jean-Claude Latombe, Mark H. Overmars Emre Dirican
NUS CS5247 The Gaussian Sampling Strategy for Probalistic Roadmap Planners Valdrie Boor, Mark H. Overmars, A. Frank van der Stappen, 1999 Wai.
Probabilistic Roadmap
A Comparative Study of Probabilistic Roadmap Planners Roland Geraerts Mark Overmars.
Probabilistic Roadmap Methods (PRMs)
Probabilistic Roadmaps Sujay Bhattacharjee Carnegie Mellon University.
By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.
Kinodynamic Path Planning Aisha Walcott, Nathan Ickes, Stanislav Funiak October 31, 2001.
DESIGN OF A GENERIC PATH PATH PLANNING SYSTEM AILAB Path Planning Workgroup.
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song and Nancy M. Amato Department of Computer Science Texas A&M University College Station,
Sampling and Connection Strategies for PRM Planners Jean-Claude Latombe Computer Science Department Stanford University.
1 Last lecture  Configuration Space Free-Space and C-Space Obstacles Minkowski Sums.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
CS 326 A: Motion Planning Probabilistic Roadmaps Basic Techniques.
1 Single Robot Motion Planning - II Liang-Jun Zhang COMP Sep 24, 2008.
Planning Paths for Elastic Objects Under Manipulation Constraints Florent Lamiraux Lydia E. Kavraki Rice University Presented by: Michael Adams.
1 On the Probabilistic Foundations of Probabilistic Roadmaps D. Hsu, J.C. Latombe, H. Kurniawati. On the Probabilistic Foundations of Probabilistic Roadmap.
Providing Haptic ‘Hints’ to Automatic Motion Planners Providing Haptic ‘Hints’ to Automatic Motion Planners Burchan Bayazit Joint Work With Nancy Amato.
CS 326A: Motion Planning ai.stanford.edu/~latombe/cs326/2007/index.htm Probabilistic Roadmaps: Basic Techniques.
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces Kavraki, Svestka, Latombe, Overmars 1996 Presented by Dongkyu, Choi.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Laboratory for Perceptual Robotics – Department of Computer Science Whole-Body Collision-Free Motion Planning Brendan Burns Laboratory for Perceptual Robotics.
Using Motion Planning to Map Protein Folding Landscapes
Planning for Humanoid Robots Presented by Irena Pashchenko CS326a, Winter 2004.
1 Path Planning in Expansive C-Spaces D. HsuJ. –C. LatombeR. Motwani Prepared for CS326A, Spring 2003 By Xiaoshan (Shan) Pan.
NUS CS 5247 David Hsu1 Last lecture  Multiple-query PRM  Lazy PRM (single-query PRM)
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song, Nancy M. Amato Department of Computer Science Texas A&M University College Station,
A General Framework for Sampling on the Medial Axis of the Free Space Jyh-Ming Lien, Shawna Thomas, Nancy Amato {neilien,
Stochastic roadmap simulation for the study of ligand-protein interactions Mehmet Serkan Apaydin, Carlos E. Guestrin, Chris Varma, Douglas L. Brutlag and.
Robot Motion Planning Bug 2 Probabilistic Roadmaps Bug 2 Probabilistic Roadmaps.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Constraint-Based Motion Planning using Voronoi Diagrams Maxim Garber and Ming C. Lin Department of Computer.
Workspace-based Connectivity Oracle An Adaptive Sampling Strategy for PRM Planning Hanna Kurniawati and David Hsu Presented by Nicolas Lee and Stephen.
RNA Folding Kinetics Bonnie Kirkpatrick Dr. Nancy Amato, Faculty Advisor Guang Song, Graduate Student Advisor.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
CS 326A: Motion Planning Probabilistic Roadmaps: Sampling and Connection Strategies.
CS 326 A: Motion Planning Probabilistic Roadmaps Basic Techniques.
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces Kavraki, Svestka, Latombe, Overmars 1996 Presented by Chris Allocco.
Providing Haptic ‘Hints’ to Automatic Motion Planners Providing Haptic ‘Hints’ to Automatic Motion Planners Burchan Bayazit Joint Work With Nancy Amato.
Providing Haptic ‘Hints’ to Automatic Motion Planners Providing Haptic ‘Hints’ to Automatic Motion Planners by Burchan Bayazit Department of Computer Science.
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces Lydia E. Kavraki Petr Švetka Jean-Claude Latombe Mark H. Overmars Presented.
A Randomized Approach to Robot Path Planning Based on Lazy Evaluation Robert Bohlin, Lydia E. Kavraki (2001) Presented by: Robbie Paolini.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
Generating Better Conformations for Roadmaps in Protein Folding PARASOL Lab, Department of Computer Science, Texas A&M University,
Using Motion Planning to Study Protein Folding Pathways Susan Lin, Guang Song and Nancy M. Amato Department of Computer Science Texas A&M University
Randomized Motion Planning: From Intelligent CAD to Digital Actors to Protein Folding Nancy M. Amato Department of Computer Science Texas A&M University.
Robotics Chapter 5 – Path and Trajectory Planning
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces (1996) L. Kavraki, P. Švestka, J.-C. Latombe, M. Overmars.
On Delaying Collision Checking in PRM Planning – Application to Multi-Robot Coordination By: Gildardo Sanchez and Jean-Claude Latombe Presented by: Michael.
Deterministic Sampling Methods for Spheres and SO(3) Anna Yershova Steven M. LaValle Dept. of Computer Science University of Illinois Urbana, IL, USA.
Aimée Vargas, Jyh-Ming Lien, Marco Morales and Nancy M. Amato Algorithms & Applications Group Parasol Lab, Dept. of Computer Science, Texas A&M University.
Introduction to Motion Planning
UNC Chapel Hill M. C. Lin Introduction to Motion Planning Applications Overview of the Problem Basics – Planning for Point Robot –Visibility Graphs –Roadmap.
Tree-Growing Sample-Based Motion Planning
Modeling Protein Flexibility with Spatial and Energetic Constraints Yi-Chieh Wu 1, Amarda Shehu 2, Lydia Kavraki 2,3  Provided an approach to generating.
Randomized Kinodynamics Planning Steven M. LaVelle and James J
Filtering Sampling Strategies: Gaussian Sampling and Bridge Test Valerie Boor, Mark H. Overmars and A. Frank van der Stappen Presented by Qi-xing Huang.
Randomized KinoDynamic Planning Steven LaValle James Kuffner.
CS 326A: Motion Planning Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces (1996) L. Kavraki, P. Švestka, J.-C. Latombe,
PRM based Protein Folding
Last lecture Configuration Space Free-Space and C-Space Obstacles
Probabilistic Roadmap Motion Planners
Presented By: Aninoy Mahapatra
Sampling and Connection Strategies for Probabilistic Roadmaps
Giovanni Settanni, Antonino Cattaneo, Paolo Carloni 
Configuration Space of an Articulated Robot
Presentation transcript:

Using Motion Planning to Study Ligand Binding and Protein Folding Nancy Amato,Guang Song and Burchan Bayazit Department of Computer Science Texas A&M University

Given: an environment (descriptions of moveable object A and obstacles B), and start and goal positions of A Find: a valid path (continuous sequence of valid configurations of A) from start to goal Motion Planning start goal obstacles

Motivation: Paper Folding Box: 12 => 5 dofPeriscope: 11 dof A Motion planning approach to ligand binding. Singh, Latombe, Brutlag, ISMB’99

Outline PRMs for Computational Biology (PRM: Probabilistic Roadmap Method) —Conformation Space —Modeling —Potential Functions —Roadmap Construction Protein Folding Pathways Ligand Binding

1. Connect start and goal to roadmap Query processing startgoal Probabilistic Roadmap Methods (PRMs) [Kavraki, Svestka, Latombe,Overmars 1995] C-obst Roadmap Construction (Pre-processing) 2. Connect pairs of nodes to form roadmap - simple, deterministic local planner (e.g., straightline) - discard paths that are in collision (collision check) 1. Randomly generate configurations (nodes) - discard nodes that are in collision (collision check) C-obst 2. Find path in roadmap between start and goal - regenerate plans for edges in roadmap Configuration Space

Node Generation Node Connection (build edges) Query System Structure of PRM: object oriented approach Node Validation PRM motion planning framework Dynamics Local Connection Potential Functions Collision Detector Collision-free edge

Motion Planning in robotics and computational biology Configuration Space A robot Collision Detector Roadmap Construction (A connectivity graph) Query (find collision-free path) Conformation Space Model ‘ligand’ or protein as articulated robots Potential Calculation Roadmap Construction (A connectivity graph) Query (find energetically feasible path) Although the problems appear different, they can both use the same motion planning framework, which works in the abstract C-space.

PRM : Node Generation 1.Randomly generate a conformation, determine all atoms’ coordinates. 2.compute potential energy E of conformation and retain node with probability P(E): ( note, this is just one way to do it.)

PRM: Roadmap Connection 1.Find k closest nodes for each roadmap node 2.Calculate weight of straightline path [Singh, Latombe, Brutlag, 1999] where Pi is probability of moving between intermediate configurations i and i+1 Low weight more energetically feasible.

PRM: Querying the Roadmap ( A 10-ALA folding case) start goal Add start and goal to roadmap 1. Extract smallest weight path (energetically most feasible) between them

I. Protein Folding Pathways

Key Issues Goal: Study folding pathways to the known native fold. Validation —PRM roadmaps give us folding pathways, but how can we test if they are (close to) the natural folding path? Potential Functions — The paths in the PRM roadmaps are selected based on the potential function used to create the map. — How accurate does the potential function need to be to produce roadmaps with good paths? More accurate => more time and fewer nodes (less coverage)

Validation of Folding Pathways Compare the order that secondary structures form in our paths with experimental results — We used pulse labeling & N-state exchange results from Woodward et al as suggested by Dr. Marty Scholtz at Texas A&M. Protein GB1 — 56 residues — 1 alpha helix — 4 beta-strands Pulse-labeling result alpha helix (yellow) and beta-4 (red, and beta-3) form first

Potential Functions: Goal: simple, yet accurate enough for PRMs Strategy: study importance of various terms and design a potential for PRM usage — may also yield insight into which terms are most influential in governing the folding process Potentials we have used (derivations of the potential function in Levitt 1983) —van der Waals only —van der Waals + hydrogen bonds — van der Waals + hydrogen bonds + disulphide bonds — … + hydrophobic effect

Protein G results: PHI vs. PSI distribution. Nodes sampled around native fold using a set of normal distributions: {5,10,20,40, 80,160}. Protein G’s native fold has one alpha-helix & four beta strands, which is reflected here by the dense samples around alpha & beta regions.

RMSD is the distance from protein G’s native fold. The sample (10000 nodes) suggests a funnel structure around the native fold. Distribution of roadmap nodes :

Peaks show where some atoms are close and Van der Waals term dominates (which is approximated by a constant). Bigger roadmaps have smoother paths. Peaks right before goal indicate the tight packing of the native fold. Potential Profiles for different size of roadmaps:

While some peaks result from local sparse distribution of nodes, to optimize the final folding path, more samples around the peaks can improve the path. More samples around the peaks

Folding pathways for different start conformations d ef abc Letter labels Numerical Labels : Folding from the extended conformation (reverse order). : Folding path from a random node in the roadmap.

Protein GB1 Movie

Results of Protein A Nodes are sampled around the native fold using a set of normal distributions. Protein A is an all-alpha protein, which is shown by dense sampling at the alpha region.

The overall ‘funnel’ shape is different from that of protein G, which possibly reflects different folding behaviors. Note also the the narrow ‘funnel’ for RMSD< 10 A. This may suggest that region contains only the packing of secondary structures to native fold (therefore potential changes little) Protein A: Potential vs. RMSD distribution

the packing of secondary structure. Both paths share some segments near the goal, suggesting the folding path near the native fold may be quite independent of start conformations. Folding from different start conformations a bcdef

Protein Folding pathways: Summary of Preliminary Results Our results seem to be in accordance with the pulse- labeling experiments for proteins GB1and A — more proteins must be studied Potential Energy Function… —van der Waals terms + Hydrogen & disulphide bonds can capture some (high level) characteristics of the folding process — but they are probably not enough - what next? How can we analyze the paths contained in the PRM roadmaps?

II. Ligand Binding

Ligand Binding Automated docking algorithms — AutoDock,Dock,FlexX,FLOG,FTDock,Gold, etc. — Often simplified by rigid ligand assumption PRM Approach (Singh, et.al., 1999) —rapidly explores high dimensional space —We use PRM variant better suited for ligand binding User-Computer interaction — haptics (sense of touch) helps the user to understand molecular interaction — User may improve the PRM by suggesting candidate site regions.

Our Approach Generate Binding Site Canditates —Generate sample nodes (automated or user collected) —Push nodes to local minima —Connect nodes Recognize Binding Sites —Choose largest connected component ( accessibility) —Discard nodes with potential larger than E max —Score remaining nodes

Generate Candidates: Automatic Node Generation Generate a collision free base Generate random values for other joint angles Keep this configuration if the potential is less than E max Protein Ligand base

Grid Potential/Force Create a grid in space Calculate the contribution of protein atoms to each grid point Precalculate Real-Time Find the grid points where ligand atoms are located Calculate the potential/force

PHANToM User attaches haptic device to ligand, and moves it around user feels the forces on ligand ligand is rigid force calculation is too slow, so use extrapolation techniques (grid potential) Ligand configurations (candidate sites) passed to planner automatically sampled at regular intervals Generate Canditates: Haptic Interaction

Approximate Gradient Descent Push nodes to local minima For each node sample n close nodes Choose the node with lowest potential among them Repeat until local minima or iteration limit is reached

Recognizing Binding Sites Pick low energy configurations in the largest connected component For each node in the selected configuration, uniformly sample n configurations within a distance r (i.e., construct a local roadmap) Find the score of the selected configuration as the average potential of the sampled configurations (local roadmap)

Potential v.s. Scoring Function 1STP (protein=streptavidin, ligand (11 dof) =biotin) PotentialScore

Experiments Questions: What is the effect of rigid vs. flexible ligands? Can OBPRM identify binding sites? Can user provide helpful information?

Experiments First Experiment treat ligand as rigid body Second Experiment user collects and automated planner fine tunes Third Experiment treat ligand as articulated (i.e., flexible) Complexes(protein:ligand:degree of freedom) 1A5Z:L-Lactate Dehyrogenase:Oxamate:7 1LDM: M 4 -Lactate Dehyrogenase:Oxamate:7 1STP:Streptavidin:Biotin:11 } In Singh,et.al., 1999

Results (1A5Z and 1STP) Timing. Rigid. User. Articulated. Binding RMSD vs Score (1A5Z, 7dof) Timing. Rigid. User. Articulated. Binding RMSD vs Score (1STP, 11dof)

Results (1LDM) Timing RMSD vs Score (7dof). User. Articulated. Binding

Conclusion In our examples, we could generate and identify configurations in the true binding site Scoring function may be improved Our results may used as input for other automated docking programs User input improves efficiency, and haptic feedback helps the user better understand the problem We need a better user interface

More at For contact: