By Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi.

Slides:



Advertisements
Similar presentations
Amino Acids and Proteins B.2. there are about 20 amino acids that occur naturally they are the basic “building blocks” of life/proteins.
Advertisements

Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific sequence 2.A protein’s function depends.
Probabilistic Roadmap
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Geometric Algorithms for Conformational Analysis of Long Protein Loops J. Cortess, T. Simeon, M. Remaud- Simeon, V. Tran.
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song and Nancy M. Amato Department of Computer Science Texas A&M University College Station,
1 Last lecture  Configuration Space Free-Space and C-Space Obstacles Minkowski Sums.
Biology 107 Macromolecules II September 9, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
Using Motion Planning to Study Ligand Binding and Protein Folding Nancy Amato,Guang Song and Burchan Bayazit Department of Computer Science Texas A&M University.
Polypeptides – a quick review A protein is a polymer consisting of several amino acids (a polypeptide) Each protein has a unique 3-D shape or Conformation.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Biology 107 Macromolecules II September 5, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
College 4. Coordination interaction A dipolar bond, or coordinate covalent bond, is a description of covalent bonding between two atoms in which both.
Mossbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois. Editors: J. T. P. DeBrunner and E.
Biology 107 Macromolecules II September 8, 2003.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song, Nancy M. Amato Department of Computer Science Texas A&M University College Station,
1. Primary Structure: Polypeptide chain Polypeptide chain Amino acid monomers Peptide linkages Figure 3.6 The Four Levels of Protein Structure.
Stochastic roadmap simulation for the study of ligand-protein interactions Mehmet Serkan Apaydin, Carlos E. Guestrin, Chris Varma, Douglas L. Brutlag and.
RNA Folding Kinetics Bonnie Kirkpatrick Dr. Nancy Amato, Faculty Advisor Guang Song, Graduate Student Advisor.
Proteins Structures Primary Structure.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces Kavraki, Svestka, Latombe, Overmars 1996 Presented by Chris Allocco.
Daily Starter  Explain how a peptide bond is formed. (What is the reaction called and how does it happen?)
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules. V. proteins are macromolecules that are polymers formed from amino acids monomers A. proteins have great structural diversity.
Generating Better Conformations for Roadmaps in Protein Folding PARASOL Lab, Department of Computer Science, Texas A&M University,
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
STRUCTURAL ORGANIZATION
Using Motion Planning to Study Protein Folding Pathways Susan Lin, Guang Song and Nancy M. Amato Department of Computer Science Texas A&M University
Amino Acids and Proteins B.2. Properties of 2-amino acids (B.2.2) Zwitterion (dipolar) – amino acids contain both acidic and basic groups in the same.
3.2 Proteins Mini Lecture Radjewski. Major functions of proteins: Enzymes—catalytic proteins Defensive proteins (e.g., antibodies) Hormonal and regulatory.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces (1996) L. Kavraki, P. Švestka, J.-C. Latombe, M. Overmars.
Biochemical Overview on Amino Acids Structure and Protein Structure and Function Dr. Waheed Al-Harizi Medical Biochemistry Department College of Medicine.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
Proteins are instrumental in about everything that an organism does. These functions include structural support, storage, transport of other substances,
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
Proteins. Slide 2 of 19 Proteins  Polymers composed of amino acids  Protein = Polypeptide (polymer)  Monomer = Amino acids  Peptide bonds  Amino.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Proteins: Molecules with Diverse Structures and Functions
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Intended Learning Objectives You should be able to… 1. Give 3 examples of proteins that are important to humans and are currently produced by transgenic.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Proteins Structure and Function. PROTEINS Proteins are essential to the structures and activities of life...
PROTEIN STRUCTURE Brianne Morgan, Adrienne Trotto, Alexis Angstadt.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Proteins A protein chain will have somewhere in the range of 50 to 2000 amino acid residues. This term is used because strictly speaking a peptide chain.
Protein Structure. Insulin Infinite variety The number of possible sequences is infinite An average protein has 300 amino acids, At each position there.
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Structural organization of proteins
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Chapter 5 Proteins.
Proteins account for more than 50% of the dry mass of most cells
© SSER Ltd..
Chemical agents PROTEINS: The Molecular Tools of the Cell
Proteins have a very wide range of functions in living organisms.
CS 326A: Motion Planning Probabilistic Roadmaps for Path Planning in High-Dimensional Configuration Spaces (1996) L. Kavraki, P. Švestka, J.-C. Latombe,
Proteins account for more than 50% of the dry mass of most cells
Diverse Macromolecules
Chapter 3 Proteins.
Protein Structure Prediction
Protein Structure Chapter 14.
Proteins account for more than 50% of the dry mass of most cells
Proteins.
Presentation transcript:

by Guang Song and Nancy M. Amato Journal of Computational Biology, April 1, 2002 Presentation by Athina Ropodi

 Introduction- Definition Motion planning Probabilistic Roadmap Method Denaturation C-space  Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations  Results

 Motion planning is a term used in robotics for the process of detailing a task into atomic robotic motions.  This issue is also known as the “navigation problem”:  Given an environment, a start and a goal position of an object, the objective is to find a valid path (continuous sequence of valid configurations) from start to goal.

 Probabilistic Roadmap (PRM) motion planning techniques are applied to “small” proteins (up to 60 residues) in order to compute folding pathways from a denaturated state to its native fold.  Denaturate: To cause the structure to unfold, so that some of its original properties, especially its biological activity, are diminished or eliminated. Usually caused by extreme conditions, e.g. high temperature.

 In quaternary structure denaturation, protein sub-units are dissociated and/or the spatial arrangement of protein subunits is disrupted.  Tertiary structure denaturation involves the disruption of:  Covalent interactions between amino acid side chains (such as disulfide bridges between cysteine groups)  Noncovalent dipole-dipole interactions between polar amino acid side chains (and the surrounding solvent)  Van der Waals (induced dipole) interactions between nonpolar amino acid side chains.  In secondary structure denaturation, proteins lose all regular repeating patterns such as alpha-helices and beta- pleated sheets, and adopt a random coil configuration.  Primary structure, such as the sequence of amino acids held together by covalent peptide bonds, is not disrupted by denaturation

 The article investigates the folding mechanisms of a protein assuming we know its native fold.  Results are validated by comparing the formation order to pulse-labeling experimental results.  The configuration space (C-Space) of a movable object is the space consisting of all positions and orientations of that object.  Pulse labeling is a biochemistry technique of identifying the target molecule presence by inclusion of a pulse of a radioactive compound.

 Introduction- Definition Motion planning Probabilistic Roadmap Method Denaturation C-space  Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations  Results

 Any complete motion planner would require time exponential in the number of degrees of freedom (dof).  Several methods such as energy minimization, molecular dynamics, Monte-Carlo and genetic algorithms have been used.  This method tries to simulate the true dynamics of the folding process using the classical Newton’s motion equations.  However, an exact simulation would depend on the start conformation and could result in local minima.

The basic steps are:  First of all, PRM samples points randomly from C-space and retains those that satisfy certain requirements.  Then, the points are connected and form a graph using a simple planning method.  Finally, paths connecting the start and goal configurations are extracted using standard graph search techniques.  [Kavraki, Svestka, Latombe,Overmars 1996] In this case, low-energy conformations are preferable.

 All atomic bond lengths and angles are considered to be constants.  We only consider 2 dofs, φ and ψ angles.  Side-chains are modeled as spheres with no dof.  Fold positions (atomic bonds) correspond to joints and atoms correspond to links. Thus, for k residues our model has 2k links and 2k revolute joints…

 Different configurations are produced by assigning possible angle values.  But, the nodes are accepted or rejected based on their potential energy: where E min=50000 and Emax=89000 KJouls/mol A configuration with higher potential is more likely to be rejected.

 Due to the high dimensionality of the problem, a very dense uniform sampling is required.  Since prior knowledge of the native fold is assumed, a sampling strategy biased to the native fold is applied:  Sampling is performed from a set of normal distributions around the native fold. The standard deviations used are {5, 10, 20, 40, 80,160 degrees}. Small STDs capture the detail around the goal, and larger ensure adequate roadmap coverage

 For each node, the k-nearest neighbors are found(k=20 and the metric is Euclidean).  For each connection a feasibility check is performed. (two nodes are connected by a straight line)  For 2 consecutive intermediate conformations, i and i+1, we first check their potential energies and then the probability of moving from i to i+1: RMSD metric proved inferior

 The total weight of the edge is:  Dijkstra’s algorithm is then used to find the smallest weight path.  Path optimization: resampling is performed around the nodes of paths with high potential.

 To reduce the cost of calculations, an approximation function is used. We only consider contribution from side chains and those are modeled as spheres.  The cost is then reduced by 2 orders of magnitude. The 1 st term represents constraints that favor secondary structure, hydrogen and disulphide bonds The 2 nd the van der Waals interactions among atoms.

 Introduction- Definition Motion planning Probabilistic Roadmap Method Denaturation C-space  Probabilistic Roadmap Method Basic steps Degrees of freedom/C-space Node Generation Sampling strategy Potential Energy Computations  Results

 Folding process of GB1:

 Protein GB1(56 residues): 1 α-helix and 4 β-strands  Protein A (60 residues): 3 α-helices

Peaks show where atoms are close and Van der Waals interactions dominate. Bigger roadmaps have smoother paths.

More samples around the peaks improve the path.

 Different pathways tend to come together and appear to have some common channels, as they approach the native fold.

Bibliography: 1. Probabilistic roadmaps for path planning in high-dimensional configuration spaces L. Kavraki, P. Svestka, J-C. Latombe, M. H. Overmars, Protein Folding by Restrained Energy Minimization and Molecular Dynamics Michael Levitt, esearch/folding/index.shtml.OLD2