In studying protein-protein interactions it is important to accurately describe the surface of the proteins, as that is where the interactions occur. The.

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

Bio-CAD M. Ramanathan Bio-CAD. Molecular surfaces Bio-CAD.
Bioinformatics Vol. 21 no (Pages ) Reporter: Yu Lun Kuo (D )
A Geometric Database for Gene Expression Data Baylor College of Medicine Gregor Eichele Christina Thaller Wah Chiu James Carson Rice University Tao Ju.
A New Analytical Method for Computing Solvent-Accessible Surface Area of Macromolecules.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.
Protein Primer. Outline n Protein representations n Structure of Proteins Structure of Proteins –Primary: amino acid sequence –Secondary:  -helices &
Mark J. van der Woerd1, Donald Estep2, Simon Tavener2, F
Alpha Shapes and molecular Representations Research Project CSC/MATH 870 SPRING 2007 Philipp Richter.
The Protein Data Bank (PDB)
Subdue Graph Visualizer by Gayathri Sampath, M.S. (CSE) University of Texas at Arlington.
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Geometric molecular surface modeling using mathematical morphology operators. Journal of Molecular Graphics Volume 13, Issue 6, Pages (December.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Geometric molecular surface modeling using mathematical morphology operators. Journal of Molecular Graphics Volume 13, Issue 6, Pages (December.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Number of released entries Year. Growth of Molecular Complexity Number of Chains Year Number of Structures Containing that Number of Chains.
Structural alignments of Proteins using by TOPOFIT method Vitkup D., Melamud E., Moult J., Sander C. Completeness in structural genomics. Nature Struct.
Lee-Richards Approximation by Intra-Residue Reentrant Surface Exclusion Mario Rosa†, Paul Craig*†, Herbert J. Bernstein*‡
Introduction In recent years, products are required to follow the trend of fashion. It is very popular in using freeform surface to design the model of.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
Language Objective: Students will be able to practice agreeing and disagreeing with partner or small group, interpret and discuss illustrations, identify.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Being Smart with Graphs This material is based upon work supported by the National Science Foundation under Grant No. DRL ==≠≠ == Any opinions,
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Molecular Dynamics Simulation of Bacterial [FeNi]-Hydrogenase Found in Desulfovibrio fructosovorans Solvated in a Water Sphere Introduction Hydrogenase.
Covalent interactions non-covalent interactions + = structural stability of (bio)polymers in the operative molecular environment 1 Energy, entropy and.
Molecular Dynamics Simulations of Compressional Metalloprotein Deformation Andrew Hung 1, Jianwei Zhao 2, Jason J. Davis 2, Mark S. P. Sansom 1 1 Department.
Protein motif extraction with neuro-fuzzy optimization Bill C. H. Chang and Author : Bill C. H. Chang and Saman K. Halgamuge Saman K. Halgamuge Adviser.
LSM3241: Bioinformatics and Biocomputing Lecture 6: Fundamentals of Molecular Modeling Prof. Chen Yu Zong Tel:
Bioinformatics Project BB201 Metabolism A.Nasser
Typically, classifiers are trained based on local features of each site in the training set of protein sequences. Thus no global sequence information is.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
MATLAB ® for Engineers, Holly Moore Fourth Edition, Global Edition © Pearson Education Limited 2015 All rights reserved. Figure 5.1 Simple Plot of Time.
Parameter Reduction for Density-based Clustering on Large Data Sets Elizabeth Wang.
Lecture 11 Introduction to R and Accessing USGS Data from Web Services Jeffery S. Horsburgh Hydroinformatics Fall 2013 This work was funded by National.
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
A Computational Study of RNA Structure and Dynamics Rhiannon Jacobs and Harish Vashisth Department of Chemical Engineering, University of New Hampshire,
Cysteine Oxidation Prediction Program (COPP): A New Software Program That Predicts Reversible Protein Cysteine Thiol Oxidation Reactions Ricardo Sanchez,
Cloud-Based Visualization of Value-Added Model Annotations Using Jmol Bob Hanson St. Olaf College, Northfield, MN
J Comput Chem 26: 334–343, 2005 By SHURA HAYRYAN, CHIN-KUN HU, JAROSLAV SKRˇ IVA′ NEK, EDIK HAYRYAN, IMRICH POKORNY.
Bayesian Refinement of Protein Functional Site Matching
Structural Basis for the Highly Selective Inhibition of MMP-13
Daniel M. Freed, Peter S. Horanyi, Michael C. Wiener, David S. Cafiso 
Volume 23, Issue 12, Pages (December 2015)
Walking the Interactome for Prioritization of Candidate Disease Genes
Yeast RNA Polymerase II at 5 Å Resolution
Rapid Detection of TEM-Type Extended-Spectrum β-Lactamase (ESBL) Mutations Using Lights-On/Lights-Off Probes with Single-Stranded DNA Amplification  Kenneth.
Mark Ultsch, Nathalie A Lokker, Paul J Godowski, Abraham M de Vos 
Volume 13, Issue 4, Pages (February 2004)
Cell Size Surface area : Volume
Volume 7, Issue 2, Pages (February 1999)
A Comparison of Genotype-Phenotype Maps for RNA and Proteins
Volume 26, Issue 3, Pages e2 (March 2018)
Volume 26, Issue 3, Pages e2 (March 2018)
Low-Resolution Structures of Proteins in Solution Retrieved from X-Ray Scattering with a Genetic Algorithm  P. Chacón, F. Morán, J.F. Díaz, E. Pantos,
Volume 90, Issue 1, Pages (July 1997)
Volume 9, Issue 1, Pages (January 2002)
Till Siebenmorgen, Martin Zacharias  Biophysical Journal 
Structural Basis for the Highly Selective Inhibition of MMP-13
Absence of Ion-Binding Affinity in the Putatively Inactivated Low-[K+] Structure of the KcsA Potassium Channel  Céline Boiteux, Simon Bernèche  Structure 
Replica Exchange Molecular Dynamics Simulations Provide Insight into Substrate Recognition by Small Heat Shock Proteins  Sunita Patel, Elizabeth Vierling,
Figure 1. Prediction result for birch pollen allergen Bet v 1 (PDB: 1bv1), as obtained by comparison to the cherry ... Figure 1. Prediction result for.
Volume 84, Issue 1, Pages (January 2003)
Volume 22, Issue 10, Pages (October 2014)
Volume 7, Issue 2, Pages (February 1999)
Patrick J. Fleming, Karen G. Fleming  Biophysical Journal 
Presentation transcript:

In studying protein-protein interactions it is important to accurately describe the surface of the proteins, as that is where the interactions occur. The most common surface representation in molecular visualization programs is the Lee-Richards (LR) surface, which is generated by rolling a probe representing a solvent molecule on the van der Waals surface of the protein. This approach is slowed computationally because the program must consider 2 or three atoms at a time. The resulting molecular surface is like a Van der Waals surface, but with reentrant surfaces bridging gaps between reasonably close atoms. Also, the LR surface algorithm sometimes incorrectly assigns atoms to the surface of the protein. We are developing a new algorithm with the hope that it will be both more accurate and faster than current approaches. In the algorithm, atoms are first identified by their accessibility to an imaginary water molecules; it then finds the number of these atoms which are within distance epsilon of atom i. The algorithm then explores a sphere with radius epsilon around atom i. The sphere is divided into 8 sectors and the algorithm determines how many of the n atoms contained in the sphere fall into each of the sectors. If all 8 sectors contain at least one atom, then the atom in question is considered an inside atom and given a value of 0. If at least one sector contains no atoms then the atom in question is considered outside and is given a value of 1. This algorithm is currently implemented in Minitab via the use of a global macro. Atomic coordinates were extracted using an awk script and saved to text files. Preliminary results show that the algorithm is flexible and can identify outside atoms over different shapes of proteins. Also a unique pattern emerged: no matter the protein, there was a limit on the percentage of atoms found on the outside; the graphs show identical limits for all proteins studied thus far. The downside to this approach is that it is computationally burdensome. We are currently testing the algorithm with different proteins and are investigating the association of limits with protein size/shape. Our next step will be to program the algorithm in a different language such as python, java or C++ to see if a more robust language can decrease the run time. In conclusion, this new algorithm is effective on a variety of proteins, shows a unique aspect of proteins as seen in the limits which possibly can lead into new insights into protein surfaces. A STATISTICAL ANALYSIS AND APPROACH TO PROTEIN SURFACE MODELING Luticha Doucette†, James Halavin*, Paul Craig°, Herbert J. Bernstein‡ †RIT Life Sciences, *RIT Mathematical Sciences, °RIT Chemistry ‡Dowling College, Mathematics and Computer Science Figure 1. 3D Minitab Scatter Plot of 1UAQ Figure 1 shows protein 1UAQ with epsilon = 40. Number of outside atoms found = 434 Figure 2 shows how a change in epsilon changes how many atoms are found. Epsilon = 4.6 Number of outside atoms = 1593  The current algorithm is flexible and can handle many different shaped proteins  Proteins show similar asymptotic curves representing the number of atoms that are identified as being on the protein surface  Once implemented in Python, the algorithm runs faster and does not crash like in Minitab  Refine the algorithm in Python  Test the refined algorithm with proteins used in Minitab, compare results  Once the algorithm is refined, expand tests to different types of proteins according to class and size  Incorporate the algorithm into existing ProMol extension as another tool  Publish!  Create a faster, more accurate representation of a protein surface  Implement the algorithm in a more robust language such as Python  Investigate the implications of limits in protein surfaces Created global macros, used awk script to obtain coordinates of atoms from PDB files Opened Minitab in Windows. A global macro was created and added to the Minitab file menu. In Python, reinterpreted the algorithm and tested on the same PDB files as was tested in Minitab  Scott “JT” Mengel  Jon Schull  Eulas Boyd and NSF-LSAMP  The authors gratefully acknowledge the assistance of current and former students who have worked at Dowling College and at RIT on the SBEVSL project. Funding: This work has been supported in part by National Science Foundation Division of Undergraduate Education grant , National Institute of General Medical Sciences grants 2R15GM , 3R15GM S1. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agencies. Steinkellner, G; Rader, R.; Thallinger, G.G; Kratky, C.; Gruber, K. VASCo: computation and visualization of annotated surface protein surface contacts. BMC Bioinformatics. 2009, 10, (accessed June 7, 2010). Liu, Y.S; Fang, Y; Ramani, K. IDSS: deformation invariant signatures for molecular shape comparison. BMC Bioinformatics 2009, 10, (accessed June 7, 2010). Hoffmann, B.; Zaslavskiy, M.; Vert, J.P.; Stoven, V. A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction. BMC Bioinformatics 2010, 11, (accessed June 7, 2010). Bash, P. A.; Pattabiraman, N.; Huang, C.; Ferrin, T.E; Langridge, R. Van Der Waals Surfaces in Molecular Modeling: Implementation with Real-Time Computer Graphics. Science. New Series. 1983, 222, 4630 pg 1325 – (accessed June 8, 2010). Kuntz, I.; Blaney, J.M.; Oatley, S. J.; Langridge, R.; Ferrin, T.E. A geometric approach to macromolecule-ligand interactions. J. Molec. Bio. 1982, 161, 2. Pg 269 – 288 doi: / (82)90153-X (accessed June 8, 2010). Connolly, M.L. Solvent-Accessible Surfaces of Proteins and Nucleic Acids. Science, New Series. 1983, 221,4612 pg 709 – (accessed June 8, 2010). Bernstein, H.J.; Craig, P.A. Efficient molecular surface rendering by linear-time pseudo-Gaussian approximation to Lee- Richards surfaces (PGALRS). Journal of Applied Crystallography. 2010, 43, 2. Pg 356 – 361. Peter J. Artymiuk, Andrew R. Poirrette, Helen M. Grindley, David W. Rice, Peter Willett, A Graph-theoretic Approach to the Identification of Three-dimensional Patterns of Amino Acid Side-chains in Protein Structures, Journal of Molecular Biology, Volume 243, Issue 2, 20 October 1994, Pages , ISSN , /jmbi ( Table 1 shows that as epsilon is increased, the number of outside atoms found reaches a limit. Center (x(i), y(i), z(i)) An imaginary surface is created and divided into 8 sectors. If one sector is is empty, value = 1. If all sectors filled, value = 0. Figure 2. 3D Scatter Plot of 1UAQ with Change in Epsilon EpsilonNumber Inside (0) Number Outside (1) Total Number of Atoms (N) Table 1. Limiting Asymptotic Value for 1UAQ Figure 3. ED Scatter Plot of 3CTK Figure 3 shows protein 3CTK with epsilon set to 70. Maximum radius was found to be Number of outside atoms found = 469 Figure 4. Plot of Maximum Radius vs Percent Outside Proteins studied thus far show very similar asymptotes as seen in Figure 4. All have a sharp decrease until the radius is about 20% of the total number of atoms then it levels off. Figure 6. 1UAQ Implemented in vPython From left to right: vPython places a boxel around each atom, boxel is subdivided into 8 sectors, if other atoms are within those sectors Python returns false and that atom is eliminated. Only outside atoms are left with their corresponding boxels as shown in yellow. Figure 5. 3D Scatter Plot of 1AV1 The new algorithm is flexible in that it can identify outside atoms in unusually shaped proteins as seen in Figure 5. This plot is a different representation of a change in epsilon as seen in figures 1 and 2. Figure 7. Python vs. Minitab Figure 9. 1AV1 Python Implementation 3D scatter On the left is 1AV1 in the next iteration of the algorithm. Done in Python the blue represents the maximum number of atoms found to be on the outside, while green represents interior atoms out of 6588 atoms were found which is 25% of the total, consistent with the asymptotic curves as seen in Figure 4. as well as the 3D scatter plot in Figure 5. On the right, is just the outside atoms, represented in blue. On the left is 1UAQ from Python. The curve is not plotted as percent of maximum radius vs percent outside but shows some discrepancies between Minitab and Python. Further refinement of Python should resolve this issue.