AMPLE – Using de novo protein structure modelling techniques to create and enhance search models for use in Molecular Replacement Figure taken from:

Slides:



Advertisements
Similar presentations
Transmembrane Protein Topology Prediction Using Support Vector Machines Tim Nugent and David Jones Bioinformatics Group, Department of Computer Science,
Advertisements

Structural Classification and Prediction of Reentrant Regions in Alpha-Helical Transmembrane Proteins: Application to Complete Genomes Håkan Viklunda,
AMPLE – Using de novo or ab initio protein structure modelling techniques to create and enhance search models for use in Molecular Replacement Jaclyn Bibby,
Secondary structure prediction from amino acid sequence.
Functional Site Prediction Selects Correct Protein Models Vijayalakshmi Chelliah Division of Mathematical Biology National Institute.
SAN DIEGO SUPERCOMPUTER CENTER Blue Gene for Protein Structure Prediction (Predicting CASP Targets in Record Time) Ross C. Walker.
Protein Structure Prediction using ROSETTA
Presented by Next Generation Simulations in Biology: Investigating biomolecular structure, dynamics and function through multiscale modeling Pratul K.
Structural bioinformatics
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Garnier-Osguthorpe-Robson
Thomas Blicher Center for Biological Sequence Analysis
Mark J. van der Woerd1, Donald Estep2, Simon Tavener2, F
Comparative ab initio prediction of gene structures using pair HMMs
The Protein Data Bank (PDB)
Reading Report Ce WANG A segment alignment approach to protein comparison.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Solving the Protein Threading Problem in Parallel Nocola Yanev, Rumen Andonov Indrajit Bhattacharya CMSC 838T Presentation.
Evaluating alignments using motif detection Let’s evaluate alignments by searching for motifs If alignment X reveals more functional motifs than Y using.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Utilizing Fuzzy Logic for Gene Sequence Construction from Sub Sequences and Characteristic Genome Derivation and Assembly.
Recap Don’t forget to – pick a paper and – me See the schedule to see what’s taken –
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
28 th March 2007 MrBUMP – Automated Molecular Replacement Ronan Keegan, Martyn Winn CCP4, Daresbury Laboratory.
Protein Structure Prediction. Historical Perspective Protein Folding: From the Levinthal Paradox to Structure Prediction, Barry Honig, 1999 A personal.
Predicting Secondary Structure of All-Helical Proteins Using Hidden Markov Support Vector Machines Blaise Gassend, Charles W. O'Donnell, William Thies,
Lecture 10 – protein structure prediction. A protein sequence.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Representations of Molecular Structure: Bonds Only.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
U N I V E R S I T Y O F S O U T H F L O R I D A Database-centric Data Analysis of Molecular Simulations Yicheng Tu *, Sagar Pandit §, Ivan Dyedov *, and.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study George Chikenji*, Yoshimi Fujitsuka, and Shoji Takada*
Modelling Genome Structure and Function Ram Samudrala University of Washington.
R. Keegan 1, J. Bibby 3, C. Ballard 1, E. Krissinel 1, D. Waterman 1, A. Lebedev 1, M. Winn 2, D. Rigden 3 1 Research Complex at Harwell, STFC Rutherford.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Patentability Considerations in the 3-D Structure Arts Patentability Considerations in the 3-D Structure Arts Michael P. Woodward Supervisory Patent Examiner.
Structural proteomics
COMPUTERS IN BIOLOGY Elizabeth Muros INTRO TO PERSONAL COMPUTING.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Structural proteomics Handouts. Proteomics section from book already assigned.
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke.
CS 177 Proteins I (Structure-function relationships) Review of protein structures Computational Modeling Three-dimensional structural analysis in laboratory.
BLAST, which stands for basic local alignment search tool, is a heuristic algorithm that is used to find similar sequences of amino acids or nucleotides.
Database Similarity Search. 2 Sequences that are similar probably have the same function Why do we care to align sequences?
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
CISC Machine Learning for Solving Systems Problems Presented by: Eunjung Park Dept of Computer & Information Sciences University of Delaware Solutions.
CCP4 Molecular Replacement Model Generation Create a CCP4i task for generating Molecular Replacement models. - Selecting suitable PDB entries, based on.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Structure/function studies of HIV proteins HIV gp120 V3 loop modelling using de novo approaches HIV protease-inhibitor binding energy prediction.
Fitness Guided Fault Localization with Coevolutionary Automated Software Correction Case Study ISC Graduate Student: Josh Wilkerson, Computer Science ISC.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Peptide de novo sequencing Peptide de novo sequencing is the analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum.
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
Introduction to Bioinformatics II
Rosetta: De Novo determination of protein structure
Protein structure prediction.
Molecular Packing and Packing Defects in Helical Membrane Proteins
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Putative targets of miRNAs directly induced by p53 with down-regulated mRNA- and de novo protein synthesis or reduced de novo protein synthesis only. Putative.
Presentation transcript:

AMPLE – Using de novo protein structure modelling techniques to create and enhance search models for use in Molecular Replacement Figure taken from: Summary Ab initio modelling offers exciting prospects for solving structures for novel proteins that cannot be solved with existing methods. There are many possibilities for extending and improving our approach that we will be exploring in future work. REFS: [1] Simons, K.T., Kooperberg, C., Huang, E., Baker, D., Journal of molecular biology 268, 209–225. [2] Bibby, J., Keegan, R.M., Mayans, O., Winn, M.D., Rigden, D.J., Acta Crystallographica Section D Biological Crystallography 68, 1622–1631. [3] Barth, P., Wallner, B., Baker, D., Proceedings of the National Academy of Sciences 106, 1409–1414. [4] Marks DS, Colwell LJ, Sheridan R, Hopf TA, Pagnani A, et al. (2011) PLoS ONE 6(12):e doi: /journal.pone Ab initio Modelling with Rosetta The Rosetta [1] suite of programs can be used to generate structures for proteins starting only from their sequence. Rosetta collects short fragments of residues of known structures similar to the target and then strings these fragments together to generate a collection of models of the target structures (decoys). Ab initio Models for Molecular Replacement Modelling produces clusters of similar models: MR works effectively with superimposed ensembles approximating the target. Within and between clusters similarity indicates accuracy, so we can trim (truncate) inaccurate regions: MR may only require a partial model. AMPLE AMPLE [2] takes ab initio models (such as those generated with Rosetta) and clusters the most similar ones together. For each cluster it then truncates the most dissimilar residues using a range of different variance thresholds, leaving a core of similar residues. The truncated models are clustered together and aligned to create an ensemble of similar models. These ensembles are then used as candidate models in molecular replacement and have been found to be very successful. Previous work was restricted to relatively small globular proteins, but we have been looking at extending the approach to more complex proteins. We were granted early-access time on the Blue Wonder supercomputer at STFC's Hartree center at Daresbury Laboratory. This is an IBM System iDataPlex cluster comprising 8,192 Intel Xeon E processor cores. AMPLE was extended to make efficient use of the cluster allowing us to rapidly benchmark the code on 113 different protein structures. Transmembrane Proteins Transmembrane proteins sit within the cell membrane and are in a non-aqueous environment that makes them difficult to work with experimentally. However, the membrane environment imposes additional constraints on the protein that can be used to guide and focus the modelling. This work has looked at extending AMPLE to work with Rosetta transmembrane models [3] for 18 different proteins. Coiled-Coils These are proteins where the alpha-helix coils are themselves coiled around each other. Placement of a small portion of one helix (sometimes even the wrong one) enables MR to "bootstrap" into the correct whole structure, so we are also looking at a collection of 95 different coiled-coil structures. Contact Prediction Recent work has significantly improved the accuracy with which residues that are in close contact with each other ("contacts") can be predicted from the sequence alone; this is due to a combination of the huge numbers of new sequences that are available, but also improved contact prediction algorithms [4]. Small numbers of contacts can be used to guide modelling, significantly improving the results and extending it to much larger proteins then had previously been possible. We are currently extending AMPLE to take advantage of these contacts. Figure from reference 4. High-Performance Computing Ronan Keegan, Jens Thomas, Jaclyn Bibby, Martyn Winn and Dan Rigden CCP4, STFC Rutherford Appleton Laboratory and University of Liverpool, UK,