Predicting Protein Structure: Comparative Modeling (homology modeling)

Slides:



Advertisements
Similar presentations
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Advertisements

Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Thomas Blicher Center for Biological Sequence Analysis
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Homology Modeling Seminar produced by Hanka Venselaar.
Protein Tertiary Structure Prediction Structural Bioinformatics.
An introduction and homology modeling
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Computer-Aided Protein Structure Prediction Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial Technology Institute of Microbial.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Comparative Modelling Threading Baldomero Oliva Miguel UniversitatPompeuFabra.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Structure prediction: Homology modeling
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Homology Modeling 原理、流程,還有如何用該工具去預測三級結構 Lu Chih-Hao 1 1.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Lab Lab 10.2: Homology Modeling Lab Boris Steipe Departments of Biochemistry and.
PROTEIN MODELLING Presented by Sadhana S.
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Computer-Aided Protein Structure Prediction
Computer-Aided Protein Structure Prediction
Computer-Aided Protein Structure Prediction
Homology modeling in short…
Presentation transcript:

Predicting Protein Structure: Comparative Modeling (homology modeling)

? KQFTKCELSQNLYDIDGYGRIALPELICTMF HTSGYDTQAIVENDESTEYGLFQISNALWCK SSQSPQSRNICDITCDKFLDDDITDDIMCAK KILDIKGIDYWIAHKALCTEKLEQWLCEKE Predicting Protein Structure: Comparative Modeling (formerly, homology modeling) Use as template & model 8lyz 1alc KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAK FESNFNTQATNRNTDGSTDYGILQINSRWWCND GRTPGSRNLCNIPCSALLSSDITASVNCAKKIV SDGNGMNAWVAWRNRCKGTDVQAWIRGCRL Share Similar Sequence Homologous

In an ideal world, we would be able to accurately predict protein structure from the sequence only! Because of the myriad possible configurations of a protein chain – This goal can’t reliably be achieved, yet. Knowledge based prediction vs. Simulation based on physical forces. Here we will only concern ourselves with knowledge-based methods, although we might use simulation in order to optimize our models. Structure prediction

MNIFEMLRID EGLRLKIYKD TEGYYTIGIG HLLTKSPSLN AAKSELDKAI GRNCNGVITK DEAEKLFNQD VDAAVRGILR NAKLKPVYDS LDAVRRCALI NMVFQMGETG VAGFTNSLRM LQQKRWDEAA VNLAKSRWYN QTPNRAKRVI TTFRTGTWDA YKNL Can we predict protein structures ? ab initio folding simulation: not yet... Rosetta approach: neither... Fold recognition (threading): Often works, but... ???

obtain sequence (target) fold assignment comparative modeling ab initio modeling build, assess model Approaches to predicting protein structures

Homology Modelling of Proteins Definition: Prediction of three dimensional structure of a target protein from the amino acid sequence (primary structure) of a homologous (template) protein for which an X-ray or NMR structure is available. Why a Model: A Model is desirable when either X-ray crystallography or NMR spectroscopy cannot determine the structure of a protein in time or at all. The built model provides a wealth of information of how the protein functions with information at residue property level. This information can than be used for mutational studies or for drug design.

Homology modeling = Comparative protein modeling = Knowledge-based modeling Idea: Extrapolation of the structure for a new (target) sequence from the known 3D-structures of related family members (templates).

Homology models have RMSDs less than 2Å more than 70% of the time. Homology models can be very smart!

identity Number of residues aligned Percentage sequence identity/similarity (B.Rost, Columbia, NewYork) Sequence identity implies structural similarity Don’t know region..... Sequence similarity implies structural similarity?

Step 1 in Homology Modeling - Fold Identification Aim: To find a template or templates structures from protein data base Improved Multiple sequence alignment methods improves sensitivity - remote homologs PSIBLAST, CLUSTAL pairwise sequence alignment - finds high homology sequences BLAST

Comparative Modeling Known Structures (Templates) Target Sequence Template Selection Alignment Template - Target Structure modeling Structure Evaluation & Assessment Homology Model(s) Protein Data Bank PDB  Database of templates Separate into single chains Remove bad structures (models) Create BLAST database

Model Building from template Multiple templates Protein Fold Core conserved regions Variable Loop regions Side chains Calculate the framework from average of all template structures Generate one model for each template and evaluate

I. Manual Modeling [ ]

averaging core template backbone atoms (weighted by local sequence similarity with the target sequence) Leave non-conserved regions (loops) for later …. a) Build conserved core framework II. Template based fragment assembly

Dressing up the Core Model Core Model-Rigid Body Assembly Add Side chains Add loops End Game in protein folding - Molecular dynamics of all atoms in explicit solvent

use the “spare part” algorithm to find compatible fragments in a Loop-Database “ab-initio” rebuilding of loops (Monte Carlo, molecular dynamics, genetic algorithms, etc.) b) Loop modeling II. Template based fragment assembly

Loop Builders Mini protein folding problem- 3 to 10 residues longer in membrane proteins Ab Initio methods - generates various random conformations of loops and score Compare the loop sequence string to DB and get hits and evaluate. Some Homology modeling methods have less number of loops to be added because of extensive multiple sequence alignment of profiles Loops result from substitutions, insertions and deletions in the same family

Using database of loops which appear in known structures. The loops could be catagorised by their length or sequence Ab initio methods - without any prior knowledge. This is done by empirical scoring functions that check large number of conformations and evaluates each of them. Construction of loops might be done by:

c) Side Chain placement Find the most probable side chain conformation, using homologues structures back-bone dependent rotamer libraries energetic and packing criteria II. Template based fragment assembly

modeling will produce unfavorable contacts and bonds  idealization of local bond and angle geometry extensive energy minimization will move coordinates away  keep it to a minimum SwissModel is using GROMOS 96 force field for a steepest descent d) Energy minimization II. Template based fragment assembly

d) Energy minimization II. Template based fragment assembly

Homology Modeling Programs Modeller ( Swiss-Model ( Whatif (

Swiss-Model Method: Knowledge-based approach. Requirements: At least one known 3D-structure of a related protein. Good quality sequence alignements. Procedures: Superposition of related 3D-structures. Generation of a multiple a alignement. Generation of a framework for the new sequence. Rebuild lacking loops. Complete and correct backbone. Correct and rebuild side chains. Verify model structure quality and check packing. Refine structure by energy minimisation and molecular dynamics.

Model Confidence Factors The Model B-factors are determined as follows: The number of template structures used for model building. The deviation of the model from the template structures. The Distance trap value used for framework building. The Model B-factor is computed as: 85.0 * (1/ # selected template str.) * (Distance trap / 2.5) and 99.9 for all atoms added during loop and side-chain building

Verifying the Model PROCHECK WHAT IF PROSA II VERIFY 3D, Profile3D

Errors in Models !!! Incorrect template selection Incorrect alignments Errors in positioning of sidechains and loops

General Structure Prediction Scheme Any given protein sequence Structure selection Check sequence identity with proteins with known structure Homology Modeling > 35% Fold Recognition ab initio Folding < 35% Structure refinement Final Structure Structure selection

Baker and Sali (2000)

EVA Evaluation of Automatic protein structure prediction [ Burkhard Rost, Andrej Sali, ] CASP Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction 3D - Crunch Very Large Scale Protein Modelling Project Model Accuracy Evaluation

Several web pages for homology modeling COMPOSER – felix.bioccam.ac.uksoft-base.html MODELLER – guitar.rockefeller.edu/modeller/modeller.html WHAT IF – SWISS-MODEL –