Protein Structure Prediction and Protein Homology modeling

Slides:



Advertisements
Similar presentations
Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Advertisements

Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Thomas Blicher Center for Biological Sequence Analysis
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Computational Chemistry. Overview What is Computational Chemistry? How does it work? Why is it useful? What are its limits? Types of Computational Chemistry.
Protein modelling ● Protein structure is the key to understanding protein function ● Protein structure ● Topics in modelling and computational methods.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
8/30/2015 5:26 PM Homology modeling Dinesh Gupta ICGEB, New Delhi.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Conformational Entropy Entropy is an essential component in ΔG and must be considered in order to model many chemical processes, including protein folding,
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Altman et al. JACS 2008, Presented By Swati Jain.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Structure prediction: Ab-initio Lecture 9 Structural Bioinformatics Dr. Avraham Samson Let’s think!
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Developing a Force Field Molecular Mechanics. Experimental One Dimensional PES Quantum mechanics tells us that vibrational energy levels are quantized,
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Molecular modelling José R. Valverde CNB/CSIC © José R. Valverde, 2014 CC-BY-NC-SA.
PROTEIN MODELLING Presented by Sadhana S.
Protein Structure Visualisation
Computational Structure Prediction
Protein dynamics Folding/unfolding dynamics
Protein dynamics Folding/unfolding dynamics
Computational Analysis
Protein Structure Prediction
Protein Structure Prediction
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Protein Homology Modelling
Protein structure prediction
Presentation transcript:

Protein Structure Prediction and Protein Homology modeling Dinesh Gupta ICGEB, New Delhi 9/19/2018 8:51 PM

Why protein structures Identifying active and binding sites Determine protein’s mechanism (catalysis & interactions) Searching ligands for a binding site Understanding the molecular basis of diseases Designing mutants Drug design Mechanism - type 1 and 2 KdarinimDiseases - myosin 6/7- deafnessPredix Pharmaceuticals Holdings - GPCR ... Drugs Latzhiimer

Why protein structure prediction? There is a discrepancy in number of protein sequences and number of experimentally determined structures. Release 2017_01 of 18th-Jan-17 of UniProtKB/Swiss- Prot contains 5,53,474 sequence entries (www.uniprot.org) whereas only 40094 distinct protein structures in PDB database (www.rcsb.org, 24th-Jan-17). Experimental determination: slow, expensive- failed in certain cases! Hence methods which can reasonably predict protein structures are important. 9/19/2018 8:51 PM

9/19/2018 8:51 PM

Protein structure prediction Methods: Homology (comparative) modeling Threading Ab-initio 9/19/2018 8:51 PM

Protein Homology modeling Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template. Basis: proteins with similar sequences are likely to assume same folding Certain proteins with as low as 25% similarity have been observed to assume same 3D structure 9/19/2018 8:51 PM

The accuracy of modeling is proportional to the similarity in primary sequences 9/19/2018 8:51 PM

Homology modeling consists of the following steps Template recognition Database search using BLAST Sequence alignment Pair-wise Multiple alignment Model generation Model validation Statistical potentials Physics-based energy calculations

Template recognition The sequence of the target protein is first scanned against the sequences of known structures using pair wise sequence alignment program like BLAST or FASTA. However more sensitive methods based on multiple sequence alignment like PSI-BLAST are preferred for remote homologs. Selection of best template is decided by several factors like, high sequence identity (> 30%), better coverage of the aligned region, quality of the template structure etc..

Model generation Once an initial target-template alignment is built, a variety of methods are available to construct a 3D model. The most commonly used one is distance geometry and optimization techniques to satisfy spatial restraints obtained from template-target alignment (Modeller). Spatial restraints are extracted from two sources: first, homology-derived restraints on the distances and dihedral angles in the target sequence are extracted from its alignment with the template structures. Second, stereo chemical restraints such as bond length and bond angle preferences are obtained from the molecular mechanics force field of CHARMM-22

Model validation In general the homology model is susceptible to error and needs to be assessed for quality by either statistical potentials or physics-based energy calculations. Energy-based calculations checks if the bond-length and bond-angles are with normal ranges, and if there are lot of bumps in the model which corresponds to high vanderwaals energy (PROCHECK). Statistical potentials are empirical methods based on observed residue-residue contact frequencies among proteins of known structure in the PDB. They assign a probability or energy score to each possible pairwise interaction between amino acids and combine these pair wise interaction scores into a single score for the entire model (Prosa and DOPE).

Software for homology molecular modeling Molecular graphics: PyMol, RasMol, etc. Freeware: available for all OS Downloadable Modeller (Sali, 1998) https://salilab.org/modeller/ DeepView (SwissPDB viewer) http://spdbv.vital-it.ch YASARA (WHATIF) http://yasara.org Web based: SWISS MODEL server (https://swissmodel.expasy.org/interactive) CPH model server (http://www.cbs.dtu.dk/services/CPHmodels) Phyre2 (http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index) 9/19/2018 8:51 PM

Common errors in homology modeling Errors in modeling side chains Errors in modeling sequence segments without a template (e.g., variable and/or loop regions) Errors due to selecting the wrong template for modeling 9/19/2018

9/19/2018 8:51 PM Petry, D. and Honig, B. Molecular Cell, Vol. 20, 811–819, December 22, 2005

9/19/2018 8:51 PM

9/19/2018 8:51 PM

9/19/2018

9/19/2018

9/19/2018

9/19/2018

9/19/2018

Databases of protein models 9/19/2018

9/19/2018

9/19/2018

Homology modeling summary Homology modeling methods can efficiently predict structure if the target protein has similarity to known folds The greater the sequence identity to template, the better is the quality of the model Homology modeling can yield high accuracy and high resolution atomic models With increasing numbers of unique structures and solved experimentally homology modeling methods should become more powerful 9/19/2018

Protein structure prediction Methods: Homology (comparative) modelling Threading Ab-initio 9/19/2018 8:51 PM

Threading Structure prediction method that picks up where homology modelling leaves off. Homology Modeling: Align sequence to sequence Threading: Align sequence to structure Threading is based on the fact that only limited number of protein folds are possible in nature. Threading recognizes folds in proteins having no similarity to known proteins structures Very approximate models Check by forcing a sequence of structure into known folds checking the packing of aa residues, including sides chains, in each fold. Scoring functions. 9/19/2018 8:51 PM

9/19/2018 8:51 PM

Threading software I-TASSER http://zhanglab.ccmb.med.umich.edu/I-TASSER/ RaptorX http://raptorx.uchicago.edu 9/19/2018 8:51 PM

9/19/2018 8:51 PM

Protein structure prediction Methods: Homology (comparative) modelling Threading Ab-initio 9/19/2018 8:51 PM

Ab initio structure prediction Still experimental Less accurate ROSETTA (David Baker) http://robetta.bakerlab.org/ 9/19/2018 8:51 PM

Energy minimization (Molecular Mechanics, MM) Heart of any protein modeling technique Energy minimization techniques try to minimize energies of molecules with the help of equations also called force fields which represent energies of molecular systems Assumption: Molecular systems try to achieve lowest energies in equilibrium. MM could be used to calculate large scale conformational changes over long periods of time, but currently computationally infeasible. 9/19/2018 8:51 PM

What do Force Fields represent? How do atoms stretch, vibrate, rotate, etc.? Represent the constraints on atomic motion (e.g. van der Waals, electrostatic, bonds, etc.) Must also represent solvation effects etc. Quantum solutions exist, but are too complex to calculate for such large systems Empirical (approximate) energy functions must be used. No single best function exists. 9/19/2018 8:51 PM