IV. Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling.

Slides:



Advertisements
Similar presentations
Protein x-ray crystallography
Advertisements

Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Thomas Blicher Center for Biological Sequence Analysis
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Computing and Chemistry 3-41 Athabasca Hall Sept. 16, 2013.
Bioinformatics Ayesha M. Khan Spring 2013.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Computer-Assisted Drug Design (1) i)Random Screening ii)Lead Development and Optimization using Multivariate Statistical Analyses. iii)Lead Generation.
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Biomolecular Nuclear Magnetic Resonance Spectroscopy BASIC CONCEPTS OF NMR How does NMR work? Resonance assignment Structure determination 01/24/05 NMR.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Considerations for Protein Crystallography (BT Chapter 18) 1.Growing crystals Usually require 0.5mm in shortest dimension, except if using Synchrotron.
Biomolecular Nuclear Magnetic Resonance Spectroscopy FROM ASSIGNMENT TO STRUCTURE Sequential resonance assignment strategies NMR data for structure determination.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Altman et al. JACS 2008, Presented By Swati Jain.
Structural proteomics
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Structural proteomics Handouts. Proteomics section from book already assigned.
JG/10-09 NMR for structural biology DNA purification Protein domain from a database Protein structure possible since 1980s, due to 2-dimensional (and 3D.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Atomic structure model
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
CS-ROSETTA Yang Shen et al. Presented by Jonathan Jou.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Conformat ional ENTROPY Sannali M Dittli 14 October 2015.
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Nobel Laureates of X Ray Crystallography
Protein structure prediction.
Presentation transcript:

IV. Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling Threading Prediction of novel folds Protein design

Methods to determine protein structure X-Ray and NMR methods allow to determine the structure of proteins and protein complexes These methods are expensive and difficult –Could take several work months to process one proteins A centralized database (PDB) contains all solved protein structures –XYZ coordinate of atoms within specified precision –~19,000 solved structures

X-ray crystallography and NMR are the two major techniques for determining protein structures Protein isolation Protein Purification Protein Crystallisation X-ray crystallography: Crystal X-ray Phases of diffracted rays Electron density Protein model

Liquid nitrogen is used to freeze the crystal which allows for increased reliability of information gathered from testing. The area detector, which collects the diffracted x-rays once they pass through the crystal, is the black plate located behind the nitrogen stream, (right) sample x-ray diffraction pattern. X-ray crystallography

The phase problem: Isomorphous Replacement: combination of diffraction data from the native crystal with data from other crystals containing the same protein packed in the same way but adding a heavy atom Molecular Replacement: placement of a known relative structure in different positions and orientations, providing approximate phases Multiwavelength Anomalous Dispersion: Measurements of the variation of the intensity distribution in the diffraction pattern over a range of wavelengths Direct Methods: Knowledge of electron density distributions in crystals permits calculation of phases directly from experimental data

Phase determined Model built over it Experimental data: Three dimentional coordinates Relative mobility of atoms Refinement of the model comparing with empirical data Optimised protein structure

Limitations An extremely pure protein sample is needed. The protein sample must form crystals that are relatively large without flaws. Generally the biggest problem. Many proteins aren’t amenable to crystallization at all (i.e., proteins that do their work inside of a cell membrane). X-ray crystallography

Measures of structural quality R-factor is a measure of how well the model reproduces the experimental intensity data, the lower this factor the better the structure. R = 0%There is no experimental error (ideal) R = 60%Atoms placed randomly in the crystal R  20Good structure prediction The free R-factor is an unbiased measure of the agreement between the model and a subset of experimental data withheld during the refinement process

Good protein structures: 1.Are compact as measured by their surface area and packing density 2.Have hydrogen bonds with a reasonable geometry, and with all the hydrogen bonds determined 3.Their backbone conformation angles are confined to the allowed areas of the Sasikharan-Ramakrishnan-Ramachandran diagram

Nuclear Magnetic Resonance Nuclear magnetic resonance (NMR) spectra measure the energy level of the magnetic nuclei in atoms This energy depends on the effect transmitted between atoms affecting the precise frequency of the signal from an atom (chemical shift). This chemical shift can define secondary structures NMR can determine the value of conformational angles Interactions between spatially proximal atoms (< 5Å) can be used by NMR to determine the closeness of atoms in the structure (Nuclear Overhauser effect (NOE)

Peaks correspond to the interaction of pairs of atoms The spectroscopists has first to correlate peaks with amino acids in the sequence (Assign the spectrum) The data generated provide a set of distance constraints and determine the secondary structure and some indications of the tertiary interactions

Solving an NMR structure means producing a model or set of models that manage to satisfy all known NMR distance constraints (generated by the experiment). NMR models are often released in groups of models because the solution to NMR structure determination is much more ambiguous than x-ray. NMR is limited to small, soluble proteins only. Nuclear Magnetic Resonance

NMR models An X-Ray liquid crystal NMR vs. X-ray crystallography

Protein Structures in theory, a protein structure can solved computationally a protein folds into a 3D structure to minimizes its potential energy the problem can be formulated as a search problem for minimum energy –the search space is defined by psi/phi angles of backbone and side-chain rotamers –the search space is enormous even for small proteins! –the number of local minima increases exponentially of the number of residues

Protein Structure Prediction ab initio folding methods –use first principles to computationally fold proteins –not practical (yet) due to its high computational complexity Comparative modeling –Protein threading – make structure prediction through identification of “good” sequence-structure fit –Homology modeling – identification of homologous proteins through sequence alignment; structure prediction through placing residues into “corresponding” positions of homologous structure models

Protein Threading the basic idea –placing a protein sequence onto a structural template “optimally” –assessing how good the structure is energetically key components: –a structural template database –an “energy” function for measuring quality of a placement (alignment) –an algorithm for finding an optimal placement –a capability for assessing the reliability of prediction query sequence MTYKLILNGKTKGETTTEAVDAATAEKVFQYANDNGVDGEWTYTE template set

PROSPECT Predictions actualpredicted actual predicted t49 t68 t57 t70

How and Why Threading? The idea of threading came from the observation that most of the proteins adopt one of a limited number of folds: Just 10 folds account for the 50% of similarities between protein superfamilies Rather than trying to predict the correct structure from the unlimited number of possible structures, the protein structure might have been surely determined before for other proteins In case that our protein shares obvious similarity with other protein with a known 3D structure the folding problem is trivial It is desired, however, that threading might be able to detect structural similarities that are not accompanied by any detectable similarity

1. Library of protein structures (fold library) all known structures representative subset (seq. similarity filters) structural cores with loops removed 2. Binary alignment algorithm with Scoring function contact potential environments Instead of aligning a sequence to a sequence, align strings of descriptors that represent 3D structual features. Usual Dynamic Programming: score matrix relates two amino acids Threading Dynamic Programming: relates amino acids to environments in 3D structure 3. Method for generating models via alignments Algoritmos de threading. General. ALMVWTGH

Threading Algorithms Puntuation function ALMVWTGH Amino acids are in similar environments to those where known structures are found - Solvatation potentials - Contact potentials - Coincidence of predicted and real secondary structures and calculation of accessibilities - Homology matrices obtained from alignment of structures - HMMs

Count pairs of each residue type at different separations Threading algorithms Contact potential Energy of interaction = -KT ln (frequency of interactions) Boltzmann principle d E counts d Jones, 1992; Sippl, 1995

Threading algorithms Sequence profiles + secundary structure Kelley et al.,

threading. Examples

threading Post-processing of the results Combining with additional information

De Juan et al., 2001 threading Post-processing of the results Filtering models

threading Evaluation of methods I) CASP 94, 96, 98, 00, 02 Databases Algorithm Computer evaluation MAKEFGIPAAVAGTVLNVVEAGGWVTTIVSILTAVGSG GLSLLAAAGRESIKAYLKKEI KKGKRAVIAW 1/3 correct fold (ali?) MODEL(S) EVALUATION

PROSPECT Predictions actualpredicted actual predicted t49 t68 t57 t70

Why engineer proteins? 1) Engineered macromolecules could have experimental use as experimental tools, or for development and production of therapeutics 2) During the process of said engineering, new techniques are developed which expand options available to research community as whole 3) By approaching macromolecule as engineer, better understanding of how native molecules function (Doyle, Chem & Bio, 1998)

Ligand Binding – protein flexibility “In this study, we set out to elucidate the cause for the discrepancy in affinity of a range of serine proteinase inhibitors for trypsin variants designed to be structurally equivalent to factor Xa.” (Rauh, J. Mol. Biol., 2004) Def: Ligand Any molecule that binds specifically to a receptor site of another molecule; proteins embedded in the membrane exposed to extracellular fluid.