Computer-Aided Protein Structure Prediction Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial Technology Institute of Microbial.

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

Protein Structure Prediction
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
An Introduction to Bioinformatics Protein Structure Prediction.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Thomas Blicher Center for Biological Sequence Analysis
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Structure Prediction in 1D
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Rising accuracy of protein secondary structure prediction Burkhard Rost
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Predicting Secondary Structure of All-Helical Proteins Using Hidden Markov Support Vector Machines Blaise Gassend, Charles W. O'Donnell, William Thies,
Protein Secondary Structure Prediction Some of the slides are adapted from Dr. Dong Xu’s lecture notes.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Protein Secondary Structure Prediction. Input: protein sequence Output: for each residue its associated Secondary structure (SS): alpha-helix, beta-strand,
Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices Yan Liu Sep 29, 2003.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
1 Computer-aided Drug Discovery G.P.S. Raghava  Annotation of genomes  Searching drug targets  Properties of drug molecules  Protein-chemical interaction.
Secondary structure prediction
Web Servers for Predicting Protein Secondary Structure (Regular and Irregular) Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial.
Protein structure prediction May 26, 2011 HW #8 due today Quiz #3 on Tuesday, May 31 Learning objectives-Understand the biochemical basis of secondary.
Protein Structure Prediction
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Secondary Structure Prediction G P S Raghava.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Meng-Han Yang September 9, 2009 A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Techniques for Protein Sequence Alignment and Database Searching (part2) G P S Raghava Scientist & Head Bioinformatics Centre, Institute of Microbial Technology,
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
3.3b1 Protein Structure Threading (Fold recognition) Boris Steipe University of Toronto (Slides evolved from original material.
Computational Structure Prediction
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Protein Structure Prediction and Protein Homology modeling
Introduction to Bioinformatics II
Protein Structure Prediction
Protein Structure Prediction
Protein Structures.
Protein structure prediction.
Computer-Aided Protein Structure Prediction
Computer-Aided Protein Structure Prediction
Computer-Aided Protein Structure Prediction
Protein structure prediction
Presentation transcript:

Computer-Aided Protein Structure Prediction Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial Technology Institute of Microbial Technology Chandigarh, INDIA Web: Phone: Fax: Protein Sequence + Structure

MNIFEMLRID EGLRLKIYKD TEGYYTIGIG HLLTKSPSLN AAKSELDKAI GRNCNGVITK DEAEKLFNQD VDAAVRGILR NAKLKPVYDS LDAVRRCALI NMVFQMGETG VAGFTNSLRM LQQKRWDEAA VNLAKSRWYN QTPNRAKRVI TTFRTGTWDA YKNL ?

Protein Structure Prediction Experimental Techniques –X-ray Crystallography –NMR Limitations of Current Experimental Techniques –Protein DataBank (PDB) -> 30,000 protein structures –Unique structure 4000 to 5000 only –Non-Redudant (NR) -> 10,00,000 proteins Importance of Structure Prediction –Fill gap between known sequence and structures –Protein Engg. To alter function of a protein –Rational Drug Design World Wide Recognition of ProblemWorld Wide Recognition of Problem –CASP/CAFASP Competition (Olympic 2000) –Most Wanted (TOP 10) –Metaserver for Structure Prediction

Peptide Bond

Dihedral Angles

Ramachandran Plot

Different Levels of Protein Structure

Techniques of Structure Prediction Computer simulation based on energy calculation –Based on physio-chemical principles –Thermodynamic equilibrium with a minimum free energy –Global minimum free energy of protein surface Knowledge Based approaches –Homology Based Approach –Threading Protein Sequence –Hierarchical Methods

Energy Minimization Techniques Energy Minimization based methods in their pure form, make no priori assumptions and attempt to locate global minma. Static Minimization Methods –Classical many potential-potential can be construted –Assume that atoms in protein is in static form –Problems(large number of variables & minima and validity of potentials) Dynamical Minimization Methods –Motions of atoms also considered –Monte Carlo simulation (stochastics in nature, time is not cosider) –Molecular Dynamics (time, quantum mechanical, classical equ.) Limitations –large number of degree of freedom,CPU power not adequate –Interaction potential is not good enough to model

Homology Modelling –Need homologues of known protein structure –Backbone modelling –Side chain modelling –Fail in absence of homology Threading Based Methods –New way of fold recognition –Sequence is tried to fit in known structures –Motif recognition –Loop & Side chain modelling –Fail in absence of known example

Hierarcial Methods Intermidiate structures are predicted, instead of predicting tertiary structure of protein from amino acids sequence Prediction of backbone structure –Secondary structure (helix, sheet,coil) –Beta Turn Prediction –Super-secondary structure Tertiary structure prediction Limitation Accuracy is only % Only three state prediction

Protein Structure Prediction Tertiary Structure Prediction (TSP) Comparative Modelling Energy Minimization Techniques Ab-Initio Prediction (Segment Based) Threading Based Approach Limitations of TSP Difficult to predict in absence of homology Computation requirement too high Fail in absence of known examples Secondary Structure prediction (SSP) An Intermidiate Step in TSP Most Successful in absence of homology Helix (3), Strand (2) and Coil (3) DSSP for structure assignment

Protein Secondary Structure Prediction Existing SSP Methods Statistical Methods (Chou,GOR) Physio-chemical Methods A.I. (Neural Network Approach) Consensus and Multiple Alignment Our Method APSSP of SSP Neural Network Example Based Learnning Multiple Alignment Steps involved in APSSP Blast search against protein sequence (NR) Multiple Alignment (ClustalW) Profile by HMMER, Result by Recogntion: CASP,CAFASP,LiveBench, MetaServer

Protein Secondary Structure Secondary Structure Regular Secondary Structure (  -helices,  - sheets) Irregular Secondary Structure (Tight turns, Random coils, bulges)

Secondary structure prediction No information about tight turns ?

Tight turns TypeNo. of residuesH-bonding  -turn 2 NH(i)-CO(i+1)  -turn 3 CO(i)-NH(i+2)  -turn 4 CO(i)-NH(i+3)  -turn 5 CO(i)-NH(i+4)  -turn 6 CO(i)-NH(i+5)

Prediction of tight turns Prediction of  -turns Prediction of  -turn types Prediction of  -turns Prediction of  -turns Use the tight turns information, mainly  -turns in tertiary structure prediction of bioactive peptides

Definition of  -turn A  -turn is defined by four consecutive residues i, i+1, i+2 and i+3 that do not form a helix and have a C  (i)-C  (i+3) distance less than 7Å and the turn lead to reversal in the protein chain. (Richardson, 1981). The conformation of  -turn is defined in terms of  and  of two central residues, i+1 and i+2 and can be classified into different types on the basis of  and . i i+1 i+2 i+3H-bond D <7Å

Gamma turns The  -turn is the second most characterized and commonly found turn, after the  -turn. A  -turn is defined as 3-residue turn with a hydrogen bond between the Carbonyl oxygen of residue i and the hydrogen of the amide group of residue i+2. There are 2 types of  -turns: classic and inverse.

Existing  -turn prediction methods Residue Hydrophobicities (Rose, 1978) Positional Preference Approach –Chou and Fasman Algorithm (Chou and Fasman, 1974; 1979) –Thornton’s Algorithm (Wilmot and Thornton, 1988) –GORBTURN (Wilmot and Thornton, 1990) –1-4 & 2-3 Correlation Model (Zhang and Chou, 1997) –Sequence Coupled Model (Chou, 1997) Artificial Neural Network –BTPRED (Shepherd et al., 1999) ( ) ) BetatPred: Consensus method for Beta Turn prediction (Kaur and Raghava 2002, Bioinformatics)

BetaTPred2: Prediction of  -turns in proteins from multiple alignment using neural network Two feed-forward back-propagation networks with a single hidden layer are used where the first sequence-structure network is trained with the multiple sequence alignment in the form of PSI-BLAST generated position specific scoring matrices. The initial predictions from the first network and PSIPRED predicted secondary structure are used as input to the second sequence-structure network to refine the predictions obtained from the first net. The final network yields an overall prediction accuracy of 75.5% when tested by seven- fold cross-validation on a set of 426 non-homologous protein chains. The corresponding Qpred., Qobs. and MCC values are 49.8%, 72.3% and 0.43 respectively and are the best among all the previously published  -turn prediction methods. A web server BetaTPred2 ( has been developed based on this approach. Harpreet Kaur and G P S Raghava (2003) Prediction of  -turns in proteins from multiple alignment using neural network. Protein Science 12,

BetaTurns: A web server for prediction of  -turn types (

Gammapred: A server for prediction of  -turns in proteins ( Harpreet Kaur and G P S Raghava (2003) A neural network based method for prediction of  -turns in proteins from multiple sequence alignment. Protein Science 12,

AlphaPred: A web server for prediction of  -turns in proteins ( Harpreet Kaur and G P S Raghava (2003) Prediction of  -turns in proteins using PSI-BLAST profiles and secondary structure information. Proteins.

Contribution of  -turns in tertiary structure prediction of bioactive peptides 3D structures of 77 biologically active peptides have been selected from PDB and other databases such as PSST ( and PRF ( have been selected. The data set has been restricted to those biologically active peptides that consist of only natural amino acids and are linear with length varying between 9-20 residues.

3 models have been studied for each peptide. The first model has been (  =  = 180 o ). The second model is build up by constructed by taking all the peptide residues in the extended conformation assigning the peptide residues the ,  angles of the secondary structure states predicted by PSIPRED. The third model has been constructed with ,  angles corresponding to the secondary states predicted by PSIPRED and  -turns predicted by BetaTPred2. Peptide Extended (  =  = 180 o ). PSIPRED + BetaTPred2 Root Mean Square Deviation has been calculated…….

Averaged backbone root mean deviation before and after energy minimization and dynamics simulations.

Protein Structure Prediction Regular Secondary Structure Prediction (  -helix  -sheet) –APSSP2: Highly accurate method for secondary structure predictionAPSSP2: –Participate in all competitions like EVA, CAFASP and CASP (In top 5 methods) –Combines memory based reasoning ( MBR) and ANN methods Irregular secondary structure prediction methods (Tight turns) –Betatpred: Consensus method for  -turns predictionBetatpred Statistical methods combined Kaur and Raghava (2001) Bioinformatics –Bteval : Benchmarking of  -turns predictionBteval Kaur and Raghava (2002) J. Bioinformatics and Computational Biology, 1:495:504 –BetaTpred2: Highly accurate method for predicting  -turns (ANN, SS, MA)BetaTpred2 Multiple alignment and secondary structure information Kaur and Raghava (2003) Protein Sci 12: –BetaTurns: Prediction of  -turn types in proteinsBetaTurns Evolutionary information Kaur and Raghava (2004) Bioinformatics 20: –AlphaPred: Prediction of  -turns in proteinsAlphaPred Kaur and Raghava (2004) Proteins: Structure, Function, and Genetics 55:83-90 –GammaPred: Prediction of  -turns in proteinsGammaPred Kaur and Raghava (2004) Protein Science; 12:

Protein Structure Prediction BhairPred: Prediction of Supersecondary structure predictionBhairPred –Prediction of Beta Hairpins –Utilize ANN and SVM pattern recognition techniques –Secondary structure and surface accessibility used as input –Manish et al. (2005) Nucleic Acids Research (In press) TBBpred: Prediction of outer membrane proteinsTBBpred: –Prediction of trans membrane beta barrel proteins –Prediction of beta barrel regions –Application of ANN and SVM + Evolutionary information –Natt et al. (2004) Proteins: 56:11-8 ARNHpred: Analysis and prediction side chain, backbone interactionsARNHpred: –Prediction of aromatic NH interactions –Kaur and Raghava (2004) FEBS Letters 564: SARpred: Prediction of surface accessibility (real accessibility)SARpred –Multiple alignment (PSIBLAST) and Secondary structure information –ANN: Two layered network (sequence-structure-structure) –Garg et al., (2005) Proteins (In Press) PepStr: Prediction of tertiary structure of Bioactive peptidesPepStr Performance of SARpred, Pepstr and BhairPred were checked on CASP6 proteins

Thankyou