Presentation is loading. Please wait.

Presentation is loading. Please wait.

Alejandro Giorgetti 3rd Permanent School in Bioinformatics Madrid 2005 Protein Structure Modeling Alejandro Giorgetti.

Similar presentations


Presentation on theme: "Alejandro Giorgetti 3rd Permanent School in Bioinformatics Madrid 2005 Protein Structure Modeling Alejandro Giorgetti."— Presentation transcript:

1 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it 3rd Permanent School in Bioinformatics Madrid 2005 Protein Structure Modeling Alejandro Giorgetti

2 alejandro.giorgetti@uniroma1.it The number of different protein folds is limited: [ last update: Oct 2001 ] New Folds Known Folds Public Database Holdings:

3 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Fold recognition Principle: Find a compatible fold >Target Sequence XY MSTLYEKLGGTTAVDLAVAAVA GAPAHKRDVLNQ Build model of target protein based on each template structure Rank models according to SCORE or ENERGY Profile methods Threading M A T E A F T S G Q  Fold =f(environment)  Local Secondary Structure  Solvent Accessibility  Degree of burial of polar / apolar Fragment based methods: New fold prediction

4 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Some remarks concerning fold recognition:  Capable of detecting quite remote sequence-structure matches.  Sensitivity depends on the size of the protein and its secondary structure content.  The two most versatile enzymatic functions (hydrolases and o- glycosyl-glucosidases) are associated with seven folds each.  Better for detecting: all-α > αβ > all-β

5 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Homology modeling  Comparative protein modeling Idea: Proteins evolving from a common ancestor maintained similar core 3D structures.  Known structure/s is/are used as a template to model an unknown structure with known sequence.  Both of them should be related by evolution.  First applied in late 1970’s by Tom Blundell

6 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it 10 % 30 % 50 % 70 % 90 % Drug design? Biochemistry? Molecular Biology? [ Chothia & Lesk (1986) ] Evolution of protein structure families X-ray cristallography: MR

7 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Template(s) selection Sequence Alignment Structure Modeling Structure Evaluation Final Structural Models Comparative Modeling Known Structures (templates) Target sequence >hTEII MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTV LNLEPLDEDLFRGRHYWVPAKRLFGGQIVGQ ALVAAAKSVSEDVHVHSLHCYFVRAGDPKLP

8 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Template(s) selection Sequence Alignment Structure Modeling Structure Evaluation Final Structural Models Target sequence Protein Data Bank PDB http://www.pdb.org http://www.pdb.org Database of templates Separate into single chains Remove bad structures (models) Create BLAST database Comparative Modeling Known Structures (templates)

9 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Known Structures (templates) Sequence Alignment Structure Modeling Structure Evaluation Final Structural Models Target sequence  Sequence Similarity / Fold recognition  Structure quality (resolution, experimental method)  Experimental conditions (ligands and cofactors) Comparative Modeling Template(s) selection

10 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Known Structures (templates) Template(s) selection Structure Modeling Structure Evaluation Final Structural Models Target sequence  Key step in homology modeling  Global alignment is required  Small error in alignment can lead to big error in model  Multiple alignments are better than pairwise alignments  Do we know something else? Experiments? Comparative Modeling Sequence Alignment

11 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Known Structures (templates) Template(s) selection Structure Evaluation Final Structural Models Target sequence  Template based fragment Assembly (SwissMod).  Satisfaction of Spatial Restraints: MODELLER Comparative Modeling Sequence Alignment Structure Modeling

12 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Known Structures (templates) Template(s) selection Sequence Alignment Structure Modeling Final Structural Models Target sequence  Errors in template selection or alignment result in bad models  Iterative cycles of alignment, modeling and evaluation Comparative Modeling Structure Evaluation

13 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it [ http://www.expasy.org/spdbv/ ] I. Template based fragment assembly (SwissModel)

14 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it SwissPdb downloading: a)Read and Accept licence b)Download: SwissPdb viewer v3.7sp5(linux) Installation: a)gunzip spdbv37sp5-Linux.tar.gz b)tar –xvf spdbv37sp5-Linux.tar c)cd SPDBV_Distribution and do:./install.sh d)Local installation :..../guestxx/ e)Run: /guestxx/SPDBV/bin/spdbv SwissModel submission: a)Search template: http://www.expasy.org/swissmod/SWISS- MODEL.html Interactive tools: Search the template...(paste sequence)http://www.expasy.org/swissmod/SWISS- MODEL.html b)Model request submission: Save project (SwissPdb Viewer); c)Swiss- Model web page: Modelling Requests – Optimise mode. d)Fill the form and Upload your project asking for Short mode output. Day activities. Morning

15 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it  Corresponds to the most stable regions.  Highest sequence conservation and fewer gaps.  In general: secondary structures elements. a) Build conserved core framework (Structurally conserved regions -SCRs) I. Template based fragment assembly [ http://www.expasy.org/spdbv/ ]

16 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it  Least stable or more flexible regions.  Highest level of gapping  Lowest sequence conservation  Loops and turns  Loop-Database  “ab-initio” rebuilding of loops (Monte Carlo, molecular dynamics, genetic algorithms, etc.) b) Loop modeling (Structural variable regions - SVRs) and backbone completion I. Template based fragment assembly [ http://www.expasy.org/spdbv/ ]

17 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it c) Side Chain placement  Find the most probable side chain conformation, using homologues structures back-bone dependent rotamer libraries energetic and packing criteria I. Template based fragment assembly [ http://www.expasy.org/spdbv/ ]

18 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it  modeling will produce unfavorable contacts and bonds  idealization of local bond and angle geometry  extensive energy minimization will move coordinates away  keep it to a minimum  SwissModel is using GROMOS 96 force field d) Energy minimization I. Template based fragment assembly

19 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it  Find the most probable structure given its alignment  Satisfy spatial restraints derived from the alignment.  Uses probability density functions.  Minimizes violations on restraints. Comparative protein modeling by satisfaction of spatial restraints. A. Šali and T.L. Blundell. J. Mol. Biol. 234, 779-815 II. Modeling by Satisfaction of Spatial restraints

20 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Model Evaluation ? Topics:  correct fold  model coverage (%)  C  - deviation (rmsd)  alignment accuracy (%)  side chain placement  Structure Analysis and Verification Server : http://nihserver.mbi.ucla.edu/SAVS/

21 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it EVA Evaluation of Automatic protein structure prediction [ Burkhard Rost, Andrej Sali, http://maple.bioc.columbia.edu/eva ]http://maple.bioc.columbia.edu/eva CASP Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction http://PredictionCenter.llnl.gov/casp6 3D - Crunch Very Large Scale Protein Modelling Project http://www.expasy.org/swissmod/SM_LikelyPrecision.html Model Accuracy Evaluation

22 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Protein Structure Resources PDBhttp://www.pdb.org PDB – Protein Data Bank of experimentally solved structures (RCSB)http://www.pdb.org CATHhttp://www.biochem.ucl.ac.uk/bsm/cathhttp://www.biochem.ucl.ac.uk/bsm/cath Hierarchical classification of protein domain structures SCOPhttp://scop.mrc-lmb.cam.ac.uk/scop Alexey Murzin’s Structural Classification of proteinshttp://scop.mrc-lmb.cam.ac.uk/scop DALIhttp://www2.ebi.ac.uk/dali Lisa Holm and Chris Sander’s protein structure comparison serverhttp://www2.ebi.ac.uk/dali SS-Prediction and Fold Recognition PHDhttp://cubic.bioc.columbia.edu/predictproteinhttp://cubic.bioc.columbia.edu/predictprotein Burkhard Rost’s Secondary Structure and Solvent Accessibility Prediction Server PSIPRED http://bioinf.cs.ucl.ac.uk/psipred/http://bioinf.cs.ucl.ac.uk/psipred/ L.J McGuffin, K Bryson & David T. Jones Secndary struture prediction Server 3DPSSMhttp://www.sbg.bio.ic.ac.uk/~3dpss Fold Recognition Server using 1D and 3D Sequence Profiles coupled.http://www.sbg.bio.ic.ac.uk/~3dpss THREADER: http://bioinf.cs.ucl.ac.uk/threader/threader.html http://bioinf.cs.ucl.ac.uk/threader/threader.html David T. Jones threading program

23 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it UCL, Janet Thornton & Christine Orengo Class (C), Architecture(A), Topology(T), Homologous superfamily (H) Protein Structure Classification CATH - Protein Structure Classification [ http://www.biochem.ucl.ac.uk/bsm/cath_new/ ] SCOP - Structural Classification of Proteins MRC Cambridge (UK), Alexey Murzin, Brenner S. E., Hubbard T., Chothia C. created by manual inspection comprehensive description of the structural and evolutionary relationships [ http://scop.mrc-lmb.cam.ac.uk/scop/ ]

24 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Class(C) derived from secondary structure content is assigned automatically Architecture(A) describes the gross orientation of secondary structures, independent of connectivity. Topology(T) clusters structures according to their topological connections and numbers of secondary structures Homologous superfamily (H)

25 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it Protein Homology Modeling Resources SWISS MODEL: http://www.expasy.org/swissmod/SWISS-MODEL.html http://www.expasy.org/swissmod/SWISS-MODEL.html Deep View - SPDBV: homepage: http://www.expasy.ch/spdbvhttp://www.expasy.ch/spdbv Tutorials http://www.expasy.org/spdbv/text/tutorial.htm http://www.expasy.org/spdbv/text/tutorial.htm WhatIf http://www.cmbi.kun.nl:1100/ Gert Vriend’s protein structure modeling analysis program WhatIf Modeller: http://guitar.rockefeller.edu/modellerhttp://guitar.rockefeller.edu/modeller Andrej Sali's homology protein structure modelling by satisfaction of spatial restraints ROBETTA: http://robetta.bakerlab.org/ http://robetta.bakerlab.org/ Full-chain Protein Structure Prediction Server Programs and www servers very useful in Comparative modeling: http://salilab.org/tools/http://salilab.org/tools/

26 Alejandro Giorgetti alejandro.giorgetti@uniroma1.it


Download ppt "Alejandro Giorgetti 3rd Permanent School in Bioinformatics Madrid 2005 Protein Structure Modeling Alejandro Giorgetti."

Similar presentations


Ads by Google