MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,

Slides:



Advertisements
Similar presentations
Overview of Key ICM Features NIBR - Emeryville July
Advertisements

Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Development of SCWRL4 for improved prediction of protein side-chain conformations In collaboration with Moscow Engineering & Physics Institute © George.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Protein Tertiary Structure Prediction
Structural bioinformatics
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Thomas Blicher Center for Biological Sequence Analysis
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Protein Structure Prediction and Analysis
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
Current Status of Homology Modeling Using MCSG Structures 319 MCSG structures in PDB have over 400,000 sequence homologues. These structures represent.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Protein domains. Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
UniProt Non-redundant Reference Cluster (UniRef) Databases Swiss Institute of Bioinformatics (SIB) European Bioinformatics Institute (EMBL-EBI)
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
HOMOLOGY MODELLING Chris Wilton. Homology Modelling   What is it and why do we need it? principles of modelling, applications available   Using Swiss-Model.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Copyright OpenHelix. No use or reproduction without express written consent1.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Structure prediction: Homology modeling
Bioinformatics how to … use publicly available free tools to predict protein structure by comparative modeling.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Homology modeling with SWISS-MODEL
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Protein Structure Prediction Graham Wood Charlotte Deane.
Protein Homologue Clustering and Molecular Modeling L. Wang.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
HANDS-ON ConSurf! Web-Server: The ConSurf webserver.
Protein Tertiary Structure Prediction Structural Bioinformatics.
HANDS-ON ConSurf! Web-Server: The ConSurf webserver.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
Protein Structure Visualisation
Computational Structure Prediction
Protein dynamics Folding/unfolding dynamics
Methods: The IntFOLD Server
Protein dynamics Folding/unfolding dynamics
Protein Structures.
Homology Modeling.
Protein structure prediction.
Homology modeling in short…
Presentation transcript:

MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia, PA 19111

Dunbrack Lab, FCCC - NIGMS Workshop Agenda Background MolIDE in retrospect MolIDE2 Demonstration Discussion

Dunbrack Lab, FCCC - NIGMS Workshop Background Homology Modelling  What is it and why do we need it? Given a protein without known structure, predict its 3D structure based on its sequence: Search structure databases for homologous sequences Transfer coordinates of known protein onto unknown MQEQLTDFSKVETNLISW-QGSLETVEQMEPWAGSDANSQTEAY MHQQVSDYAKVEHQWLYRVAGTIETLDNMSPANHSDAQTQAA | |..|. |||... |..||.|.| | |||..| | = Identity. = positively related

Dunbrack Lab, FCCC - NIGMS Workshop Background An inconvenient truth: huge gap between known structures and known sequences. Experimentally determined structures (through x-ray crystallography & NMR spectroscopy.) As of 2/24/2009, PDB has 56,066 entries (< 52K protein structures) Decoded protein sequences  As of 2/10/2009, UniProtKB/Swiss-Prot (Release 56.8 ) contains 410,518 sequence entries.  As of 2/10/2009, UniProtKB/TrEMBL(Release 39.8 ) contains 7,157,600 sequence entries

Dunbrack Lab, FCCC - NIGMS Workshop Background Various methods  Swiss-model – fully automated modeling server  Modeller – satisfaction of spatial restraints  Nest – based on artificial evolution Similar steps  Template identification  Sequence alignment  Backbone generation  Side-chain prediction & loop modeling  Structure refinement Homology modeling

Dunbrack Lab, FCCC - NIGMS Workshop In Retrospect MolIDE: A graphical IDE for homology modeling A. Canutescu and R.L. Dunbrack, Bioinformatics, 2005

Dunbrack Lab, FCCC - NIGMS Workshop In Retrospect sequence search MolIDE: open-source, cross-platform multiple-round psiblast alignments secondary structure prediction assisted alignment editing (joint with a template viewer) side chain conformation prediction loop building

Dunbrack Lab, FCCC - NIGMS Workshop In Retrospect MolIDE automatically downloads these large sequence databases (nr or uniref100) and formats them for use with BLAST. MolIDE1.6 (released on July 1, 2008) Easy installer for Windows version One-step updating of PDB and Non-redundant protein sequence databases PSI-BLAST search of non-redundant database can be opened as a sortable table for browsing homologues of query List of templates from PDB includes protein names and species Works with current remediated XML files from the PDB NCBI's non-redundant protein database (nr) can be replaced with Uniprot's Uniref100 database: Wang et al. Nature Protocols, Dec., 2008

Dunbrack Lab, FCCC - NIGMS Workshop In Retrospect Some examples: Structure with ligands Multi-chain protein complex Modeling of biological unit Modeling with multiple templates Structural or functional restraints in modeling … Previous MolIDE CAN NOT deal with protein oligomers and protein complexes

Dunbrack Lab, FCCC - NIGMS Workshop MolIDE2 Identify an appropriate template; One-to-many sequence alignment; Better understanding of Biological Unit (BU). Challenge: Develop a new homology modeling program capable of modeling protein oligomers and protein complexes. Goal:

Dunbrack Lab, FCCC - NIGMS Workshop MolIDE2 Able to model protein oligomers and complexes; Modeling process based on Biological Unit (BU). Identifying structural template based on domain and family information; Integrated database providing protein structural and sequence information; Better organization and representation of template information; User-friendly graphical interface for selecting template; Integration of profile-profile sequence alignment method; Improved graphical editing of sequence alignment; Key features:

Dunbrack Lab, FCCC - NIGMS Workshop MolIDE2 Screenshot

Dunbrack Lab, FCCC - NIGMS Workshop MolIDE2 Operation flowchart

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration 1.Open sequence file 2.Run hmmpfam (generate domain file) 3.Run psiblast (generate query profile) 4.Run psipred (predict secondary structure of query sequence) 5.Open domain file 6.Search PDB for potential template; get profile-profile alignment result 7.Open alignment file 8.Manually modify sequence alignment (optional) 9.Copy backbone structure 10.Run SCWRL for sidechain replacement. 11.Build loops (not implemented yet). A typical modeling process:

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration Sequences and domains

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration Finding template (1)

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration Finding template (2)

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration Editing sequence alignment & generating model

Dunbrack Lab, FCCC - NIGMS Workshop Demonstration Editing sequence alignment & generating model (cont’d)

Dunbrack Lab, FCCC - NIGMS Workshop Discussion Some future work Loop modeling component dealing with space symmetry Involvement of protein-protein interaction information (ProtBuD) Modeling with multiple templates modeling of ligands refinement of models with Rosetta and RosettaDock …

Dunbrack Lab, FCCC - NIGMS Workshop Acknowledge Dr. Adrian Canutescu, Dr. Mark Andrake NIH R01 GM84453 NIH R01 GM73784

Dunbrack Lab, FCCC - NIGMS Workshop Q & A