Protein domains. Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average.

Slides:



Advertisements
Similar presentations
Protein Structure.
Advertisements

Profile Hidden Markov Models Bioinformatics Fall-2004 Dr Webb Miller and Dr Claude Depamphilis Dhiraj Joshi Department of Computer Science and Engineering.
Chimera. Chimera 1/3 Starting Chimera Open Chimera from desktop (ZDV app) (If there is an update it will take a minute or two) Open a 3D structure by.
©CMBI 2005 Exploring Protein Sequences - Part 2 Part 1: Patterns and Motifs Profiles Hydropathy Plots Transmembrane helices Antigenic Prediction Signal.
Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Today’s menu: -UniProt - SwissProt/TrEMBL -PROSITE -Pfam -Gene Onltology Protein and Function Databases Tutorial 7.
Identifying functional residues of proteins from sequence info Using MSA (multiple sequence alignment) - search for remote homologs using HMMs or profiles.
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
Protein Modules An Introduction to Bioinformatics.
Introduction to Bioinformatics - Tutorial no. 8 Predicting protein structure PSI-BLAST.
PREDICTION OF PROTEIN FEATURES Beyond protein structure (TM, signal/target peptides, coiled coils, conservation…)
Introduction to Bioinformatics - Tutorial no. 8 Protein Prediction: - PROSITE - Pfam - SCOP - TOPITS - genThreader.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Pattern databasesPattern databasesPattern databasesPattern databases Gopalan Vivek.
© Wiley Publishing All Rights Reserved. Searching Sequence Databases.
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
Protein Sequence, Structure, and Function Lab Gustavo Caetano - Anolles 1 PowerPoint by Casey Hanson Protein Sequence, Structure, and Function | Gustavo.
Protein Bioinformatics Course
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
The Pfam and MEROPS databases EMBO course 2004 Robert Finn
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
1 Enter the following Micro-RNA sequence into the box Run MFold and look at the results MFold Using MFold to predict RNA secondary structure
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
NIGMS Protein Structure Initiative: Target Selection Workshop ADDA and remote homologue detection Liisa Holm Institute of Biotechnology University of Helsinki.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Protein Data Bank: An Introduction Learning to Use the RCSB PDB Portal.
Copyright OpenHelix. No use or reproduction without express written consent1.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Domain Database
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Exercises Pairwise alignment Homology search (BLAST) Multiple alignment (CLUSTAL W) Iterative Profile Search: Profile Search –Pfam –Prosite –PSI-BLAST.
Guidelines for sequence reports. Outline Summary Results & Discussion –Sequence identification –Function assignment –Fold assignment –Identification of.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.
Summer Bioinformatics Workshop 2008 BLAST Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester Center
EBI is an Outstation of the European Molecular Biology Laboratory. A web based integrated search service to understand ligand binding and secondary structure.
DNA / protein sequence analysis 第九組成員: 吳宇軒 侯卜夫 朱子豪 王俊偉
Protein Sequence, Structure, and Function Lab Gustavo Caetano - Anolles Protein Sequence, Structure, and Function Lab v1 | Gustavo Caetano - Anolles 1.
A New Interface to GeneKeyDB Methods for analyzing relationships among proteins based on shared motifs Chris Symons & Xinxia Peng.
Homology 3D modeling and effect of mutations Miguel Andrade Faculty of Biology, Johannes Gutenberg University Institute of Molecular Biology Mainz, Germany.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Homology 3D modeling Miguel Andrade Mainz, Germany Faculty of Biology,
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Take a REST from manual searching: PDBe, programmatically
Demo: Protein Information Resource
Secondary structure prediction
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Homology 3D modeling and effect of mutations
PIR: Protein Information Resource
Protein Bioinformatics Course
Protein Sequence Analysis - Overview -
BLAST.
Protein Sequence Analysis - Overview -
The future of protein secondary structure prediction accuracy
Basic Local Alignment Search Tool
Protein structure prediction.
Protein domains Miguel Andrade Mainz, Germany Faculty of Biology,
Examining repeats with databases
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Protein domains

Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average 300 aa) Introduction

Protein domains are structural units (average 160 aa) that share: Function Folding Evolution Proteins normally are multidomain (average 300 aa) Introduction

Why to search for domains: Protein structural determination methods such as X-ray crystallography and NMR have size limitations that limit their use. Multiple sequence alignment at the domain level can result in the detection of homologous sequences that are more difficult to detect using a complete chain sequence. Methods used to gain an insight into the structure and function of a protein work best at the domain level. Domains

Domain databases SMART Schultz et al (1998) PNAS … Letunic et al (2014) Nucleic Acids Research Peer Borkhttp://smart.embl.de/ Manual definition of domain (bibliography) Generate profile from instances of domain Search for remote homologs (HMMer) Include them in profile Iterate until convergence

Domain databases

SMART

Domain databases SMART Extra features: Signal-peptide, low complexity, TM, coiled coils

Domain databases SMART

Domain databases SMART

Domain databases PFAM Erik Sonnhammer/Ewan Birney/Alex Bateman Sonnhammer et al (1997) Proteins... Finn et al (2014) Nucleic Acids Research

Domain databases PFAM

Domain databases PFAM Wikipedia rules!

Domain databases PFAM

Domain databases CDD Marchler-Bauer et al (2015) Nucleic Acids Res Stephen Bryant

Domain databases CDD

Domain databases SMART PFAM CDD SORLA/SORL1 from Homo sapiens

Exercise 1/3 Let’s see whether human myosin X (UniProt id Q9HD67) or its homologs have a solved structure. Go to NCBI’s BLAST page: Click on the “protein blast” link. Type the UniProt id of the protein sequence “Q9HD67” in the Entry Query Sequence window. In the Choose Search Set you can select the database to search against: select as Database option "Protein Data Bank". Examine a UniProt Entry and find related PDBs

Exercise 1/3 Examine a UniProt Entry and find related PDBs

Exercise 1/3 Hit the BLAST button at the bottom of the page. Considering that your query was a human myosin X, can you interpret the first two hits? Which part of your query was matched? Which protein was hit in the database? What about the 3 rd hit? Examine a UniProt Entry and find related PDBs

Exercise 2/3 Let’s look at the domains predicted for human myosin X. Go to PFAM: Select the option VIEW A SEQUENCE Type in the window the UniProt id of the protein sequence “Q9HD67” and hit the Go button. Compare the positions of the domains predicted with the ranges of the first three BLAST matches in PDB from the previous exercise. Which domains where matched in the human myosin X by each of those hits? Analyse domain predictions with PFAM

Exercise 3/3 Open the structure of the 2 nd hit (3PZD) in Chimera Now colour the fragments corresponding to the PFAM domains MyTH4 (in yellow) and FERM_M (in purple). How many more domains can you identify visually? Chain B in this structure is a small peptide. Which part of the human myosin X is interacting with this peptide in relation to the domains you have coloured? Examine domains in Chimera