Swiss-Prot Protein Database Daniel Amoruso December 2, 2004 BI 420.

Slides:



Advertisements
Similar presentations
Protein Structure.
Advertisements

Bioinformatics Ayesha M. Khan Spring 2013.
On line (DNA and amino acid) Sequence Information Lecture 7.
1 Introduction to Bioinformatics Fall Administration  Adi Doron  Nimrod Rubinstein  Dudu Burstein.
BIOINFORMATICS Ency Lee.
GENBANK, SWISSPROT AND OTHERS As Problem Sources for CSE 549 Andriy Tovkach Genetics.
The European Bioinformatics Institute (EBI) Toolbox Julie Pellegrini Introduction to Bioinformatics.
Biology 224 Dr. Tom Peavy Sept 27 & 29 Protein Structure & Analysis.
Intro to Bioinformatics Summary. What did we learn Pairwise alignment – Local and Global Alignments When? How ? Tools : for local blast2seq, for global.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Archives and Information Retrieval
Protein Databases EBI – European Bioinformatics Institute
The Cell, Central Dogma and Human Genome Project.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
The Protein Data Bank (PDB)
Protein databases Henrik Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Proteins and Protein Function Charles Yan Spring 2006.
Class European Resources Protein Focused. Protein Databases EBI – European Bioinformatics Institute
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Chapter 2 Sequence databases A list of the databases’ uniform resource locators (URLs) discussed in this section is in Box 2.1.
UniProt - The Universal Protein Resource
Bioinformatics Lecture 3 BCH 550 Arjumand Warsy. Retrieving Protein Sequences.
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Interrelating Different Types of Genomic Data From Proteome to Secretome: ‘Oming in on the Function.
Pattern databasesPattern databasesPattern databasesPattern databases Gopalan Vivek.
On line (DNA and amino acid) Sequence Information
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
Modeling Biomolecules Using the WWW. Major Classes of Biomolecules n Peptides (Proteins) n Carbohydrates n Nucleic Acids n Lipids.
Bioinformatics for biomedicine
Archives and Information Retrieval
Introduction to databases Tuomas Hätinen. Topics File Formats Databases -Primary structure: UniProt -Tertiary structure: PDB Database integration system.
© Wiley Publishing All Rights Reserved. Protein and Specialized Sequence Databases.
Bsubt.embl complete entry in EMBL format (DNA and Features) bsubt.embl.Z bsubt.fasta complete DNA sequence in Fasta format bsubt.fasta.Z bsubt.con construct.
Secondary Databases Ansuman sahoo Roll: Y Bioinformatics Class Presentation 30 Jan 2013.
Biology 224 Instructor: Tom Peavy Feb 21 & 26, Protein Structure & Analysis.
Biological Databases By : Lim Yun Ping E mail :
Day 2: Protein Sequence Analysis 1.Physico-chemical properties. 2.Cellular localization. 3.Signal peptides. 4.Transmembrane domains. 5.Post-translational.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
1 EMBL Outstation — The European Bioinformatics Institute Added-Value Proteome Databases: SWISS-PROT, TrEMBL, InterPro.
Function preserves sequences
PROTEIN DATABASES. The ideal sequence database for computational analyses and data-mining: I t must be complete with minimal redundancy It must contain.
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Sequencing the World of Possibilities for Energy & Environment MGM workshop. 19 Oct 2010 Information Sources for Genomics Konstantinos Mavrommatis Genome.
EBI is an Outstation of the European Molecular Biology Laboratory. Quaternary Structure.
1 EMBL Outstation — The European Bioinformatics Institute Removing redundancy in SWISS-PROT and TrEMBL.
EMBL – EBI European Bioinformatics Institute UniProt - The Universal Protein Resource Claire O’Donovan.
Bioinformatics and Computational Biology
Computer Storage of Sequences
1 Discussion Practical 1. Features of major databases (PubMed and NCBI Protein Db) 2.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Protein Structure  The structure of proteins can be described at 4 levels – primary, secondary, tertiary and quaternary.  Primary structure  The sequence.
©CMBI 2008 Databases Data must be in a certain format for software to recognize Every database can have its own format but some data elements are essential.
Central hub for biological data UniProtKB/Swiss-Prot is a central hub for biological data: over 120 databases are cross-referenced (EMBL/DDBJ/GenBank,
1 EMBL Outstation — The European Bioinformatics Institute Mus musculus - a model organism in SWISS-PROT.
Protein domain/family db Secondary databases are the fruit of analyses of the sequences found in the primary sequence db Either manually curated (i.e.
1 EMBL Outstation — The European Bioinformatics Institute Large-Scale Characterization of Protein Sequence Data.
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Protein databases Henrik Nielsen
7.3 Translation udent_view0/chapter3/animation__how_translation_work s.html.
UniProt: Universal Protein Resource
There are four levels of structure in proteins
Introduction to Bioinformatics
Carbohydrates, Lipids, Water, Nucleotides
Protein Sequence Analysis - Overview -
Lesson 3 Bioinformatics Laboratory
Introduction to Databases
7.3 Translation Understanding:
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Swiss-Prot Protein Database Daniel Amoruso December 2, 2004 BI 420

What is Swiss-Prot? Annotated sequence database established in 1986 Consists of sequence entries of different line formats Similar format to European Bioinformatics Institute Nucleotide Sequence Database (EMBL)

TrEMBL A computer annotated supplement to Swiss-Prot contains all the translations of EMBL nucleotide sequence entries not yet integrated in Swiss-Prot

Distinguishing Features of Swiss- Prot Annotation Minimal Redundancy Integration with other databases Documentation

Annotation CORE DATA The sequence data The citation information (bibliographical references) The taxonomic data (description of the biological source of the protein)

Annotation- Additional Data Descriptions include: Function(s) of the protein Posttranslational modification(s) such as carbohydrates, phosphorylation, acetylation and GPI-anchor Domains and sites, for example, calcium-binding regions, ATP-binding sites, zinc fingers, homeoboxes, and SH2 and SH3 domains Secondary structure, e.g. alpha helix, beta sheet Quaternary structure, i.g. homodimer, heterotrimer, etc. Similarities to other proteins Disease(s) associated with any number of deficiencies in the protein Sequence conflicts, variants, etc.

Minimal Redundancy Much of data comes from more than one literature report Data condensed and merged to appear more concise and coherent Conflicts in data are listed for each entry

Integration with other databases 50+ databases for cross-reference Nucleic acid sequences, protein tertiary structure, protein 3-D models, etc. Allows Swiss-PROT to play a major role as the focal point for biomolecular interconnectivity

Documentation All files documented and indexed Documentation kept up-to-date

Applications for the Knowledgebase Provides highly organized data and information on a wide variety of proteins Can be used as a starting point for protein research Allows searches to be conducted starting with various search strings Biochemical encyclopedia