EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis.

EBI is an Outstation of the European Molecular Biology Laboratory. Protein Database in Europe Gaurav Sahni, Ph.D. Deposition, Validation, Search and Analysis Services

worldwide Protein Data Bank (wwPDB): Consists of four sites: RCSB (USA), PDBj (Japan), BMRB (USA) and PDBe. It is the single repository of macromolecular structures. The archive started in 1971 and now holds ~61,000 entries, adding ~200 new entries per week. Structures are deposited by experimentalists and the contents are freely available. The archive format is flat files with a fixed line format, although an improved flat-file format (mmCIF) and XML are also available.
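To illustrate what a "fixed line format" means in practice, here is a minimal sketch of reading one ATOM record by column position. The column ranges follow the published PDB format for ATOM records; the function name `parse_atom_record` and the sample coordinates are made up for illustration and do not come from the slides.

```python
def parse_atom_record(line: str) -> dict:
    """Parse one ATOM/HETATM line by fixed column positions (PDB flat-file format)."""
    line = line.ljust(80)  # pad so that slicing never runs off the end
    return {
        "record": line[0:6].strip(),      # columns 1-6: record name
        "serial": int(line[6:11]),        # columns 7-11: atom serial number
        "atom_name": line[12:16].strip(), # columns 13-16: atom name
        "res_name": line[17:20].strip(),  # columns 18-20: residue name
        "chain_id": line[21].strip(),     # column 22: chain identifier
        "res_seq": int(line[22:26]),      # columns 23-26: residue sequence number
        "x": float(line[30:38]),          # columns 31-38: orthogonal x (angstroms)
        "y": float(line[38:46]),          # columns 39-46: orthogonal y
        "z": float(line[46:54]),          # columns 47-54: orthogonal z
    }


# Illustrative record assembled field by field so the column widths are explicit.
example = "".join([
    "ATOM  ",    # columns 1-6
    "    1",     # columns 7-11
    " ",         # column 12 (blank)
    " N  ",      # columns 13-16
    " ",         # column 17 (alternate location indicator)
    "MET",       # columns 18-20
    " ",         # column 21 (blank)
    "A",         # column 22
    "   1",      # columns 23-26
    "    ",      # columns 27-30 (insertion code and padding)
    "  27.340",  # columns 31-38
    "  24.430",  # columns 39-46
    "   2.614",  # columns 47-54
])

atom = parse_atom_record(example)
print(atom["res_name"], atom["chain_id"], atom["x"], atom["y"], atom["z"])
```

Because every field is found by position rather than by a delimiter, a value that outgrows its column width has nowhere to go, which is one reason the format is hard to extend.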

Protein Databank in Europe (PDBe) group: Is one of the four sites around the world where 3D structures may be deposited. Provides a stable and clean repository of macromolecular structure data. Has services that allow users to access, search and retrieve structural data from a single web access point.

PDBe Tasks: Deposition and Validation; Database design and implementation; Retrieve data; Analysis tools & Services

Depositions and Curation: Deposition via AutoDep4. Closely collaborate with the other wwPDB members for a single unified archive. Depositions via EMDEP; depositions started June 2002.

Validation of Structures
- Authentication of source: that the protein is from human and not rabbit, for example!
- Authentication of structure: comparison of structure against raw data; geometry and stereochemistry; results are provided back to the depositor.
- Validation of correct methodology used: whether X-ray, NMR or EM.
- Conformity to standards: follows PDB format specifications.
- Error checks:
  - Consistency checks, to identify simple typos: Homo sapiens and not Homo sapien (single human?).
  - Outlier detection, to identify suspect records.
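The consistency check mentioned above (catching "Homo sapien" for "Homo sapiens") can be sketched with fuzzy matching against a controlled vocabulary. This is only an illustration using Python's standard `difflib`; the slides do not say how PDBe implements the check, and the list `KNOWN_ORGANISMS`, the function `check_organism` and the 0.8 cutoff are assumptions.

```python
import difflib

# Hypothetical controlled vocabulary; a real check would use a full taxonomy.
KNOWN_ORGANISMS = ["Homo sapiens", "Oryctolagus cuniculus", "Mus musculus"]


def check_organism(name, known=KNOWN_ORGANISMS):
    """Classify a deposited organism name as ok, a likely typo, or unknown."""
    if name in known:
        return ("ok", name)
    # Look for a single near match; the 0.8 cutoff is an arbitrary choice.
    matches = difflib.get_close_matches(name, known, n=1, cutoff=0.8)
    if matches:
        return ("suspect_typo", matches[0])
    return ("unknown", None)


for candidate in ["Homo sapiens", "Homo sapien", "Escherichia coli"]:
    print(candidate, "->", check_organism(candidate))
```

A name flagged as `suspect_typo` would be returned to the depositor or a curator with the suggested correction rather than changed silently.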

PDBe Tasks: Deposition site; Database design and implementation; Retrieve data; Analysis tools & Services

Disadvantages of Flat files: Macromolecular structures are very complex. The existing PDB format cannot fully describe some existing structures. The format is not readily extensible, to cope, for example, with structural genomics data. The historical archive is non-uniform and poorly populated. Search and retrieval of flat files is difficult and/or inaccurate.

[Figure: PDBe Relational Database. Labels: Uniform Data; Improved Query Functionality; Time; Effort; Usefulness; Usage; Crystallographers; Biologists; Programmers; Bioinformaticians.]

PDBe Tasks: Deposition site; Database design and implementation; Retrieve data; Analysis tools & Services

Some Implementation Issues
- The PDBe database is large and complex:
  - ~61,000 PDB entries
  - Cross-referenced against SwissProt, PubMed etc.
- Making data accessible without adding complexity.
- Tools for different categories of end-user:
  - Simple: biobar
  - Intermediate: PDBelite
  - Advanced: PDBepro
  - New: PDBeView

biobar: A toolbar search application for Mozilla/Netscape or Firefox browsers. Simple and quick retrieval of data from PDBe and 45 other databases.

PDBelite: A simple form-based query system to search the PDBe databases.

PDBelite Search Results

Features of Search Interface
Strengths:
- simple, easy-to-use form
- allows multiple search fields to be combined
- relatively fast, despite performing quite complex SQL queries
Weaknesses:
- does not expose the power of a relational database
- limited logical operators between search fields:
  - "name" AND "title" AND "keyword"
  - "name" OR "title" OR "keyword"
  - ( "name" OR "title" ) AND NOT "keyword"
- the search form is defined by the authors of the search system, not the author of a query
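The three field combinations listed above are all straightforward to express in SQL itself, which is the "power of a relational database" that a fixed form hides. The sketch below runs them against an in-memory SQLite table; the table `entry`, its columns and its rows are invented for illustration and are not the PDBe schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE entry (id TEXT, name TEXT, title TEXT, keyword TEXT)")
conn.executemany(
    "INSERT INTO entry VALUES (?, ?, ?, ?)",
    [
        ("e1", "lysozyme", "Structure of lysozyme", "hydrolase"),
        ("e2", "kinase", "Lysozyme-like fold in a kinase", "transferase"),
        ("e3", "lysozyme", "Mutant enzyme", "hydrolase"),
    ],
)


def search(where, params):
    """Run a parameterised query; `where` is a fixed template, never user text."""
    sql = f"SELECT id FROM entry WHERE {where} ORDER BY id"
    return [row[0] for row in conn.execute(sql, params)]


terms = ("%lysozyme%", "%lysozyme%", "%hydrolase%")

# "name" AND "title" AND "keyword"
all_and = search("name LIKE ? AND title LIKE ? AND keyword LIKE ?", terms)
# "name" OR "title" OR "keyword"
any_or = search("name LIKE ? OR title LIKE ? OR keyword LIKE ?", terms)
# ( "name" OR "title" ) AND NOT "keyword"
mixed = search("(name LIKE ? OR title LIKE ?) AND keyword NOT LIKE ?", terms)

print(all_and, any_or, mixed)
```

A static form typically hard-wires one of these templates; letting the user choose the grouping and the operators is what the next tool, PDBepro, adds.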

PDBepro: A Java-based flexible graphical search interface for advanced searching.

Complex searches
- Users have comprehensive control of their query.
- The applet provides a dynamic form, as compared to a static HTML form:
  - choose the fields to be searched
  - specify the relationships between search fields
  - choose the result fields and how results are presented
  - perform "complex" sub-queries, e.g. SSM, FASTA
- PDBepro uses an applet for constructing queries and a server to execute them.
- The user describes their query entirely graphically, including logical operations such as AND, OR and NOT.
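One way to picture a graphically built query is as a tree of AND, OR and NOT nodes that the client assembles and the server turns into SQL. The sketch below is an assumption about how such a design could look, not a description of PDBepro's internals; the classes `Field`, `And`, `Or`, `Not` and the function `to_sql` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass
class Field:
    column: str  # assumed to come from a fixed list of searchable columns
    value: str


@dataclass
class And:
    parts: List["Node"]


@dataclass
class Or:
    parts: List["Node"]


@dataclass
class Not:
    part: "Node"


Node = Union[Field, And, Or, Not]


def to_sql(node: Node) -> Tuple[str, list]:
    """Render a query tree as a parameterised WHERE clause plus its parameters."""
    if isinstance(node, Field):
        return f"{node.column} = ?", [node.value]
    if isinstance(node, Not):
        sql, params = to_sql(node.part)
        return f"NOT ({sql})", params
    joiner = " AND " if isinstance(node, And) else " OR "
    pieces, params = [], []
    for child in node.parts:
        sql, child_params = to_sql(child)
        pieces.append(f"({sql})")  # parenthesise so nesting stays unambiguous
        params.extend(child_params)
    return joiner.join(pieces), params


# ( "name" OR "title" ) AND NOT "keyword", built as a tree instead of a form
query = And([
    Or([Field("name", "lysozyme"), Field("title", "lysozyme")]),
    Not(Field("keyword", "hydrolase")),
])
where_clause, parameters = to_sql(query)
print(where_clause)
print(parameters)
```

Keeping the values as parameters, separate from the SQL text, lets the server execute whatever tree the user draws without pasting user input into the query string.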

PDBeView

Search result: The Atlas page

PDBe Tasks: Deposition site; Database design and implementation; Retrieve data; Analysis tools & Services

AstexViewer™:
- View structures as wireframe, backbone or ribbons
- Built-in sequence viewer
- Calculate and display surfaces
- Various display options: Ramachandran plots, distance matrix, B-factors
- Based on the AstexViewer™ from Astex Technology Limited and modified under licence by the PDBe group

PDBeChem Ligand Database

PDBeSite: What is the environment around alpha-D-mannose and beta-D-mannose?

PDBeSite: What binds ASP ASP HIS LYS?

PDBeSite: How does ATP generally interact with LYS in all structures?

PDBeAnalysis: Assess Quality of a Structure. Ramachandran plot, bond distances, bond angles.
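A Ramachandran plot is a scatter of the backbone torsion angles phi (C(i-1), N, CA, C) and psi (N, CA, C, N(i+1)) for each residue, so the underlying calculation is a dihedral angle between four atoms. The sketch below computes that angle, plus a bond distance, from plain coordinates. It is a generic geometry example, not PDBeAnalysis code, and the function names and test points are illustrative.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def bond_distance(p, q):
    """Euclidean distance between two atoms."""
    d = _sub(p, q)
    return math.sqrt(_dot(d, d))


def dihedral(p0, p1, p2, p3):
    """Torsion angle in degrees about the p1-p2 bond, in the range -180..180."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    b1 = (b1[0] / norm, b1[1] / norm, b1[2] / norm)
    # Remove the components along the central bond, leaving in-plane vectors.
    s0, s2 = _dot(b0, b1), _dot(b2, b1)
    v = (b0[0] - s0 * b1[0], b0[1] - s0 * b1[1], b0[2] - s0 * b1[2])
    w = (b2[0] - s2 * b1[0], b2[1] - s2 * b1[1], b2[2] - s2 * b1[2])
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.degrees(math.atan2(y, x))


# Four points with a right-angle twist about the z axis
print(dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))
print(bond_distance((0, 0, 0), (0, 0, 1)))
```

Computing phi and psi for every residue and comparing them with the commonly populated regions of the plot is what flags residues with unusual backbone geometry.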

PDBePisa: What assembly can my structure have?

PDBeFold: Discover unknown relationships…
- Are there any structures in the PDB that are similar to mine?
- What SCOP and/or CATH family could my structure belong to?
- Can I get some idea about the possible function of my protein based on structural similarity with others?
- Multiple alignment of many of my structures?

ChemSearch: Sub-structure based search of a million chemicals.

PDBeAnalysis/PDBeValidate: Online PDB validation.

PDBeStatus: PDB deposition status search.

PDBe provides…
- Clean biological data
- Integrated data
- A single web access point
- Query interfaces for different users (beginner, occasional or expert)
- Interconnected views of the data relating structure, sequence, text & experimental details

PISA biological assemblies; PDBechem ligand data; Electron Density Visualisation; AstexViewer; PDBePro, PDBelite; Fold matching; Surface Matching; Active sites; Linking to Domain data, eFamily; Sequence Mapping, SIFTS