Network Services for Biologists in the Genome Era The Work of the European Bioinformatics Institute.

Slides:



Advertisements
Similar presentations
Genome Annotation: A Protein-centric Perspective.
Advertisements

European Bioinformatic Institute.
EMBL-EBI Integration of Sequence and 3D structure Databases.
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
On line (DNA and amino acid) Sequence Information Lecture 7.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
The European Molecular Biology Laboratory (EMBL) is supported by sixteen countries. Consists of the main Laboratory in Heidelberg (Germany), Outstations.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Global Alignment and Collaboration Jo
11 Decembre 2000V. Breton Milan WP6 DataGRID meeting Biological applications in testbed 0 Evaluate GRID added value for handling biological data –What.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Archives and Information Retrieval
Asynchronous eLearning overcomes geographical and temporal constraints transforming learning into a process that can occur at the independently determined.
Data Mining in Ensembl with EnsMart. 2 of 24 All genes from a candidate region Genes with a particular protein domain Members of a protein family Genes.
Class European Resources Protein Focused. Protein Databases EBI – European Bioinformatics Institute
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Luxembourg, Sep 2001 Pedro Fernandes Inst. Gulbenkian de Ciência, Oeiras, Portugal EMBER A European Multimedia Bioinformatics Educational Resource.
Bio/CS 251 Introduction to Bioinformatics. Class Web Site This site will contain all important documents.
EMBL-EBI and Bioinformatics Steven Newhouse, Head of Technical Services, EMBL-EBI.
UniProt - The Universal Protein Resource
Data retrieval BioMart Data sets on ftp site MySQL queries of databases Perl API access to databases Export View.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Welcome to EMBL-EBI Dr Laura Emery. Before we start… Stand up How experienced are you in bioinformatics? Get to know each other by arranging yourselves.
An Introduction to Bioinformatics Molecular Biology Databases.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
The BioBox Initiative: Bio-ClusterGrid Gilbert Thomas Associate Engineer Sun APSTC – Asia Pacific Science & Technology Center.
Bioinformatics.
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
European Life Sciences Infrastructure for Biological Information ELIXIR
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Bioinformatics for biomedicine
Introduction to databases Tuomas Hätinen. Topics File Formats Databases -Primary structure: UniProt -Tertiary structure: PDB Database integration system.
Rahul Raman, Ram Sasisekharan Bioinformatics Core Massachusetts Institute of Technology Glue Grants Bioinformatics Meeting April 22-23, 2004 San Diego,
Biological Databases By : Lim Yun Ping E mail :
UniProt Non-redundant Reference Cluster (UniRef) Databases Swiss Institute of Bioinformatics (SIB) European Bioinformatics Institute (EMBL-EBI)

جلسه اول بیو انفورماتیک گردآوری:مسعود رسول آبادی
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
EMBL-EBI EMBL-EBI EMBL-EBI What is the EBI's particular niche? Provides Core Biomolecular Resources in Europe –Nucleotide; genome, protein sequences,
EMBRACE An example of Grid Integration (I): The EMBRACE project Jean SALZEMANN CNRS/IN2P3.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Protein and RNA Families
Mining Biological Data. Protein Enzymatic ProteinsTransport ProteinsRegulatory Proteins Storage ProteinsHormonal ProteinsReceptor Proteins.
EMBL-EBI Integration of Sequence and 3D structure Databases “The key to Bioinformatics is integration, integration, integration” Bioinformatics: Bringing.
Bio-Linux 3.0 An integrated bioinformatics solution for the EG community ClustalX showing DNA polymerase alignment GeneSpring showing yeast transcriptome.
Proteomics databases for comparative studies: Transactional and Data Warehouse approaches Patricia Rodriguez-Tomé, Nicolas Pinaud, Thomas Kowall GeneProt,
EMBOSS over a Grid 1. 1st EELA Grid School December 4th of 2006 Eduardo MURRIETA LEON Romualdo ZAYAS-LAGUNAS Pierre-Alain BRANGER Jérôme VERLEYEN Roberto.
Copyright OpenHelix. No use or reproduction without express written consent1.
EB3233 Bioinformatics Introduction to Bioinformatics.
A collaborative tool for sequence annotation. Contact:
An approach to carry out research and teaching in Bioinformatics in remote areas Alok Bhattacharya Centre for Computational Biology & Bioinformatics JAWAHARLAL.
XML-Based Grid Data System for Bioinformatics Development Noppadon Khiripet, Ph.D Wasinee Rungsarityotin, MS Chularat Tanprasert, Ph.D Royol Chitradon.
EBI is an Outstation of the European Molecular Biology Laboratory. EBI patent related services Jennifer McDowall Senior Scientist, EMBL-EBI 3 rd Annual.
EBI is an Outstation of the European Molecular Biology Laboratory. Gautier Koscielny VectorBase Meeting 08 Feburary 2012, EBI VectorBase Text Search Engine.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Central hub for biological data UniProtKB/Swiss-Prot is a central hub for biological data: over 120 databases are cross-referenced (EMBL/DDBJ/GenBank,
SEQUENCE RETRIEVAL SYSTEM SRS Tuomas Hätinen. Motivation Structural biology molecular biology genetics medicine Sequencing information physiology toxicology.
Integration of BioInformatics tools at NUS. GenBank Growth Chart Year Bases.
For EGI/EUDAT EMBL/ELIXIR use-cases Tony Wildish
High throughput biology data management and data intensive computing drivers George Michaels.
EMBL’s European Bioinformatics Institute
MATLAB Distributed, and Other Toolboxes
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
Lesson 3 Bioinformatics Laboratory
Presentation transcript:

Network Services for Biologists in the Genome Era The Work of the European Bioinformatics Institute

Our Genome tcaattctga tcgaataaac gaatttacat atttggtaag ttttggccaa tttcgtagca 60 atatgatgaa attgcgctct tttttaggaa tatcaaattg gaatataaca aaaaaaaaac 120 tgaaactaac caactgaatc taatgtgcat tttaaataat aaaaatggat cattttatac 180 atcatattaa aattaaaaaa atttcataaa aataatacgt agtaaaaaat aaaaattttt 240 aacataaata aannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 MTERENNVYK AKLAEQAERY DEMVEAMKKV ASMDVELTVE ERNLLSVAYK NVIGARRASW RIITSIEQKE ENKGAEEKLE MIKTYRGQVE KELRDICSDI LNVLEKHLIP CATSGESKVF YYKMKGDYHR YLAEFATGSD RKDAAENSLI AYKAASDIAM NDLPPTHPIR LGLALNFSVF YYEILNSPDR ACRLAKAAFD DAIAELDTLS EESYKDSTLI MQLLRDNLTL WTSDMQAEDP NAGDGEPKEQ IQDVEDQDVS Chr basepairs estimated 50, ,000 genes (3286 Mbases) 2/3 of which are completed and in the public domain.

...others... From: Genome MOT at the EBI (April 2000)

Data growth (EMBL DB)

Activity Areas at EBI EMBL –Archiving, development and distribution of DNA sequence data. Swiss-Prot –Archiving, production, development and distribution of Protein sequence data. MSD –Archiving and distribution of macromolecular structural data and structure prediction applications. DALI –Archiving and distribution of 2D/3D prediction databases and tools for their usage. ENSEMBL –Archiving, automatic analysis and distribution of Human genome data. CGG –Genome annotation, data mining, methabolic pathways research. CORBA –Design and implementation of CORBA-based tools for database querying SRS –Development and maintenance of SRS in collab.with Lionbiosciences. Industry –Links to industry and customised R&D (e.g. Gene Expression). External Services –Development and deployment of on-line interactive and non-interactive tools for sequence analysis.

EBI’s Network Services

Our common user interface

srs.ebi.ac.uk Sequence Retrieval System Core text search and retrieval engine for most services offered from EBI. Updates and links together more than 100 databanks. Biggest SRS server in the world (over 130 databases).

Genome & Proteomes Currently more that 30 complete genomes and proteomes are available interactively to the user community and demand for data from the Human genome is being met by providing access the all the material available in the EBI databases.

GPCRDB

A recent initiative: Ensembl Bringing discovery to the scientific community...

The Community EMBnet - European Molecular Biology network. Formed (officially) in 1988 to disseminate up-to-date molecular biology databases within member states. The initiative for the creation of EMBnet was started by EMBO council members in collaboration with EMBL staff in 1986.

Dissemination of EBI data resources to the world through the EMBnet

An EMBnet node (1/2) Hosted by a national academic centre. Has national coverage over the Internet. Provides services to academics as well as industry (ca users per node). Maintains local copies of the mayor biological databases and sequence analysis packages.

An EMBnet node (2/2) Provides training and education in the national language. Each node typically employs 2-3 staff. Each node has at least one major interactive login server and a WWW and ftp server (ca. 300 hosts today).

EMBnet organisation and main tasks ano 2K

EMBnet membership

EMBnet Milestones (1/2) Development of network based tools for database updates: – First transaction of sequence data between EMBL in Heidelberg and InfoBiogen in Paris. – First implementation of the HASSLE protocol between Norway, Switzerland and Italy. Asynchronous sequence database updates where then possible. – First implementation of xNDT between Norway and Sweden. Asynchronous sequence database updates…another solution. – First implementation of SynChron which runs from EBI today on several industrial sites.

EMBnet Milestones (2/2) Financial – The European Union grants support for EMBnet for the first time. Organisation – The Stichting EMBnet is formed granting EMBnet independence from any mayor member (e.g. EMBL). Software – SRS development was partly financed by EMBnet. – EMBOSS (under a GPL) is developed by EMBnet members.

Latest News New MPsrch - Fastest Smith & Waterman searches in the world (1.6 billion cell updates/sec) …available soon. Ensembl - fast delivery of newly predicted human genes and gene products into the public domain and access to similarity and homology searches on up-to- date data sets. Pre-calculated proteomic comparisons of genomes through InterPro. EST clustering, clean-up and redundancy reduction via the EuroGene Index.

Some facts and figures…(1/2) EMBL-EBI is the main provider of biology related sequence databases in Europe. –Sequence Databases (EMBL, TrEMBL, Ensembl (The Human Genome), etc.) –Cartographic Databases (RHdb) –Mutation Databases (HGBASE) –3D/2D Structure Databases (PDB, DSSP, etc.)

Some facts and figures….(2/2) EMBL-EBI produces more than 50 biological databases. EMBL-EBI handles ca. 100K request/day on and 170K requests/day on srs.ebi.ac.uk. (8M req./month) increasing at a rate of 15%/month. EMBL-EBI is moving more than 200Gb of data across the European networks each month.

Main usage is... Sequence querying and retrieval. Sequence comparison and searching. File distribution through ftp.ebi.ac.uk. Replication of data at many international sites (e.g. EMBnet nodes). Systematic use of based services.

Contacts EMBnet: EBI: corba.ebi.ac.uk, msd.ebi.ac.uk, fly.ebi.ac.uk, industry.ebi.ac.uk, interpro.ebi.ac.uk, etc. Ensembl: EMBL: