EBI as a research infrastructure Graham Cameron, EBI.

Slides:



Advertisements
Similar presentations
NEWMEDS – the work leading to these results has received funding from the Innovative Medicines Initiative Joint Undertaking (IMI) NEWMEDS Novel MEthods.
Advertisements

What is the capital of the UK? London What is the capital of France? Paris.
The library as a virtual research environment Bill Hubbard SHERPA Project Manager University of Nottingham.
A centre of expertise in digital information management UKOLN is supported by: Dealing with Data: Roles, Rights, Responsibilities & Relationships.
A centre of expertise in digital information management UKOLN is supported by: Changing Roles, Responsibilities and Relationships Dr Liz.
Paolo Donzelli ESTEC HR Division Den Haag 27 November 2008
Study Project The Countries and Capitals of the European Union.
EBI Proteomics Services Team – Standards, Data, and Tools for Proteomics Henning Hermjakob European Bioinformatics Institute SME forum 2009 Vienna.
The European Molecular Biology Laboratory (EMBL) is supported by sixteen countries. Consists of the main Laboratory in Heidelberg (Germany), Outstations.
EMBRACE Web Services Interoperability Through Standardisation BioHackathon 2008.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Archives and Information Retrieval
Welcome to CERN Research Technology Training Collaborating.
Luxembourg, Sep 2001 Pedro Fernandes Inst. Gulbenkian de Ciência, Oeiras, Portugal EMBER A European Multimedia Bioinformatics Educational Resource.
EMBL-EBI and Bioinformatics Steven Newhouse, Head of Technical Services, EMBL-EBI.
1 Aventis Pharma. 2 Prescription drugs Aventis Pharma Vaccines Aventis Pasteur Therapeutic proteins Aventis Behring Diagnostics Dade Behring.
Welcome to EMBL-EBI Dr Laura Emery. Before we start… Stand up How experienced are you in bioinformatics? Get to know each other by arranging yourselves.
ISARE : Health indicators in the regions of Europe André Ochoa for Isare team ISARE : Health indicators in the regions of Europe André Ochoa for Isare.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Bioinformatics Grid Application for Life Science. COMMUNICATION NETWORK DEVELOPMENT SPECIFIC SUPPORT ACTION BIOINFOGRID Luciano Milanesi CNR-ITB.
Deploying Wireless in Chemical Industry. Deploying Wireless in Chemical Industry Peter Schellekens Vice President Sales & Marketing - Global Chemical.
By Alex Wright & Nick Dartizio
1 Public Investment in Film & TV Works: Film Funds in Europe Susan Newman-Baudais Analyst – Film Industry European Audiovisual Observatory Public Investment.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Learning and exploring Life science through the EBI reosurces and tools BIOQUEST workshop_2011 Vicky Schneider, EMBL-EBI Training Programme Project leader.
Research Infrastructures in Structural Biology - NMR Lucia Banci CERM, Florence, Italy Workshop "Future Needs for Research Infrastructures.
Impact of accession to the EU on the Hungarian science policy Hungarian Academy of Sciences 14th October, 2003.
Network Services for Biologists in the Genome Era The Work of the European Bioinformatics Institute.
Hi! I suggest you to guess the country and its capital. Let’s begin!
Challenges for the study of disease in the 21 st century Characterise the function of every gene in the mammalian genome Generate mutations in every gene.
C ross-European data sharing made easy EDAF Luxembourg.
International Regulome Consortium Toronto – October 29, 2005 Cindy Bell, VP, National Genomics Program Genome Canada.
User views from outside of Western Europe MarkoBonac, Arnes, Slovenia.
Working Group 2 – Volcano Observations Meetings of the WG2: -November 2010: KO meeting (Rome; IT) -April 2011: Splinter meeting during the EGU 2011 (Vienna;
S&T Activity of the Hungarian Academy of Sciences: an overview  Knowledge-based society: the European vision  General information on the Hungarian Academy.
Rackspace Analyst Event Tim Bell
IntAct- An Open Standard and Software for Protein-Protein Interaction Data Henning Hermjakob 1, Luisa Montecchi-Palazzi 9, Chris Lewington 1, Dan Wu 1,
EMBL-EBI EMBL-EBI EMBL-EBI What is the EBI's particular niche? Provides Core Biomolecular Resources in Europe –Nucleotide; genome, protein sequences,
EMBRACE An example of Grid Integration (I): The EMBRACE project Jean SALZEMANN CNRS/IN2P3.
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
CERN as a World Laboratory: From a European Organization to a global facility CERN openlab Board of Sponsors July 2, 2010 Rüdiger Voss CERN Physics Department.
European Molecular Biology Laboratory: An overview
ArrayExpress - a Public Repository for Microarray Based Gene Expression Data European Bioinformatics Institute - EMBL outstation and German Cancer Research.
The (IMG) Systems for Comparative Analysis of Microbial Genomes & Metagenomes: N America: 1,180 Europe: 386 Asia: 235 Africa: 6 Oceania: 81 S America:
Where? Who? When? What? Why? (How?) Ray Barrett X-ray Optics Group
Stakeholder Relations at Large-Scale Infrastructures The CERN Model Rolf Heuer 7 th Canadian Science Policy Conference, Ottawa, 26 November 2015.
The Mission of CERN  Push back  Push back the frontiers of knowledge E.g. the secrets of the Big Bang …what was the matter like within the first moments.
For EGI/EUDAT EMBL/ELIXIR use-cases Tony Wildish
Someone needed here to point out where Europe is….
Semmelweis University Publication Database Semmelweis University Publication Database László Hunyady MD, PhD, DSc Department of Physiology Semmelweis University,
Welcome to EGI Community Forum 2014 May 19 th, 2014 Anita Lehikoinen Permanent Secretary.
Warsaw University of Technology History since 1826 the main building of WUT Students: Academic staff: 2500 other staff: faculties Welcome.
Warsaw University of Technology history since 1826 students academic staff2 500 other personnel faculties Warsaw University of Technology.
EMBRACE Workshop Appled Gene Ontology ITB – CNR Bari, Italy 7. – 9. November 2007 Domenica D’Elia, Giulia De Sario, Andreas Gisel, Cecilia Saccone, Angelica.
Table 1. Number and rate of Legionnaires’ disease cases per population by country and year, EU/EEA, 2010–2014 ASR: age-standardised rate, C: case-based.
Someone needed here to point out where Europe is…
Countries and Capitals of Western Europe
EMBL’s European Bioinformatics Institute
European Molecular Biology Laboratory
The Integrated Microbial Genome (IMG) systems
EMBL – European Molecular Biology Laboratory
ELIXIR: Authentication and Authorization Infrastructure Requirements
Member States of the EU Austria
The European Union (EU for short)
UniProt: the Universal Protein Resource
European Centre for Continuous Auditing
 Vice Presidency for Research EPFL Presentation - Public | 2017.
WEST EUROPE MAP REVIEW.
Someone needed here to point out where Europe is…
Presentation transcript:

EBI as a research infrastructure Graham Cameron, EBI

Heidelberg Hinxton Monterotondo Hamburg Grenoble ServiceResearchTrainingIndustry EMBL EBI

Member States of EMBL Austria Belgium Denmark Finland France Portugal Spain Sweden Switzerland United Kingdom Germany Greece Israel Italy The Netherlands Norway

Hinxton ServiceResearchTrainingIndustry EBI

~ €3.8 Billion

We have amassed a wealth of knowledge about the molecular processes of living systems Biomacromolecules Biologically active molecules The behaviour and interactions of these molecules The phenotypic effects of molecular changes Mutations Drugs Nutrients Themolecular adjuncts of phenotypic changes Disease Aging Databases Web access Tools to explore the information Systems to capture the information Service centres

DNA

Protein Sequences

Expression

Structures

PDB code 1DIF HIV-1 Protease/Inhibitor Complex A79285 (Difluoroketone) molecules interact

Pathways

Reactome EnsEMBL Genome Annotation EMBL-Bank DNA sequences UniProt Protein Sequences Array-Express Microarray Expression Data EMSD Macromolecular Structure Data IntAct Protein Interactions

Usage Basic research Industry Pharma Diagnostics Medical device research Personal care Nutrition Agriculture Forestries Fishery Patent searching and provenance

Using the information Not Salt TolerantSalt Tolerant Disease proneDisease Resistant Low YieldHigh Yield DiseasedHealthy Suppose a gene’s variation seems important

Using the information Not Salt TolerantSalt Tolerant Disease proneDisease Resistant Low YieldHigh Yield DiseasedHealthy Look in databases for similar genes, their products, and functions, structures, interactions and expression patterns. The processes in which they are involved.

Using the information Not Salt TolerantSalt Tolerant Disease proneDisease Resistant Low YieldHigh Yield DiseasedHealthy Can we influence the processes in which they are involved?

Using the information Not Salt TolerantSalt Tolerant Disease proneDisease Resistant Low YieldHigh Yield DiseasedHealthy Can we influence the processes in which they are involved?

Working out what in the lab what a gene does could easily be a year ’ s work Searching databases can do it in half an hour

Nucleotide Sequence Database Growth Megabases Date A new sequence once a second

Average Web Hits per Day Including Ensembl Quarter Year Average Hits per Day Note: Ensembl is a joint project with The Wellcome Trust Sanger Institute. Equivalent usage data have only been available since A few hundred thousand unique users per month A million unique users per year

European Context BioSapiens EMBRACE ENFIN (and many others)

Biosapiens European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, Cambridge, UK. European Molecular Biology Laboratory, Heidelberg, Germany. German National Centre for Environment and Health, Neuherberg, Münich, Germany Université Libre de Bruxelles, Brussels, Belgium Consejo Superior de Investigaciones Cientificas, Madrid, Spain Institut Municipal d'Assistència Sanitària, Barcelona, Spain Genome Research Ltd, Hinxton, Cambridge, UK. Max-Planck Institute for Informatics, Saarbrücken, Germany The Hebrew University of Jerusalem, Girat Ram, Israel Department of Biochemical Sciences University of Rome "La Sapienza", Rome, Italy University of Stockholm, Stockholm, Sweden University of Oxford, Oxford, UK. University College London, London, UK. Radboud University Nijmegen, Nijmegen, The Netherlands Swiss Institute of Bioinformatics, Geneva, Switzerland Technical University of Denmark, Lyngby, Denmark University of Helsinki, Helsinki, Finland University of Geneva, Geneva, Switzerland Institute of Enzymology, Hungarian Academy of Sciences, Budapest, Hungary University of Cologne, Cologne, Germany Institut Pasteur, Paris, France BioInfo Bank Institute, Poznan, Poland Max Planck Institute for Molecular Genetics, Berlin, Germany Genoscope, Evry, France University of Bologna, Bologna, Italy European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, Cambridge, UK

EMBRACE European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, Cambridge, UK. European Molecular Biology Laboratory, Heidelberg, Germany. Institute of Biomedical Technologies, Section Bari, CNR, Bari, Italy University of Manchester, UK Swiss Institute of Bioinformatics, Geneva, Switzerland Swedish University of Agricultural Sciences.The Linnaeus Centre for Bioinformatics, Sweden Centre National de la Recherche Scientifique, Clermont-Ferrand and Lyon, France Centre for Biological Sequence Analysis,Technical University of Denmark, Lyngby, Denmark Centro Nacional de Biotecnologia/Consejo Superior de Investigaciones Cientificas, Madrid, Spain University of Stockholm, Stockholm Bioinformatics Centre, Sweden Institut National de la Recherche Agronomique, Toulouse, France Max Planck Institute for Molecular Genetics, Berlin, Germany CSC, the Finnish IT Center for Science, Espoo, Finland University College London, London, UK. The Weizmann Institute, Rehovot, Israel Centre for Molecular and Biomolecular Informatics, University of Nijmegen, The Netherlands Carretera de Ajalvir, km. 4, Torrejon de Ardoz, Madrid

ENFIN The European Bioinformatics Institute / The European Molecular Biology Laboratory, Europe The University of Dundee UK Technical University of Denmark University of Rome Tor Vergata Italy) Medical Research Council Mammalian Genetics Unit (MRCMGU), UK Ludwig Institute for Cancer Research, Uppsala (LICR-UPP), Germany The Max Planck Institute, Germany University of Helsinki (UH), Iceland University College London (UCL), UK National Center for Research and Technology, Hellas (CERTH), Greece Universitaet zu Koeln (UNIK), Germany Weizmann Institute (Weizmann), Israel Egeen (EGEEN), Estonia Serono Pharmaceutical Research Institute (SPRI), Switzerland Consejo Superior de Investigaciones Científicas (CSIC), Spain Centre for Integrative Bioinformatics VU (IBIVU), Netherlands

Global Picture DNA – tripartite international collaboration (including patent data acquisition) Protein sequences – Uniprot collaboration Macromolecular structures – tripartite international collaboration Intact international agreements Reactome – USA Europe collaboration Etc.

Flybase MGD SGD BRENDA Chemical data resources Medical data resources Biodiversity data resources IMGT Pasteur DBs Eumorphia/ Phenotypes Core biomolecular resources Specialist biomolecular data resource examples Mutants Large resources in related disciplines Model organism resource examples Mouse Atlas

Large resources in related disciplines Biodiversity data resources Flybase MGD SGD BRENDA Chemical data resources Medical data resources IMGT Pasteur DBs Eumorphia/ Phenotypes Core biomolecular resources Specialist biomolecular data resource examples Mutants Model organism resource examples Mouse Atlas

Medical data resources Core biomolecular resources

Flybase MGD SGD BRENDA Chemical data resources Medical data resources Biodiversity data resources IMGT Pasteur DBs Eumorphia/ Phenotypes Core biomolecular resources Specialist biomolecular data resource examples Mutants Large resources in related disciplines Model organism resource examples Mouse Atlas

Web Hits

EBI Total Running Budget 2005 = € 26 million Projected budget 2011 = €43 million

Read-only or dynamic There’s nothing particularly difficult about archiving unchanging data But most aren’t Todays best bet E.g, Ensembl Provenance E.g., patent searching N.B. Versioning (complex!) Cititation

How much data Canonical vs. episodic Genomes, expression profiles Raw vs. processed Sequence traces Structure factors

Custodianship acquisition and ownership Widely accepted obligation to deposit data Depend on the goodwill of the community Add “organisation” Add “services” Add “value”

Annotation as added value First/second/third party annotation Computational vs. experimental Bundled vs. distributed (DAS)

Openness We approve of it Data must be made available as soon as they are discussed in a publication Data from “community” projects should be made available immediately Confidentiality issues must be addressed

Federation Monolithic solutions fail Centralisation yields more than the sum of the parts Aggregation of institutional repositories is essential

Slice it vertically or horizontally? E.g., the EBI and AstroGrid are domain specific Would it be better if they were jointly managed by data experts? Standardisation Mixed success

Supporting the electronic record of science This is more like libraries than research projects Needs long term commitment With accountability Current funding structures are not well adapted to the task Pitching the information providers in competition with their research community is damaging.

Bioinformatics Infrastructure Has captured the data from several billion Euros worth of science Serves a community of perhaps a million users Supports science on which the UK alone spends €3-4 billion a year Cuts years of lab work down to hours of computer work Is crucial to human well being from medicine to agriculture Sees data volume and usage growing exponentially Might cost a few tens of millions (at most a couple of percent of the cost of the science it supports).