Presentation is loading. Please wait.

Presentation is loading. Please wait.

Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Hannu Saarenmaa ECOINFORMATICS 2006 JRC, Ispra, 2006-01-18 WWW.GBIF.ORG.

Similar presentations


Presentation on theme: "Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Hannu Saarenmaa ECOINFORMATICS 2006 JRC, Ispra, 2006-01-18 WWW.GBIF.ORG."— Presentation transcript:

1 Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Hannu Saarenmaa ECOINFORMATICS 2006 JRC, Ispra, 2006-01-18 WWW.GBIF.ORG Update on the Implementation of the GBIF

2 Global Biodiversity Information Facility GBIF’s objective is l to establish an distributed information infrastructure that serves scientific biodiversity data l with initial focus on primary data at specimen and observation levels, and on names, l expanding to species-level information, l with links to molecular, genetic and ecosystems levels

3 Global Biodiversity Information Facility What makes GBIF work (=recipe for cyberinfrastructure) l Standardised schemata for data sharing l Network of providers l Participant nodes promote and coordinate activities of data providers l Collaborative agreements l Control and ownership of data remains with providers l Procedures for interoperability l Web services, in particular l global registry for advertisement of shared data l Integration at GBIF Data Portal l But anyone can build a thematic or national portal l Vision and leadership l GBIF mandate is unique l GBIF is multi-purpose open-ended cyber-infrastructure that enables biologists to serve the society in new ways. l ”GBIF somos todos.”

4 Global Biodiversity Information Facility How is GBIF data scaling up? l 1-3 billion physical specimens in museums l Label data digitising $1 / specimen - huge backlog l Hundreds of millions already existing digital data records in observer networks l 85 million records are online today through GBIF l 10-20% of existing data

5 Global Biodiversity Information Facility l Observation networks of citizens already contribute about one half of GBIF data l Natural resource surveys, agricultural research, etc., are being linked to GBIF Real-time data

6 Global Biodiversity Information Facility Top 20 of 140 Data Providers

7 Global Biodiversity Information Facility GBIF role in indicator development SEBI2010 (Streamlining European 2010 Biodiversity Indicators) l Selected groups of organisms (birds, butterflies, fish, dragonflies, large mammals) for which large datasets exist will be used for SEBI2010 l No new monitoring - existing data collection networks will be used l These data sources could be mobilised through GBIF to create a real time indicator system

8 Global Biodiversity Information Facility More thrust on data sharing is needed l Solving of global problems requires integrated datasets that no single investigator or project can put together l Efficient work sharing is behind progress and economy: l ”I have been standing on the shoulders of giants” – Isaac Newton l Primary data is recyclable and multi-purpose l Citizens have the right to environmental information (Aarhus Convention) l Addressing ”the digital divide” requires equal access to data

9 Global Biodiversity Information Facility Who is driving data sharing? l GenBank – the molecular biology community l WSIS - World Summit on the Information Society – Geneve 2003, Tunis 2005 l IPCC - The Intergovernmental Panel on Climate Change - Data Distribution Centre l Creative Commons, Science Commons, Conservation Commons l LSST - Large Synoptic Survey Telescope (&SETI) l GBIF (draft) recommendation to national research councils to require sharing of data

10 Global Biodiversity Information Facility Data sharing is part of the modern scientific method l Your work is not yet done after you have published your results. l You also must publish your data! l In molecular biology this is already the state of matter l But: Efficient data sharing requires a supporting information infrastructure

11 Global Biodiversity Information Facility The Next Generation of GBIF Cyberinfrastructure Interoperability, integration and operational infrastructure for science

12 Global Biodiversity Information Facility Informatics priority 2007-11  Empower through web services the GBIF community to build applications on top of the growing pool of GBIF data, and integrate content at the GBIF Data Portal to show in user-friendly manner what is available and what is possible.

13 Global Biodiversity Information Facility Informatics objectives 2007-11 l Improve the user-friendliness and capabilities of the Data Portal l Enable through standard mechanisms Internet searching for biodiversity data across all levels of biological organisation, from molecules to ecosystems l Provide tools for improving data quality and assessing fitness for use early in the 5-year period

14 Global Biodiversity Information Facility Portal Data provider Provider Services Request Marshaller Query Engine Registry Institutions Providers Services ( UDDI ) Resource Metadata Resource Metadata GBIF information system architecture (current) Index Name provider Provider Services Resource Metadata Resource Metadata Cache Metadata Accounting SOAP DiGIR HTTP other Data Portal Data provider Provider Services Provider query Presentation Engine Query Engine Available providers Registry Institutions Providers Services ( UDDI ) User Resource Metadata Resource Metadata Index Name provider Provider Services Resource Metadata Resource Metadata and name query Metadata response Full data query Full data response Metadata and statistics Synonyms Publish availability Cache Metadata Accounting SOAP DiGIR HTTP other

15 Global Biodiversity Information Facility GBIF to establish mirror sites on three continents Oak Ridge (TN), USA - Berlin, Germany - Deijon, Korea Started Q3/2005 Q2/2006 Q1/2006

16 Global Biodiversity Information Facility Data qua- lity is a big issue

17 Global Biodiversity Information Facility

18

19

20 Design goals of the v2 data portal 1. Provision of infrastructure services to support the operation and use of the network, including: l Schema Repository – tools and resources to support use of biodiversity data standards (proposed, under design) l Service Registry – a directory of web-accessible biodiversity data resources (existing prototype) l Data Index – index of the data relating to each taxon accessible through the network (existing prototype) l Globally Unique Identifiers – tools and services to support persistent identifiers for biodiversity data elements (allowing data to be referenced and retrieved subsequently) (proposed) l Feedback Services – tools to allow users to provide comments and feedback to data providers (existing prototype) 2. HTML user interfaces to data held within the GBIF network. l Demonstration of the capabilities of the network and should also ensure that users can discover and access data of interest. Should also serve to stimulate the development of other access portals for specific communities. Does not support all kinds of uses of data! 3. Web services interfaces to data held within the GBIF network.

21 Global Biodiversity Information Facility Planned component architecture l See http://wiki.gbif.org/dadiwiki/http://wiki.gbif.org/dadiwiki/ l See next slide and its notes page l Data pipeline from bottom up l Notice the validation chain (steps of quality checking of data) before in enters the master index l Will be available gradually in 2006-2007

22 Global Biodiversity Information Facility GBIF National Portal GBIF Mirror Portal

23 Global Biodiversity Information Facility

24

25

26

27 Grimoires= UDDI+RDF

28 Global Biodiversity Information Facility


Download ppt "Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Hannu Saarenmaa ECOINFORMATICS 2006 JRC, Ispra, 2006-01-18 WWW.GBIF.ORG."

Similar presentations


Ads by Google