GLOBAL BIODIVERSITY INFORMATION FACILITY
The Global Biodiversity Information Facility (GBIF): The distributed architecture
Samy Gaiji, Head of Informatics


1 GLOBAL BIODIVERSITY INFORMATION FACILITY
The Global Biodiversity Information Facility (GBIF): The distributed architecture
Samy Gaiji, Head of Informatics, GBIF
Biodiversity Information Standards (TDWG) 2009 Conference, 9-13 November 2009
WWW.GBIF.ORG

2 Objectives of this presentation
• Expose the challenges faced by GBIF in building a global information network;
• Present the GBIF distributed architecture strategy;
• Introduce the key building components of the GBIF Informatics Suite;
• Call on the community to participate.

3 A growing global network… 53 country participants, 43 associated participants

4 A growing network… 189.4 million records; 5% increase/month; 8186 data resources; 306 data publishers
[Chart: millions of primary biodiversity records; number of data publishers]
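As a rough illustration of what the figures on this slide imply, the sketch below compounds the quoted 5% monthly increase from 189.4 million records. Treating the rate as constant is an assumption made only for the arithmetic, and the function names are hypothetical.

```python
import math

RECORDS_MILLIONS = 189.4  # records indexed, from the slide
MONTHLY_GROWTH = 0.05     # 5% increase per month, from the slide


def projected_records(months, start=RECORDS_MILLIONS, rate=MONTHLY_GROWTH):
    """Records (in millions) after `months`, assuming a constant compound rate."""
    return start * (1 + rate) ** months


def doubling_time_months(rate=MONTHLY_GROWTH):
    """Months needed to double the record count at a constant compound rate."""
    return math.log(2) / math.log(1 + rate)


if __name__ == "__main__":
    added = projected_records(1) - RECORDS_MILLIONS
    print(f"Added in the first month: {added:.2f} M")
    print(f"Doubling time: {doubling_time_months():.1f} months")
```

Under that assumption the first month adds about 9.5 million records, close to the 8-9 M/month shown on the architecture slide, and the index would double in roughly 14 months.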

5 Architecture: Publishing, Indexing, Discovering
Protocols/tools: 80% DiGIR, 16% BioCASE, 3% TAPIR, <1% IPT
Schemas: 80% DwC, 18% ABCD, 2% others
Volume: 189 M records, 8-9 M/month, >300 publishers

6 A one-stop entry point to data discovery: http://data.gbif.org

7 What are the challenges today?
• More data types
• Richer user interface
• Better management
• Richer content
• Better synchronisation
• Improved discovery
Decentralisation is therefore aimed at empowering GBIF Nodes and Participants.

8 What are the key processes?
[Diagram labels: Node; Data Publishers; Service Publishers; Registry; Registering; Harvesting; Indexing; Discovering; Access]

9 What are the key components? The GBIF Informatics Suite for Participants
• Publishing toolkit
• Harvesting toolkit
• Portal toolkit
• Registry
[Diagram labels: Registration & Discovery; Data flow]

10 Publishing component (Data Publishers): The Integrated Publishing Toolkit (IPT)
• Provide a robust and user-friendly publishing tool (TAPIR compliant, WFS-WMS, EML, etc.);
• Improve the existing standards (DwC, DwC Archive) and enable the provision of richer content through extensions for specialised communities;
• Support the publishing of more data types, such as metadata, names, etc.

11 Harvesting/Indexing component: The Harvesting and Indexing Toolkit (HIT)
• Provide a tool that will:
  - harvest distributed data publishers using multiple protocols and schemas,
  - harvest multiple data types (primary biodiversity data, metadata, names),
  - synchronise with the GBIF Registry (part of the GBRDS),
  - index into a central database.
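The idea of harvesting over several protocols and indexing into one store can be sketched as a dispatch table with one harvester per protocol. This is only an illustrative sketch, not the HIT implementation: the function names, record fields, and endpoints are hypothetical, the harvesters are placeholders that make no network calls, and a plain dictionary stands in for the central database.

```python
from typing import Callable, Dict, Iterable, List

Record = Dict[str, str]
Harvester = Callable[[str], Iterable[Record]]


def harvest_digir(endpoint: str) -> Iterable[Record]:
    # Placeholder: a real harvester would query the DiGIR provider here.
    return [{"id": f"{endpoint}/1", "source": "DiGIR"}]


def harvest_tapir(endpoint: str) -> Iterable[Record]:
    # Placeholder: a real harvester would query the TAPIR provider here.
    return [{"id": f"{endpoint}/1", "source": "TAPIR"}]


# One harvester per supported protocol; a new protocol means one new entry.
HARVESTERS: Dict[str, Harvester] = {
    "DiGIR": harvest_digir,
    "TAPIR": harvest_tapir,
}


def harvest_and_index(
    resources: List[Dict[str, str]], index: Dict[str, Record]
) -> List[str]:
    """Harvest each resource with the harvester for its protocol and index the records.

    Returns the endpoints skipped because their protocol has no harvester.
    """
    skipped = []
    for resource in resources:
        harvester = HARVESTERS.get(resource["protocol"])
        if harvester is None:
            skipped.append(resource["endpoint"])
            continue
        for record in harvester(resource["endpoint"]):
            index[record["id"]] = record  # the dict stands in for the central database
    return skipped
```

Keeping the protocol-specific logic behind a common function signature means the indexing loop does not change when another protocol is added.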

12 Registry component: The Global Biodiversity Resources Discovery System (GBRDS)
• Provide a mechanism that will:
  - provide a registry of organisations and resources (collections),
  - provide a registry of schemas and extensions,
  - provide a registry of services and tools.
• A compass for all the information networks.
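A registry of organisations and their resources can be pictured as a small in-memory data structure. The sketch below is an assumption-laden illustration, not the GBRDS data model: every class, field, and method name is hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Resource:
    name: str
    endpoint: str
    schema: str  # for example "DwC" or "ABCD"


@dataclass
class Organisation:
    name: str
    resources: List[Resource] = field(default_factory=list)


class Registry:
    """In-memory stand-in for a registry of organisations and their resources."""

    def __init__(self) -> None:
        self._organisations: Dict[str, Organisation] = {}

    def register_organisation(self, name: str) -> Organisation:
        # Registering the same name twice returns the existing entry.
        return self._organisations.setdefault(name, Organisation(name))

    def register_resource(self, organisation: str, resource: Resource) -> None:
        self.register_organisation(organisation).resources.append(resource)

    def find_by_schema(self, schema: str) -> List[Resource]:
        # Discovery: list every registered resource published in a given schema.
        return [
            resource
            for org in self._organisations.values()
            for resource in org.resources
            if resource.schema == schema
        ]
```

A harvester could then ask such a registry which resources to visit, for example `registry.find_by_schema("DwC")`.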

13 Portal component (Node Access): The Nodes Portal Toolkit
• Provide a platform that will publish:
  - primary biodiversity data,
  - names,
  - metadata.
• Design it as a flexible and customisable platform to meet the needs of a variety of communities.

14 Where are we today?
• Production phase: Integrated Publishing Toolkit (IPT)
• Development/Testing phase: Harvesting and Indexing Toolkit (HIT); Global Biodiversity Resources Discovery System (GBRDS)
• Planning phase: Node Portal Toolkit (NPT)

15 Some successful examples… The DarwinCore Germplasm Extension Broadening standards

16 Some successful examples… The DarwinCore Germplasm Extension: Broadening standards
[Diagram labels: DarwinCore; Sample acquisition; Collecting event; Breeding event; 'IPR'; Trait experiment; Trait measurement]
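Darwin Core Archives attach extension rows to a core record through a shared identifier, in a star-shaped layout. The sketch below shows that linkage in miniature. The field names (`id`, `coreid`, `trait`, `value`) are hypothetical placeholders, not terms from the Germplasm Extension.

```python
from collections import defaultdict
from typing import Dict, List

Row = Dict[str, str]


def attach_extensions(
    core_rows: List[Row],
    extension_rows: List[Row],
    core_id: str = "id",
    ext_core_id: str = "coreid",
) -> List[dict]:
    """Return a copy of each core row with its matching extension rows attached."""
    by_core = defaultdict(list)
    for row in extension_rows:
        by_core[row[ext_core_id]].append(row)
    # Core rows without any extension rows get an empty list.
    return [
        dict(core, extensions=by_core.get(core[core_id], []))
        for core in core_rows
    ]


if __name__ == "__main__":
    core = [{"id": "1", "scientificName": "Hordeum vulgare"}]
    traits = [{"coreid": "1", "trait": "plant height", "value": "95"}]
    print(attach_extensions(core, traits))
```

Because the core rows are copied rather than modified, the same core file can be combined with different extensions independently.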

17 Some successful examples… The DarwinCore Germplasm Extension Publishing richer content.

18 Towards decentralisation
Examples: Global Register of Migratory Species; World Database on Protected Areas; Species richness changes…
• More data types, increased content, better data quality, more participants.
• Better discovery, improved integration.

19 A complex challenge…

20 A call to the community to participate
1. Improve standards (within and across domains);
2. Evaluate and contribute to the GBIF Informatics Suite;
3. Develop specific use cases (assessing threats to biodiversity, monitoring impacts of invasive species, agro-biodiversity…);
4. Actively engage in the decentralisation of the GBIF architecture to meet YOUR needs;
5. Address challenges in data quality and completeness;
6. Constantly monitor data usage and review/prioritise the Informatics developments.

21 Ask the GBIF Team!
• Nick King, GBIF Executive Secretary
• Samy Gaiji, Head of Informatics
• David Remsen, Senior Programme Officer for ECAT
• Vishwas Chavan, Senior Programme Officer for DIGIT
• Éamonn Ó Tuama, Senior Programme Officer for IDA
• Andrea Hahn, Data Portal Manager
• José Miguel Cuadra Morales, Programmer
• Kyle Braak, Programmer
• Markus Döring, Senior Programmer

22 Challenges: broadening data types!

