GBIF Publishing Platform May 2011. Core publishing focus Primary Biodiversity Data (Specimens & Observations, Ecological Data) - Core data type is an.

Slides:



Advertisements
Similar presentations
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Towards Data Publishing Framework.
Advertisements

UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
© GEO Secretariat The Group on Earth Observations – Status and Post 2015 Osamu Ochiai GEO Secretariat 41 st CGMS Tsukuba, Japan 8-12 July 2013.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Entomological Collections Network Meeting, Indianapolis, IN 13 December 2009 Darwin Core Ratified in the Year of Darwin Gail E. Kampmeier Illinois Natural.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer August G Informatics Infrastructure and Portal (IIP)
EU BON citizen science gateway Veljo Runnel University of Tartu Natural History Museum.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer September G A Darwin-Core Archive solution to publishing and.
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
Developing Data Attribution and Citation Practices and Standards: An International Symposium and Workshop August , 2011 Hotel Shattuck Plaza Data.
SERNEC Image/Metadata Database Goals and Components Steve Baskauf
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat Data Publishing.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
GLOBAL BIODIVERSITY INFORMATION FACILITY The Global Biodiversity Information Facility (GBIF ): The distributed architecture Samy Gaiji Head of Informatics.
DAEDALUS Project William J Nixon Service Development Susan Ashworth Advocacy.
11 th GBIF Global NODES Meeting Incentivising and Strategising Publishing of Biodiversity Data Vishwas Chavan Senior Programme Officer for Digitisation.
1 The IODE Ocean Data Portal - current status and future Nikolai Mikhailov, Chair of IODE/JCOMM ETDMP National Oceanographic Data Centre, Russia Four Session.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Mid-Term GBIF Committees Meetings eLearning Alberto González Talaván Global Biodiversity Information Facility (GBIF) May 2011.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen ECAT Program Officer October DarwinCore Archives – Simplified Format for publishing.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
GLOBAL BIODIVERSITY INFORMATION FACILITY Cataloging and using Taxonomic Data The Global Names Architecture David Remsen Senior Programme Officer, ECAT.
Standards and tools for publishing biodiversity data Yu-Huang Wang June 25, 2012.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
GLOBAL BIODIVERSITY INFORMATION FACILITY Éamonn Ó Tuama Senior Programme Officer, IDA 21 June Metadata publishing with the IPT.
1 GBIF and Ocean Biodiversity, OBI'07 Conference, Oct 2-4, 2007, Dartmouth, Nova Scotia GBIF and Ocean Biodiversity Building the data web with OBIS Éamonn.
Resolving the publishing bottleneck and increasing data interoperability in biodiversity science Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts,
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
GEOSS Common Infrastructure Internal Structure and Standards Steven F. Browdy (IEEE)
GBIF Mid Term Meetings 2011 Biodiversity Data Portals for GBIF Participants: The NPT Global Biodiversity Information Facility (GBIF) 3 rd May 2011.
Strengthening the Science-Policy Platform on Biodiversity and Ecosystem Services Africa Consultation on IPBES May 2010 Nairobi, Kenya Peter Gilruth,
Isabel Calabuig Lotte Endsleff 1 NODES regional MEETING Europe Digitarium,
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Laura Russell Programmer VertNet Buenos Aires (Argentina) 28 September 2011 Training course on biodiversity data publishing and.
CBD CoP 11 Special Event National Biodiversity Information Outlook (NBIO) Vishwas Chavan 15 October 2012 Hyderabad.
Consultant Advance Research Team. Outline UNDERSTANDING M&E DATA NEEDS PEOPLE, PARTNERSHIP AND PLANNING 1.Organizational structures with HIV M&E functions.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Incentives for Biodiversity Data Publishing June 2011.
Report of the Architecture and Data Committee (ADC) R.Shibasaki (ADC, Japan)
IABIN Executive Committee / Coordinating Institution Meeting GBIF and IABIN: status and opportunities in 2011 Juan Bello, Mélianie Raymond & Alberto González-Talaván.
Task XX-0X Task ID-01 GEO Work Plan Symposium April 2014 Task ID-01 “ Advancing GEOSS Data Sharing Principles” Experiences related to data sharing.
The New GBIF Data Portal Web Services and Tools Donald Hobern GBIF Deputy Director for Informatics October 2006.
GBIFS Seminar with the Science Committee and the Nodes Strategy Group Analysis of the content published by the GBIF network – Better understanding what’s.
Amazon Basin Biodiversity Information Facility – ABBIF.
IABIN Species and Specimens Thematic Network (SSTN) IABIN Executive Committee/Coordinating Institution Meeting. Tierras Enamoradas, Costa Rica. February.
GBIF - ECAT  Electronic Catalogue of Names of Known Organisms  Program Officer;  Per de Place Bjørn 
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan Senior Programme Officer for DIGIT 10 th Meeting of the GBIF Participant Node Managers Committee.
Laura Russell VertNet Meherzad Romer NatureServe Canada John Wieczorek
GLOBAL BIODIVERSITY INFORMATION FACILITY Vishwas Chavan and Eric Gilman 10 th Meeting of the GBIF Participant Node Managers Committee 3 – 5 October 2009.
GLOBAL BIODIVERSITY INFORMATION FACILITY David Remsen Senior Programme Officer, ECAT 3 Oct th Nodes Meeting.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
12 th Meeting of the GBIF Participant Nodes Committee 6-7 October 2013, Berlin, Germany Data mobilization and use for international policy Olaf Bánki Senior.
GBIF NODES Committee Meeting Copenhagen, Denmark 4 th October 2009 The GBIF Integrated Publishing Toolkit Alberto GONZÁLEZ-TALAVÁN Programme Officer for.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Making Sense of the Alphabet Soup of Standards Practical Support for Managing Electronic Resources DDAKBARTTransfer Betty Landesman ER&L Conference February.
2016 WORK PROGRAMME PROGRESS UPDATE May Progress 21 sampling-event datasets.
Sample-based data publication; reflections on semantics and logic 1(1) Hanna - GBIF Finland Lepidoptera collection of Hannu SaarenmaaPublicNo (but DwC.
Paul Eglitis [IEEE] and Siri Jodha S. Khalsa [IEEE]
Making Sense of the Alphabet Soup of Standards
An Overview of Data-PASS Shared Catalog
Flanders Marine Institute (VLIZ)
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Data publishing from the viewpoint of a biodiversity publisher
GLOBAL BIODIVERSITY INFORMATION FACILITY
GBIF Today and Tomorrow
Presentation transcript:

GBIF Publishing Platform May 2011

Core publishing focus Primary Biodiversity Data (Specimens & Observations, Ecological Data) - Core data type is an occurrence of a taxon Taxonomic Catalogues*, and Annotated Species Checklists. - Core data type is a taxon * To distinguish our efforts from COL – GBIF provides the means not the ends Enriched resource metadata – primarily focused on occurrence and taxon datasets.

Core publishing targets Sufficient coverage of fit-for-use primary biodiversity data to meet identified requirements. A Primary Biodiversity Data clearinghouse. A comprehensive “catalog of catalogues” that provide core taxonomic dictionaries and organisational framework for biodiversity data * To distinguish our efforts from COL – GBIF provides the means not the ends A comprehensive inventory of Primary Biodiversity data collections (digital and un- digitised) and nationally or thematically relevant species checklists.

Data Publishing Platform for who? Institutional publishers in developed countries. Large proportion of current publishers in this category. Smaller institutions with less technical capacity. Many in high-biodiverse regions Small (individual scientist) data holders ‘Disenfranchised” potential publishers who currently don’t recognise GBIF as a publishing option

Consolidate Strengthen Simplify Accelerate Extend Data publishing strategy How

Data publishing today (Often) Unsupported software on dedicated servers Misused protocols Multiple data formats Complex, rigid and difficult to maintain even for those with capacity Requires System administrators and programmers Re-indexing the CURRENT set of resources = 1 MONTH!! these are toothpicks TAPIR

Consolidated data standards Primary Biodiversity Data Taxonomic Data Metadata Darwin Core Ecological Metadata Language (EML) 172 Terms Ratified in 2009 Text files Extensible Rich dataset descriptions GBIF Profile

Darwin Core Archive Primary Biodiversity Data Taxonomic Data Metadata

Accelerate network performance DarwinCore archives – self-contained packages of data Published as a URL! Consolidated – one format for Primary and Taxonomic data From Months to Daily – for some users – faster Published data appears much faster! Simplified harvesting (Priority #3, Participants Report) Darwin Core Archives

Rebuilt following community consultation Provide a supported, evolving publishing tool Supports all three CORE data types (Primary/Taxon/Metadata) Establish a Steering Committee to guide product direction Seek Guidance from SC on directions too! A Platform for offering Community Services Integrated Publishing Toolkit 2.0 Strengthened publishing tool

Integrated Publishing Toolkit Metadata Authoring Primary Biodiversity Data Species Checklists Metadata Authoring Primary Biodiversity Data Species Checklists

Data Hosting Centers Coming soon… Endangered Wildlife Trust SABIF EIA Data Center INBIF EIA Data Center Coming soon… Endangered Wildlife Trust SABIF EIA Data Center INBIF EIA Data Center

Publish with spreadsheets Metadata Primary Biodiversity data Species Checklists Publishing via For biologists and database managers

No special software required

Suite of publishing options

Data Publishing documentation Full documentation for all aspects of data publishing Living documents

Improve Data Quality New roles for engaging Participants

Enable Data Quality Assessment & Improvement as part of the Network Today done in Copenhagen Tomorrow – through the network Improvements made BEFORE data published New and increased roles for participating in GBIF

Extend the standard Darwin Core Archives are extensible Simple Extensible Internationalised Standards-based

Global Names Architecture A Darwin Core-based profile using the GBIF network to share taxonomic information. Evaluation underway – 16 reviewers / 39 checklists

GBIF Schema Repository Darwin Core Terms List of Extensions Vocabularies An schema repository for developers and trainers

Ensure local uptake of technology

Make data publishing worthwhile Improved attribution through provenance improvements in Registry Improved relevance through extensibility DarwinCore Archives support multiple uses of data. Not all roads lead to Copenhagen For OrganisationsFor Individuals Data publishing = scholarly publishing Increased visibility due to simplified and consistent citation methods Data is easier to consume! For Both Make good data even better!

Persistent Identifiers Journal System Submission Acceptance Revision Peer Review Publication Registry GBRDS DOI Distributed Metadata Catalogues Metadata Authors auto conversion to manuscript GBIF Metadata Repository ZooKeys PhytoKeys BioRisks Data Paper: Recognising Data Discovery

Reward data publishing Data Paper Metadata document

Deep Data Citation Mechanism & Service Deep data citation mechanism Recognise ALL with their roles Multilayer citation – Publisher determined and User driven Cascading citation: Citations within citations Assign GUIDs for citation text Data Citation Service Register citations Resolve citation GUIDs Status: Working with CODATA Data Citation Task Group & DataCite

Anticipated Impact of Deep Data Citation Data Citation Data Discovery Data Publishing Data Preservation Data Use

Best Practice Guide on Data Publishing & Use Decleration Survey of existing ‘Data Sharing’ & ‘Data Use’ agreements (35 agreements) Issues identified: – Terminologies used: Provider v/s Publisher, Sharing v/s Publishing, Agreement v/s Declaration – Lack of adequate information: e.g. data citation, fitness-for-use, license, etc. Best Practice Guide on Publishing & Use Declaration (Q3 2011)

Catalogue of tools & services for discovery, digitisation & publishing Tools and services for : – data/metadata capture/collection – data digitisation – quality assessment – quality enhancement – discovery and publishing Community-driven Frequent updates Included in Welcome Box and Online Resource Centre

Questions: METADATA Mobilising more metadata In order to get more metadata catalogues connected, we need greater uptake by GBIF Participants and other organisations. How? act opportunistically – just accept what is offered? just target some of the key external networks (e.g., KNB)? undertake a focused campaign among Participants using the Participants Report feedback? is there a case for a second round of the incentivisation scheme for metadata catalogues but with a focus on “bigger catches” like national catalogues?

Questions: Checklists Mobilising more sources Suggest strategies should be employed for mobilising species lists? Example: Harmonising national and state-relevant species lists to ITIS in USA Priority lists – National Species Inventories Priority subject – National Clearinghouse Mechanisms

Questions: GEO BON / GEOSS How can the GBIF community get itself better represented in GEOSS / GEO BON? leave it to the GBIF Secretariat? just need to be kept informed, e.g., via community site and GBIF news items? active participation - have resources to contribute infrastructure at national level? participate in project consortia to take development forward?