11 th GBIF Global NODES Meeting Incentivising and Strategising Publishing of Biodiversity Data Vishwas Chavan Senior Programme Officer for Digitisation and Mobilisation of Primary Biodiversity Data Global Biodiversity Information Facility (GBIF) 1 October 2011
Impediments to data publishing Lack of Incentives Absence of data publishing strategies Absence of data publishing strategies 3 incentives 3 strategies
Data Paper Data Citation Data Usage Index 3 incentives for data publishing GBIF Data Publishing Framework Task Group
Incentive 1: Data Paper What it is: Scholarly publication of searchable metadata document describing a dataset, or a group of datasets Provide scholarly credit to data publishers through citable journal publications Describe the data in a structured human-readable form Promote and publicize existence of data
Source: Chavan and Penev (in press). Data Paper: A mechanism to incentivise data publishing in biodiversity science. BMC Bioinformatics (special supplement), in press Incentive 1: Data Paper Workflow Integrated Publishing Toolkit (IPT) v facilitate authoring of metadata and auto-generation of Data Paper manuscript
Incentive 2: Data Citation “Data citation standards can form the basis for increased incentives, recognition, and rewards for scientific data activities. Unfortunately, such standards and good practices are lacking” CODATA Data Citation Task Group “Data citation standards can form the basis for increased incentives, recognition, and rewards for scientific data activities. Unfortunately, such standards and good practices are lacking” CODATA Data Citation Task Group Please cite this data as follows: (accessed through GBIF data portal, Mammal specimens, (accessed through GBIF data portal, Vertebrate specimens, (accessed through GBIF data portal, Natural History Museum Rotterdam, (accessed through GBIF data portal, Database Schema for UC Davis Wildlife museum, Please cite this data as follows: (accessed through GBIF data portal, Mammal specimens, (accessed through GBIF data portal, Vertebrate specimens, (accessed through GBIF data portal, Natural History Museum Rotterdam, (accessed through GBIF data portal, Database Schema for UC Davis Wildlife museum,
Incentive 2: Data Citation for publishers We need metrics! for datasets for contributors for role of contributors for release & updates for data volume for access location
Smithsonian National Museum of Natural History (2002 -), Museum Collection Records: Mammals records. Contributed by Helgen KM (Principal Investigator, curator, author), Gordon LK (manager, author, curator), Peurach SC (author, manager), Potter CW (manager, author), Carleton MD (curator), Maldonado JE (author, developer), Wilson DE (curator, author), Thorington Jr RW (curator, author, validator), Ludwig CA (manager, developer, author), Lunde DP (author). Published online, first released on 12/02/2002, last updated on 15/09/2010, doi: /smi Smithsonian National Museum of Natural History (2002 -), Museum Collection Records: Mammals records. Contributed by Helgen KM (Principal Investigator, curator, author), Gordon LK (manager, author, curator), Peurach SC (author, manager), Potter CW (manager, author), Carleton MD (curator), Maldonado JE (author, developer), Wilson DE (curator, author), Thorington Jr RW (curator, author, validator), Ludwig CA (manager, developer, author), Lunde DP (author). Published online, first released on 12/02/2002, last updated on 15/09/2010, doi: /smi Please cite this data as follows: (accessed through GBIF data portal, Mammal specimens, (accessed through GBIF data portal, Vertebrate specimens, (accessed through GBIF data portal, Natural History Museum Rotterdam, (accessed through GBIF data portal, Database Schema for UC Davis Wildlife museum, Please cite this data as follows: (accessed through GBIF data portal, Mammal specimens, (accessed through GBIF data portal, Vertebrate specimens, (accessed through GBIF data portal, Natural History Museum Rotterdam, (accessed through GBIF data portal, Database Schema for UC Davis Wildlife museum, Today Incentive 2: Data Citation
Incentive 3: Data Usage “We believe that the lack of incentive similar to the Impact Factor for scholarly publication remains a major impediment to the provision of free and open access to biodiversity data” GBIF Data Publishing Framework Task Group “We believe that the lack of incentive similar to the Impact Factor for scholarly publication remains a major impediment to the provision of free and open access to biodiversity data” GBIF Data Publishing Framework Task Group
Incentive 3: Data Usage Data Usage Index (DUI) What it is: The Data Usage Index is a measure of the impact of data publishing by being accessed and used by the stakeholder communities Source: Chavan and Ingwersen (2009) Towards a data publishing framework for primary biodiversity data: challenges and potentials for the biodiversity informatics community. BMC Bioinformatics, 10 (Sppl 14): S2
Incentive 3: Data Usage Source: Ingwersen and Chavan (in press). Indicators for Data Usage Index (DUI): An incentive for publishing primary biodiversity data through the global information infrastructure. BMC Bioinformatics (special supplement). DUI is computed on the basis of 14 biodiversity data usage indicators publishers country datasets thematic data DUI for..
3 strategies for data publishing Strategies and action plans Broadening data types & publisher communities Strengthening Infrastructure
Strategy 1: Action Plans It is essential for countries to have a biodiversity data discovery and mobilization strategy in alignment with their overall national biodiversity strategy & action plan (NBSAP) “However, we currently lack best practice guidelines on how to develop demand- driven strategies and action plans” Berents et.al., 2010 “However, we currently lack best practice guidelines on how to develop demand- driven strategies and action plans” Berents et.al., 2010 Source: Berents, Hamer and Chavan (2011). Towards demand-driven publishing: Approaches to the prioritisation of digitization of natural history collections data. Biodiversity Informatics, 7(2):
Strategy 2: Action Plans Content Needs Assessment December 2011 Data Gap Analysis December 2011 Comprehensive dialog with Participants to develop data publishing strategy & action plan Interactions with BIG institutions to ensure uptake of GSAP-NHC recommendations
Strategy 2: Expanding Audubon Core Multimedia Resources Metadata Schema Audubon Core Multimedia Resources Metadata Schema Norway – India IPBES Capacity Building Pilot Project Norway – India IPBES Capacity Building Pilot Project Camera trap data developed applied in
Strategy 2: Expanding EIA practitioners Local Governments Freshwater biodiversity Freshwater biodiversity Invasive Alien Species Invasive Alien Species
Strategy 3: Infrastructure Data Hosting Infrastructure: rescuing data at risk Data Hosting Infrastructure: rescuing data at risk GBIF Best Practice Guide on establishing and operationalising data hosting centre 2012
In summary 3 Incentives Strategy & Action Plans Broadening data types & publisher communities Strengthening infrastructure Strategy & Action Plans Broadening data types & publisher communities Strengthening infrastructure Data Paper Data Citation Data Usage Index Data Paper Data Citation Data Usage Index 3 Strategies
How to mainstream Data Paper? How do we strategise data publishing?