Published by Cecilia Newland
Modified about 1 year ago
THE DATA CITATION INDEX & DATACITE NIGEL ROBINSON 26 AUGUST 2014
©2010 Thomson Reuters OVERVIEW: What is the Data Citation Index; Collaboration with DataCite; Requirements to participate
DATA CITATION INDEX: Launched October M data records; Enable the discovery of data repositories, data studies and data sets in the context of traditional literature; Link data to research publications; Help researchers find data sets and studies and track the full impact of their research output; Provide expanded measurement of researcher and institutional research output and assessment; Facilitate more accurate and comprehensive bibliometric analyses
DATA REPOSITORIES: Over 1100 repositories identified
TYPES OF DATA BY DISCIPLINE. ART & HUMANITIES: cultural heritage, language corpus, image collections, recordings. SOCIAL SCIENCES: poll data, economic statistics, longitudinal data, national census, public opinion surveys. SCIENCE & TECHNOLOGY: maps, algorithms, genomics, sky surveys, astrophysics, remote sensing, museum specimens.
METADATA PROCESSING: Repository provides metadata feed; Collaboration on metadata handling; Normalisation and enhancement of metadata; Controlled vocabularies; Indexing; Loading to DCI as data object records; Citations from repository; Citations from literature; Metrics; Citation counts
INDEXING A DATA REPOSITORY ON WEB OF SCIENCE. Record Types. Repository/Source: Comprises data studies, data sets and/or microcitations. Stores and provides access to the raw data. Data Study: Descriptions of studies or experiments with associated data which have been used in the data study. Includes serial or longitudinal studies over time. Data Set: A single or coherent set of data or a data file provided by the repository, as part of a collection, data study or experiment. Microcitation (nanopublication): An assertion about concepts that have been found to be linked by scientific enquiry, and can be uniquely identified and attributed to its author. Made up of three separate parts: a subject, a predicate and an object. Process: Descriptive metadata feed from repository; Repository raw metadata is analysed; Metadata added. [Diagram labels: Repository, Data study, Data set, Microcitation]
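The four record types and the three-part structure of a microcitation described on this slide can be sketched as simple data structures. This is only an illustrative Python sketch: the names RecordType, Microcitation, obj, author and identifier are hypothetical, the example values are invented, and nothing here reflects the actual Data Citation Index schema.

```python
from dataclasses import dataclass
from enum import Enum


class RecordType(Enum):
    """The four record types named on the slide."""
    REPOSITORY = "Repository"
    DATA_STUDY = "Data study"
    DATA_SET = "Data set"
    MICROCITATION = "Microcitation"


@dataclass(frozen=True)
class Microcitation:
    """An assertion made of a subject, a predicate and an object,
    uniquely identified and attributed to its author."""
    subject: str
    predicate: str
    obj: str          # the "object" part of the assertion
    author: str       # hypothetical field: who the assertion is attributed to
    identifier: str   # hypothetical field: the unique identifier

    def as_triple(self):
        # Return the three parts of the assertion in order.
        return (self.subject, self.predicate, self.obj)


# Invented example values, for illustration only.
example = Microcitation(
    subject="gene X",
    predicate="is associated with",
    obj="disease Y",
    author="Doe, Jane",
    identifier="example-id-1",
)
print(example.as_triple())
```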
Search Results within the Data Citation Index present the powerful Web of Science options for exploring a body of information. Data becomes discoverable alongside literature.
Data deposition makes it possible to show related data from the repository
Because data are accessible and able to be cited, they can be linked to publications describing research which uses them
Link out directly to the original item, in this case a Data Study.
Start to build citation maps associated with data through the association of data and literature
Provide assistance in how to associate data and literature through citation
RESEARCHER PROBLEMS: Access & discovery; Citation standards; Lack of willingness to deposit and cite; Lack of recognition / credit. Data sharing leads to more science & more knowledge.
DEFINITIONS. Data repository: An online resource where data are deposited and stored for preservation and access. Data: Facts collected for reference or analysis. Non-traditional scholarly output of scientific research, often analysed in traditional research publications. May include numerical, textual, image, video or software information.
REPOSITORY SELECTION & EVALUATION. As we evaluate repositories for inclusion, some of the things we consider are: Editorial Content - ensuring that material is desirable to the research community. Persistence and stability of the repository, with a steady flow of new information. Thoroughness and detail of descriptive information. Links from data to research literature.
DATA REPOSITORIES: Repository must hold “data”; Repository must provide access to data; Data deposit; Material added/updated; Provide statistics on deposited data; Actively curate data in the archive; Active; Persistent IDs, DOIs or other permanent ID; Contacts available for confirmation of interpretation; Indication of intention to preserve data or provide access over the long term; Contingency if repository was to cease to operate; Make data accessible (or state licensing terms); Sustainable; Funding information available for repository and deposited data; Persistence; Links to literature; Citation in literature databases; Data reuse
CHALLENGES: Metadata –Resources –Expertise; Citable data source; Metadata quality –Unique & persistent identifiers –Consistency; Data repositories are not static –How is version control handled?; Partnerships
COLLABORATION BETWEEN DATACITE & THOMSON REUTERS: Increasing visibility of DOI; Synergies; Support for data citation principles
DATA CITATION INDEX PARTNERSHIPS [Diagram labels: Repository 1, Repository 2, Repository 3, DataCite, Data Citation Index]
REQUIRED METADATA: –Unique ID in repository –Date provided –Author –Repository –URL/DOI –Title –Year Published. Allows creation of a data citation using DataCite guidelines; Compliance with DataCite Metadata schema v3; Allows matching of data citations encountered to known data records
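As a rough illustration of how the seven required fields support building a citation, the Python sketch below checks a record for missing fields and formats a citation in the "Authors (Year): Title. Repository. Identifier" pattern seen in the formal citation examples in this presentation. The class, field and function names are hypothetical, the example values are invented, and this is not DataCite's or Thomson Reuters' actual implementation.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class DataRecord:
    """The seven required metadata fields listed on the slide
    (field names are hypothetical)."""
    repository_id: str      # Unique ID in repository
    date_provided: str      # Date provided
    authors: List[str]      # Author
    repository: str         # Repository
    url_or_doi: str         # URL/DOI
    title: str              # Title
    year_published: int     # Year Published


def missing_fields(record: DataRecord) -> List[str]:
    # Report any required field that is empty (empty string, empty list, 0 or None).
    return [f.name for f in fields(record) if not getattr(record, f.name)]


def format_citation(record: DataRecord) -> str:
    # Authors (Year): Title. Repository. Identifier
    creators = "; ".join(record.authors)
    return (
        f"{creators} ({record.year_published}): {record.title}. "
        f"{record.repository}. {record.url_or_doi}"
    )


# Invented example values, for illustration only.
record = DataRecord(
    repository_id="example-001",
    date_provided="2014-08-26",
    authors=["Doe, Jane", "Roe, Richard"],
    repository="Example Repository",
    url_or_doi="https://example.org/dataset/1",
    title="Example Survey Data",
    year_published=2014,
)
print(missing_fields(record))
print(format_citation(record))
```

A record that fails the completeness check cannot produce a full citation, which is one way to see why each of the seven fields is required.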
PARTNERSHIP BENEFITS: Access to DCI to review implementation; Badge for website; API to enable citation counts
DATACITE PARTNER REPOSITORIES: 68 repositories eligible for evaluation, including: –Archaeology Data Service –Chemotion –Collaborative Research in Computational Neuroscience (CRCNS) –eyeMoviePedia –FLOSSmole –German Center for Gerontology –GigaDB –MatDB –Movebank –Network for Earthquake Engineering Simulation (NEES) –Swedish National Data Service –UNAVCO –University of Southampton –World Data Centre For Climate –Zenodo
REASONS FOR NON SELECTION: Not meeting selection criteria –Not “data” –No data type; Poor quality or inconsistent metadata; Defective DOIs; More complete metadata from elsewhere –Crossover with other aggregation services; Australian National Data Service –Repository
DATA CITATION TRACKING: Infrastructure in place; Formal citations; Data citation matching process; Capture of informal citations
DATA CITATION: Current citation style (in full text of article as informal citations); Desired/future citation style (as formally cited references). Examples: U.S. Dept. of Justice, Bureau of Justice Statistics (1996): MURDER CASES IN 33 LARGE URBAN COUNTIES IN THE UNITED STATES, Version 1. Inter-university Consortium for Political and Social Research. Lee, Seung-Jae; Lee, He-Jin; Cho, Ji-Hoon; Rho, Sangchul; Hwang, Daehee (2008): GSE11574: The responses of astrocytes stimulated by extracellular a-synuclein. Gene Expression Omnibus. SE11574
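One simple way to picture the capture of informal citations is to scan article full text for identifiers of known data records. The Python sketch below does this with a whole-token, case-insensitive search. It is an assumption-laden illustration, not the matching process used for the Data Citation Index: the function name, the record key "rec-1" and the sample sentence are all hypothetical, and only the accession GSE11574 comes from the example on this slide.

```python
import re
from typing import Dict, List


def find_data_mentions(text: str, known_records: Dict[str, str]) -> List[str]:
    """Return the keys of known data records whose identifier appears
    in the text as a whole token (case-insensitive)."""
    matched = []
    for identifier, record_key in known_records.items():
        # \b on both sides prevents matching inside a longer identifier.
        pattern = r"\b" + re.escape(identifier) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE):
            matched.append(record_key)
    return matched


# Hypothetical lookup table: identifier -> internal record key.
known = {"GSE11574": "rec-1"}

# Hypothetical sentence of the kind found in an article's full text.
sentence = "Expression data were obtained from GEO (accession GSE11574)."
print(find_data_mentions(sentence, known))
```

Exact identifier matching only works when the text carries an identifier at all, which is part of why formally cited references with persistent identifiers are the desired style.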
DATA CITATION: Lee, Seung-Jae; Lee, He-Jin; Cho, Ji-Hoon; Rho, Sangchul; Hwang, Daehee (2008): GSE11574: The responses of astrocytes stimulated by extracellular a-synuclein. Gene Expression Omnibus. SE11574 [Diagram labels: Data Citation Index, New data metrics, Scientific literature, Published data sets]
DATA CITATION INDEX: Discovery of data most important to scholarly research; Data linked to published research literature; Measures of data citation, use and reuse with attribution assisted by identifiers; New metrics for digital scholarship
THANK YOU Nigel Robinson
ADDITIONAL SLIDES
DEPOSITION OF DATA BY RESEARCHERS
RESEARCHERS NOT RECEIVING CREDIT. Barriers to creating and sharing data: Researchers are hesitant to spend time and effort to create and share data because they don’t feel the work is adequately exposed or accredited. Researchers find it difficult to expose data they have produced because data repositories do not have clear standards or mechanisms in place for doing so.