PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.

Slides:



Advertisements
Similar presentations
Repositories, Learned Societies and Research Funders Stephen Pinfield University of Nottingham.
Advertisements

ESDS user support materials and resources: how to use them Support Services Royal Statistical Society, London 13 February 2009.
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
Depositing Data for Archiving Libby Bishop ESDS Qualidata, University of Essex Changing Families, Changing Food Meeting University of Sheffield 15 March.
Data management, data sharing and the activities of the UKDA Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009.
Access to Economic and Social Data via the UK Data Archive Jack Kneeshaw UKDA.
ESDS - a new service Kevin Schürer, Director, ESDS/UKDA.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
KRDS BENEFITS FRAMEWORK, VALUE- CHAIN AND BENEFIT ANALYSIS TOOLS: UK DATA ARCHIVE CASE STUDY …………………………………………
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
A DATACITE CASE STUDY FROM THE UK DATA ARCHIVE …………………………………………………………………………………………………… TOM ENSOM …………………….…………………………….… UK DATA SERVICE UK DATA ARCHIVE.
BEST PRACTICE FOR DATA SHARING ……………………………………………………
Versioning Requirements and Proposed Solutions CM Jones, JE Brace, PL Cave & DR Puplett OR nd April
Data citation from the perspective of a scholarly publisher Lyubomir Penev TDWG Data Citation Workshop, New Orleans, Oct 2011 ViBRANT.
IDENTIFIERS & THE DATA CITATION INDEX DISCOVERY, ACCESS, AND CITATION OF PUBLISHED RESEARCH DATA NIGEL ROBINSON 17 OCTOBER 2013.
Discove r Humanities and Social Science Electronic Thesaurus - HASSET Faceted search HASSET is the subject thesaurus that the UK Data Service uses to index.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
DATA LIFECYCLE & DATA MANAGEMENT PLANNING ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH DATA.
Open Exeter Project Team
Managing sensitive data and authorship in Humanities and Social Sciences ODIN conference, Cologne October 2013 Louise Corti Collections Development and.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Copyright 2006 M.R.Thorley/NERC Mark Thorley, Natural Environment Research Council Research Outputs: Their Access & Preservation A perspective.
THE DATA CITATION INDEX AN INNOVATIVE SOLUTION TO EASE THE DISCOVERY, USE AND ATTRIBUTION OF RESEARCH DATA MEGAN FORCE 22 FEBRUARY 2014.
Presented by Ansie van der Westhuizen Unisa Institutional Repository: Sharing knowledge to advance research
Presented by DOI Create: TERN as a use-case Siddeswara Guru
Good practice in Research Data Management Module 6: Tools, training and support.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
MANAGING YOUR RESEARCH DATA: PLANNING TO SHARE ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH.
DATA MANAGEMENT SUPPORT FOR RESEARCHERS …………………………………………
The DSpace Course Module – An introduction to DSpace.
DAEDALUS Project William J Nixon Service Development Susan Ashworth Advocacy.
DataCite Canada Cyndie Found, CISTI Background : Who is CISTI, Definition of Data Research Data Management(RDM) – Benefits, Challenges Addressing.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
VO Sandpit, November 2009 Environmental Data Archival: Practices and Benefits crib sheet Graham Parton With many thanks to Dr.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Opening access to UK doctoral theses: the EThOS E-Theses Service 13 August 2014 Sara Gould.
ESDS resources for managing and analysing data Beate Lichtwardt Economic and Social Data Service UK Data Archive Research Method Festival, Oxford 1 July.
ESDS - Support and resources Beate Lichtwardt, ESDS/UKDA British Library Conference Centre, London 9 March 2009.
ODIN – ORCID and DATACITE Interoperability Network Presentation to S&C Open House January 2013 John Kaye – British Library Funded by The European Union.
1 ARRO: Anglia Ruskin Research Online Making submissions: Benefits and Process.
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
It’s the data that makes a paper Joerg Heber Executive Editor Nature Communications.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
|| Barbara Hirschmann1 Establishing a DOI service for Switzerland’s university and research sector.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Data Citation Implementation Pilot Workshop
Building Capacities for Establishment of Social Science Digital Data Archives Aleksandra Bradić-Martinović, Institute of Economic Sciences, Belgrade Achievements.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
Ingest – Acquisition and deposit Irena Vipavc Brvar ADP SEEDS Workshop I Belgrade, October.
NRF Open Access Statement
Open Exeter Project Team
An Approach to Software Preservation
WHY? - Found initiative while case statement preparation
ACS 2016 Moving research forward with persistent identifiers
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
KIOS Open Knowledge: A pillar for excellence
CNI Spring 2010 Membership Meeting
OpenML Workshop Eindhoven TU/e,
Introduction of KNS55 Platform
DATA LIFECYCLE & DATA MANAGEMENT PLANNING
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Research Data Management
Jisc Research Data Shared Service (RDSS)
Dataverse for citing and sharing research data
Presentation transcript:

PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE ECONOMIC AND SOCIAL DATA SERVICE UNIVERSITY OF ESSEX ………………………………..……………………. BL Datacite Workshop (No. 1): An Introduction to Data Citation and Datacite, 25 MAY 2012, London

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHY CITE DATA? It’s a vital part of the scientific research process Acknowledges the researcher’s sources Gives data creators, authors and data curators proper credit when their work is reused Aids scientific replication Provides permanent and reliable information on data sources produced and used in research Facilitates data resource discovery and access Helps track the use and impact of data collections

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR APPROACH TO CITATION Required by our user agreement (End User Licence) for many years:

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR APPROACH TO CITATION Should include enough information to ensure the exact version can be located University of Essex. Institute for Social and Economic Research and National Centre for Social Research, Understanding Society: Wave 1, [computer file]. 2nd Edition. Colchester, Essex: UK Data Archive [distributor], November SN: Different from an acknowledgement A general statement giving credit to the source and distributor Not an acceptable method of citing data

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DEVELOPING OUR METHODOLOGY Our ‘data collections’ are not digital objects Need to capture changes made to data Versioning data in a commonly understood manner Rule-based but human mediated (in defining a ‘significant’ or ‘high impact’ change) Use structured data so machine-actionable Integrate processes with: Digital preservation activities Current infrastructure / work flows Desire to ‘get it right first time’

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CHANGES TO DATA Approx. 15% UK Data Archive data collections are altered within first year after first publication Some data collections are issued as new editions: Changes to data/variables Adding new ‘waves’ in a data series Regrossing of a data series Changes to documentation

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE RECORDING SIGNIFICANT CHANGE We have distinguished between major and minor changes to a data collection High impact vs. low impact Largely social science users want most recent data for for research, but information about earlier versions of data must be available … and we should be making earlier versions available … coming soon

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE MINOR CHANGES – LOW IMPACT Publication reference added Correction of spelling in variable labels Small changes in variable labels Removal of (erroneously supplied) admin variables Correction of spelling in metadata Minor changes in documentation New index terms Additional documentation added (non-fundamental) Change in access conditions

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE MAJOR CHANGES – HIGH IMPACT New variable added New labels/value codes added Weighting variables reconstructed Wrong data supplied (e.g., March not April) Mis-coded data (e.g., Don’t know/Refused confused) Change in format (file migration) Significant changes in documentation Change in access conditions

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DEFINING AN INSTANCE Concept of an instance to denote a changed collection Internal change during ingest process (unreleased)  new internal instance Low impact change (released)  new external instance with unchanged PI High impact change (released)  new external instance and new PI

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE The data publisher registers and obtains DOIs from a DataCite member, e.g. the British Library CREATING OUR DOIs We mint and update DOIs with our metadata management infrastructure We use DataCite' s application programming interface (API) to mint DOIs In 2011 we started working with the British Library and DataCite to develop a permanent, reliable method of citing our data collections

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHAT WAS OUR SOLUTION? Original solution DOI allocated to core metadata (title, etc.) relating to a data collection Problem: even titles can change Final solution DOI allocated to metadata relating to each external instance (metadata record) of a data collection DOIs resolve to “jump” page pointing to all external instances New DOI = High Impact change, with explicit logging

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE THIS IS WHAT THAT LOOKS LIKE DOI: SNnnnn/01 DOI: SNnnnn/03 DOI: SNnnnn/02 SNnnnn Survey Waves 1-13 SNnnnn Survey Waves 1-14 SNnnnn Survey Waves 1-15 Instance-specific data and metadata Instance-specific data and metadata Instance-specific data and metadata Jump page

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CREATING AND UPDATING DOIs New catalogue record mint new DOI through DataCite update DOI change log create new citation file Update catalogue record enter high/low impact changes create/update DOI through DataCite update DOI change log create new citation file where high impact change

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DOI FORMAT AND VERSIONING ………………………………………………………………………….………………………………………………………….…… Archive readable identifier Resource type identifier Resource identifier Resource version

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR DOI METADATA

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE USING DOIs DOI links to an anchor on the jump page Citation file or the most recent DOI address Previous citations and data available on request only

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CHALLENGES FOR THE FUTURE Citing parts (fragments) of data collections single files subsets of quantitative data files extracts of textual data Creating relationships between different objects research outputs (articles) and research inputs (data) research outputs (data) and research outputs (data) any output and researcher/institution/funding information

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE LINKING RESOURCES VIA METADATA Institutional Repository Research Council Repository Specialist Data Repository Discovery Portal Metadata Stores research output (article) cites data research output (data) cites article User finds article and relevant data regardless of location

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE RAISING AWARENESS IN THE SOCIAL SCIENCES ESRC funding for short-term project Aim to educate and inform best practice in citing research data Targeting audiences Professional organisations Academic publishers and journal editors Researchers and postgraduate students Key activities Brochure: Data citation principles for social sciences Open letter from ESRC Events with BL DataCite, JISC and PI community Outreach also through Doctoral Training Centres

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE ACKNOWLEDGEMENTS Matthew Woollard, ESDS/UKDA Susan Cozzalino, ESDS/UKDA John Shepherdson, ESDS/UKDA

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CONTACT UK DATA ARCHIVE UNIVERSITY OF ESSEX WIVENHOE PARK COLCHESTER ESSEX CO4 3SQ ……………..…..……………………….. T +44 (0) E Economic and Social Data Service University of Essex Wivenhoe Park Colchester Essex CO4 3SQ ……………..…..……………………….. T +44 (0) E