A DATACITE CASE STUDY FROM THE UK DATA ARCHIVE …………………………………………………………………………………………………… TOM ENSOM …………………….…………………………….… UK DATA SERVICE UK DATA ARCHIVE.

Slides:



Advertisements
Similar presentations
Reconciling the sharing of research data with ethical review for research with people as participants Dr Veerle Van den Eynden UK Data Archive Data support.
Advertisements

UK DATA ARCHIVE Louise Corti, ODAF April UK Data Archive an internationally-renowned centre of expertise in data acquisition, preservation, dissemination.
Building Repositories of eprints in UK Research Universities Bill Hubbard SHERPA Project Manager University of Nottingham.
ESDS user support materials and resources: how to use them Support Services Royal Statistical Society, London 13 February 2009.
Reconciling the sharing of research data with ethical review for research with people as participants Veerle Van den Eynden UK Data Archive Data Support.
13 February 2009ESDS – whats in it for librarians? Royal Statistical Society The strange case of the local data librarian - a peculiarly Edinburgh perspective!
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
Accessing the MCS via the Economic and Social Data Service Jack Kneeshaw MCS workshop 10 November 2004 ESDS Longitudinal.
Depositing Data for Archiving Libby Bishop ESDS Qualidata, University of Essex Changing Families, Changing Food Meeting University of Sheffield 15 March.
Data management, data sharing and the activities of the UKDA Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009.
The Economic and Social Data Service (ESDS) Karen Dennison, Support Services Manager, UK Data Archive April 2008.
Access to Economic and Social Data via the UK Data Archive Jack Kneeshaw UKDA.
ESDS - a new service Kevin Schürer, Director, ESDS/UKDA.
Accessing the MCS via the Economic and Social Data Service Jack Kneeshaw MCS workshop 23 June 2005 ESDS Longitudinal.
Accessing the NCDS and BCS70 via the Economic and Social Data Service Jack Kneeshaw NCDS/BCS70 workshop 27 October 2004 ESDS Longitudinal.
Accessing the NCDS and the BCS70 via the Economic and Social Data Service Jack Kneeshaw NCDS/BCS70 workshop 21 February 2007 ESDS Longitudinal.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
Economic and Social Data Service June What is the ESDS? national service supporting the archiving, dissemination and use of social and economic.
Accessing the UK Longitudinal Studies via the ESDS Jack Kneeshaw UK Data Archive/Economic and Social Data Service 21 June 2004 ESDS Longitudinal.
The Economic and Social Data Service (ESDS) Karen Dennison UK Data Archive Improving access to government datasets 18 January 2007.
Accessing the MCS via the Economic and Social Data Service Jack Kneeshaw and Alasdair Crockett MCS workshop 20 November 2003 ESDS Longitudinal.
Learning and Teaching with Real Data. Today Organised by Economic and Social Data Service (ESDS) –ESDS Government –ESDS Longitudinal –ESDS International.
ESDS Resources Vanessa Higgins ESDS Government Centre for Census and Survey Research University of Manchester.
KRDS BENEFITS FRAMEWORK, VALUE- CHAIN AND BENEFIT ANALYSIS TOOLS: UK DATA ARCHIVE CASE STUDY …………………………………………
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
A centre of expertise in digital information management UKOLN is supported by: Research Data & Institutions Roles & Responsibilities? Dr.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
MANAGING YOUR DATA WELL …………………………………………
A deepening of training needs in digital curation Claudia Engelhardt Framing the digital curation curriculum Florence, 6-7 May 2013.
Dr. Markus Quandt GESIS – Leibniz-Institute for the Social Sciences Workshop: Persistent Identifiers for the Social Sciences University Club, Bonn, February.
Update on Data Publishing With Dataverse
Connecticut State Data Center at the Map and Geographic Information Center - MAGIC Connecticut State Data Center Data Collaborator for Planning, Analysis,
Working in collaboration with data centres Elizabeth Newbold, The British Library Presented at: DataCite Annual Conference Nancy France August 25, 2014.
BEST PRACTICE FOR DATA SHARING ……………………………………………………
Versioning Requirements and Proposed Solutions CM Jones, JE Brace, PL Cave & DR Puplett OR nd April
Discove r Humanities and Social Science Electronic Thesaurus - HASSET Faceted search HASSET is the subject thesaurus that the UK Data Service uses to index.
DATA LIFECYCLE & DATA MANAGEMENT PLANNING ……………………………………………………………………………………………………………………………….…………………………….. ……………………………………………………………......…... RESEARCH DATA.
Managing sensitive data and authorship in Humanities and Social Sciences ODIN conference, Cologne October 2013 Louise Corti Collections Development and.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
DATA MANAGEMENT SUPPORT FOR RESEARCHERS …………………………………………
DOI Registration for Social and Economic Data da|ra Brigitte Hausstein GESIS Leibniz-Institute for the Social Sciences, Berlin.
The DSpace Course Module – An introduction to DSpace.
DataCite Canada Cyndie Found, CISTI Background : Who is CISTI, Definition of Data Research Data Management(RDM) – Benefits, Challenges Addressing.
Data Citation: the next big thing… ?!?! 1 Victoria University 20 Nov
APARSEN WP22 Identifiers and Citability APARSEN WP22 Identifiers and Citability Some key results Fondazione Rinascimento Digitale Emanuele Bellini, Chiara.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
ESDS resources for managing and analysing data Beate Lichtwardt Economic and Social Data Service UK Data Archive Research Method Festival, Oxford 1 July.
VIVO and Scholarly Repositories: Synergistic Opportunities.
NOAA Data Citation Procedural Directive 8 November 2012 DAARWG.
4 way comparison of Data Citation Principles: Amsterdam Manifesto, CoData, Data Cite, Digital Curation Center FORCE11 Data Citation Synthesis Group Should.
Options for customising DMPonline Sarah Jones Digital Curation Centre, Glasgow DMPonline workshop, 9-10 November.
Data Citation Implementation Pilot Workshop
Building Capacities for Establishment of Social Science Digital Data Archives Aleksandra Bradić-Martinović, Institute of Economic Sciences, Belgrade Achievements.
PERSISTENT IDENTIFIERS FOR THE UK: SOCIAL AND ECONOMIC DATA …………………………………………………………………………………………………… LOUISE CORTI …………………….…………………………….… UK DATA ARCHIVE.
1 Digital Object Identifiers Update ESIP Data Stewardship Committee Meeting May 16, 2016 Presenters: Nate James, ESDIS Lalit Wanchoo, ADNET Systems Inc.
ODIN – ORCID and DATACITE Interoperability Network ODIN: Connecting research and researchers Sergio Ruiz - DataCite Funded by The European Union Seventh.
ACS 2016 Moving research forward with persistent identifiers
Linking persistent identifiers at the British Library
CNI Spring 2010 Membership Meeting
Experiences of the Digital Repository of Ireland
ESDS resources for managing and analysing data
OpenML Workshop Eindhoven TU/e,
The Bodleian Libraries
Mission DataCite was founded in 2009 as an international organization which aims to: establish easier access to research data increase acceptance of research.
Dataverse for citing and sharing research data
Presentation transcript:

A DATACITE CASE STUDY FROM THE UK DATA ARCHIVE …………………………………………………………………………………………………… TOM ENSOM …………………….…………………………….… UK DATA SERVICE UK DATA ARCHIVE UNIVERSITY OF ESSEX ………………………………..……………………. C4D WORKSHOP, JULY 2013, LONDON

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHO WE ARE Established in years of selecting, curating, preserving and providing access to social science data 6,000 datasets in the collection Over 25,000 registered users Data and data support services for higher and further education for research, teaching and learning Have been registered to ISO (information security standard) since June 2010

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR SERVICES UK Data Archive itself a department of the University of Essex Distributed service established 1 January 2003 called the Economic and Social Data Service (ESDS) New five-year UK Data Service from 2012

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHAT WE DO Research & development, innovation Promoting best practice in data curation Raise standards in data security and awareness of ethical/legal issues Raise standards in data management Data management hub We provide guidance to ESRC researchers and anyone else who asks

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WE SUPPORT RESEARCHERS Popular training materials Managing and Sharing Guide Training Resources Website: Bespoke training events Large and small scale workshops

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE ENGAGEMENT WITH RDM COMMUNITY Recently completed JISC Managing Research Data project with University of Essex Cross support service, departmental engagement Piloted an RDM infrastructure manage/projects/rd-essexhttp:// manage/projects/rd-essex Outputs of value to RDM community: Metadata profile for institutional data repositories Research data plugin for EPrints

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHY CITE DATA? It’s a vital part of a rigorous research process: Acknowledges researcher’s sources Gives data creators, authors and data curators proper credit when their work is reused Facilitates data resource discovery and access Helps track the use and impact of data collections

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR APPROACH TO CITATION Required by our user agreement (End User Licence) for many years:

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR APPROACH TO CITATION Should include enough information to ensure the exact version can be located “University of Essex. Institute for Social and Economic Research and National Centre for Social Research, Understanding Society: Wave 1, [computer file]. 2nd Edition. Colchester, Essex: UK Data Archive [distributor], November SN: 6614.” No widely agreed standard citation format yet! Version information crucial

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE PERSISTENT IDENTIFERS Persistent Identifiers (PIDs) A string identifying a clearly defined digital object Persistence must mean enduring Identifiers must be unique PIDs have been attached to scientific publications for some time Next logical step: data Also being applied to other entities e.g. people via ORCID system

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CHANGES TO DATA Our ‘data collections’ are not discrete digital objects Approx. 15% UKDA data collections are altered within first year after publication Versioning - we need to distinguish between major and minor changes to a data collection Integrate processes with: Digital preservation activities Current ingest infrastructure / workflows

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE MINOR CHANGES – LOW IMPACT Publication reference added Correction of spelling in variable labels Small changes in variable labels Removal of (erroneously supplied) admin variables Correction of spelling in metadata Minor changes in documentation New index (keyword) terms Additional documentation added (non-fundamental) Change in access conditions

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE MAJOR CHANGES – HIGH IMPACT Adding new ‘waves’ in a data series New variable added New labels/value codes added Weighting variables reconstructed Wrong data supplied (e.g., March not April) Mis-coded data (e.g., Don’t know/Refused mix-up) Change in format (file migration) Significant changes in documentation Change in access conditions

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DATACITE DOIs 2011: we started working with the British Library and DataCite to develop a permanent, reliable method of citing our data collections DataCite Founded by organisations from six countries Established a citation format for research data, including a DOI Works with data publishers, e.g. established data centres and institutional repositories

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE WHY DATACITE? Not the only choice, but right for us: DOI framework an international and persistent standard for identifying digital objects Familiar within the research data domain Centralised resolution service Metadata registry (and thus de facto standard) Discovery link up API – allowing for automation of minting process (but also manual if you prefer!)

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DOI FORMAT Readable archive identifier Resource identifier type Resource identifier Resource version / UKDA – SN – 1 – 1 Unique archive identifier

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE DOI VERSIONING …………………….……………………………………………………… High impact change /UKDA-SN /UKDA-SN-1-2 Low impact change /UKDA-SN-1-1 Increments major version – new DOI Increments minor version - internal

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE New data collection ‘ingested’ Structured DOI ‘created’ New change log New citation file CREATING A NEW DOI DataCite API sends back an approval Flagged behind the scenes Minimal DataCite metadata inc. requested DOI pushed to DataCite metadata store via API

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE Minimal DataCite metadata inc. requested DOI pushed to DataCite metadata store via API DataCite API sends back an approval Flagged behind the scenes High impact change to data collection Incremental DOI version ‘created’ Update change log New citation file UPDATING A DOI – HIGH IMPACT

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE Minimal DataCite metadata pushed to DataCite metadata store via API Low impact change to data collection Update change log UPDATING A DOI – LOW IMPACT DataCite API sends back an approval Flagged behind the scenes

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE THE END RESULT… DOI: SN-####-1 DOI: SN-####-3 DOI: SN-####-2 SN#### Survey Waves 1-13 SN#### Survey Waves 1-14 SN#### Survey Waves 1-15 Instance-specific data and metadata Instance-specific data and metadata (current) Instance-specific data and metadata Jump page (= change log)

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE OUR DOI METADATA

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CHALLENGES FOR THE FUTURE Citing parts (fragments) of data collections single files subsets of quantitative data files extracts of textual data Still uncertainty over where exactly research data should go – IR, Subject Specific Repository, Data Journal? Who should be minting DOIs? Avoid assigning multiple identifiers to an object

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE ESRC’s CITATION AWARENESS GUIDE

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE ACKNOWLEDGEMENTS Thanks to the following UKDA/UKDS staff for their assistance in putting this together: Matthew Woollard Louise Corti John Payne Matthew Brumpton Sharon Bolton

……………………………………………………………………………………………………………………………….…………………………… ………………………………………………………………………………………………………………………………………………………………… UK DATA ARCHIVE CONTACT TOM ENSOM UK DATA ARCHIVE UNIVERSITY OF ESSEX WIVENHOE PARK COLCHESTER ESSEX CO4 3SQ ……………..…..……………………….. T +44 (0) E