The MRC Research Data Gateway

Slides:



Advertisements
Similar presentations
1 Statistics Norway Information Architecture – some challenges ODaF meeting, Colchester April 2008 Rune Gløersen Director Department for IT and.
Advertisements

April 2010 MRC Data Sharing Policy Peter Dukes Policy Lead – Data Sharing & Preservation.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
Accessing the MCS via the Economic and Social Data Service Jack Kneeshaw and Alasdair Crockett MCS workshop 20 November 2003 ESDS Longitudinal.
Meeting Disciplinary Challenges in Research Data Management Planning – March 23 rd 2012 Data Management Planning for Secure Services (DMP-SS) † Tito Castillo,
CESSDA Question Databank Tender, results and future Maarten Hoogerwerf, CESSDA expert seminar 2009.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Self-archiving at Southampton a case study University of Cambridge 10 January Wendy White Hartley Library University of.
MEDIN Standards M. Charlesworth and the MEDIN Standards Working Group.
Spatial Information Integration Services (SIIS) ISO/TC211 Workshop on Standards in Action Adelaide, South Australia October 2001 Mr. Neil Sandercock, SA.
December 2008 MRC Data Support Services (DSS) Chris Morris 13 th February 2009 Sharing Research Data: Pioneers, Policies and Protocols The seventh cat.
1 Uppsala University Library Eva Müller Peter Hansson Stefan Andersson Uwe Klosa Electronic Publishing Centre Krister Östlund Waller project.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Scottish Information Landscape An overview from SLIC Elaine Fulton Director Scottish Library and Information Council
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
Social Science Data and ETDs: Issues and Challenges Joan Cheverie Georgetown University Myron Gutmann ICPSR – University of Michigan Austin McLean ProQuest.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Nationally Significant Databases and Collections Providers’ Group Emma Kelly Environmental Information Advisor Environmental Monitoring and Reporting Team.
December 2010iTEC - Designing the future classroom1 אלף כיתות משתתפות במיזם לעיצוב כיתת הלימוד העתידית – iTEC דב וינר iTEC – Designing the future classroom.
The United States Health Information Knowledgebase: Federal/State Initiatives An AHRQ Research Project J. Michael Fitzmaurice, PhD, AHRQ Robin Barnes,
Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009.
Metadata in a distributed information environment: Interoperability as recombinant potential Lorcan Dempsey OCLC/SCURL pre-IFLA conference, 15/16 Aug 02.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Using Joinup as a catalogue for interoperability solutions March 2014 PwC EU Services.
Experts Workshop on the IPT, v. 2, Copenhagen, Denmark The Pathway to the Integrated Publishing Toolkit version 2 Tim Robertson Systems Architect Global.
FEA DRM Management Strategy Presented by : Mary McCaffery, US EPA.
InSPIRe Australian initiatives for standardising statistical processes and metadata Simon Wall Australian Bureau of Statistics December
From small beginnings: Developing collection level description Mapping the Information Landscape Showcase day British Library Conference Centre, London,25.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Report of the Architecture and Data Committee (ADC) R.Shibasaki (ADC, Japan)
Metadata Training for SEFSC Science Staff Part Two.
PDS4 Demonstration Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
PRESENTATION OF THE TEST REGISTRY AND REPOSITORY (TRR) ON JOINUP 23 OCTOBER 2015 Roch Bertucat, ENGISIS.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
International Planetary Data Alliance Registry Project Update September 16, 2011.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
Beyond the Repository: Research Systems, REF & New Opportunities William J Nixon Digital Library Development Manager.
Metadata standards Using DDI to Inform, Organize, and Drive Survey Data Production.
1 The XMSF Profile Overlay to the FEDEP Dr. Katherine L. Morse, SAIC Mr. Robert Lutz, JHU APL
UK DP Needs Assessment Project overview 2 November 2005 Martin Waller.
Making FAAM Flights Discoverable
Patient and Public Involvement and Engagement in Research (PPIE)
Senior Data and Support Services Officer
Building A Repository for Digital Objects
eInfraCentral Portal User requirements and features
DataNet Collaboration
Digital library and OR 21 October 2002 Members’ Council
OER Commons Hubs A Primer
Standards for success in city IT and construction projects
Data catalogues and the data repository ADMIRe JISC MRD
Martin Tuchyňa Towards an INSPIREd e-reporting & INSPIRE priority datasets in Slovakia INSPIRE conference ,
How to Design and Implement Research Outputs Repositories
Interactive tools for large-scale social surveys
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Research Data Management
HingX Project Overview
EDDI12 – Bergen, Norway Toni Sissala
Institutional Repositories
Discovery of EDMI compliant data resources and metadata catalogues
JISC and SOA A view Robert Sherratt.
OpenDOAR and ROAR RSP Services Day, Nottingham, 23rd Apr.2008
Questasy: Documenting and Disseminating Longitudinal Data Online with DDI 3 Edwin de Vet 5/21/2019.
The role of metadata in census data dissemination
Presentation transcript:

The MRC Research Data Gateway Phil Curran Medical Research Council Seconded part-time to work for MRC Data Support Service on the development of the Gateway. My other role is as Head of Data Services for the MRC Lifelong Health & Ageing Unit at UCL I am also a member of the Technical Committee for the CLOSER USP

Goals of the Data Support Service Project Project Overview: To provide researchers with a web-based data discovery “catalogue” of research datasets of potential value for new research – the MRC Gateway To provide a web-based library of standards and good practice materials for data management and curation To gather intelligence on data sharing to inform decision making and guidance for policy makers and researchers To gather quantitative data to help assess the benefits and costs of supporting data discovery services

MRC Gateway Population Health Sciences Metadata Repository Part of a larger project – “Data Support Service” Longitudinal and Cohort Studies Study level metadata on 34 studies Variable level metadata on 5 of the 34 studies Metadata on > 45K variables from the 5 case studies Programme to add “variable level” metadata on the other 29 studies This presentation is about experiences gained with the first 5 case studies The Gateway is not a finished system; it is still under active development As of now the Gateway metadata repository contains records on 34 studies with variable level metadata on 5 of the 34 We have already been through a phase of evaluating the core metadata model and eliciting user feedback on using the Gateway. The feedback has highlighted several areas where the user interface could be improved. We are also aiming to add variable level metadata on more of the 29 studies which currently have only study level information. By Spring 2015 our aim is to continue to add variable level metadata for more studies and to provide mechanisms for ingesting and exporting study metadata in a form conforming to the DDI-L international standard. This will ensure interoperability with the CLOSER USP and probably, all future systems that provide data discovery facilities for longitudinal or cohort studies.

First Five Case Studies Avon Longitudinal Study of Parents and Children (ALSPAC) National Survey of Health and Development (NSHD) Southampton Women’s Survey West of Scotland Twenty-07 Whitehall II

Gateway Platform Underlying database is ISO/IEC 11179 Metadata Registry Search Technologies used: Drupal Apache Solr Linux Design started in 2009 Prototype went live in 2010

Key Gateway Metadata Elements Study Data Collection Event Time Period Variable Subject Category Attachments (e.g. Questionnaires, Forms, etc.) (All these map onto DDI-L concepts)

Gateway Data Model

Gateway Security Model 3 levels of User Access Permissions Unregistered user – can search for and browse study level information. Registered user – can search for and browse study level and variable level information. Can create lists of “interesting” variables across all studies and export them as CSV files. Administrative user – can edit their study's metadata content in the directory.

Gateway User Interface

Gateway Search

Experience of Ingesting Metadata from the 5 Case Studies Each of the five case studies had its own data infrastructure. Consequently initial ingestion of metadata from studies was via bespoke scripts. This was time consuming and costly. Considerable effort required by study data managers to support metadata ingestion and maintenance. Result = initial metadata ingestion was not scalable. DDI3 arrived just in time! NSHD was first study to provide metadata in DDI-L form.

Plans for Gateway use of DDI-L The Gateway development plans include: A DDI-L import and export facility; The use of commercial and open source tools for harvesting metadata from studies in DDI-L form. Use of DDI-L will: Ensure the interoperability of the Gateway with other data discovery services; Protect the investment of the study teams and their funders in the construction of metadata; Enable federated structures of metadata maintenance and publishing; Increase the choice of systems and tools for data management teams.

Conclusions Metadata production/maintenance is the most resource intensive process in providing data discovery systems. Study data managers will increasingly need to publish metadata to a number of directory services. DDI-L has the potential to protect investment in metadata from the inevitable obsolescence of directory and other search platforms. Better tools are required to map metadata from relational databases to DDI-L. The complexity of DDI-L is a barrier to widespread adoption amongst the data management community. More data is needed on the resource requirements of metadata production/maintenance to inform funders.

Acknowledgements Medical Research Council Peter Dukes and Caroline Shriver Science and Technology Facilities Council Catherine Jones and Alastair Duncan The Gateway https://www.datagateway.mrc.ac.uk/