Research Data Context Preservation in SCAPE

Slides:



Advertisements
Similar presentations
Grey Literature, Institutional Repositories and the Organisational Context Simon Lambert, Brian Matthews & Catherine Jones Business & Information Technology.
Advertisements

Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
Introduction to Planets Hans Hofman Nationaal Archief Netherlands Prague, 17 October 2008.
Supporting further and higher education Invitation to Tender A project to Support e-theses for UK Higher Education.
Publishing Data Catherine Jones Library Systems Development Manager, STFC Rutherford Appleton Laboratory CLADDIER workshop, Chilworth, Southampton, UK.
Towards an information model for I2S2
I2S2 - Infrastructure for Integration in Structural Sciences Cross-Institutional Pilot
I2S2 - Infrastructure for Integration in Structural Sciences Information Model Development Workshop RAL 11 th February 2010
ICAT + Information Model Brian Matthews Scientific Information Group E-Science Centre STFC Rutherford Appleton Laboratory
PaN-data WP7 - Integration Brian Matthews STFC-e-Science.
Superconducting Undulator Workshop Rutherford Appleton Laboratory 28 th & 29 th April 2014 Jim Clarke STFC Daresbury Laboratory.
Project Overview APA Conference 2012 ESA/ESRIN (Frascati), 6-7 November 2012 M. Albani (European Space Agency), Project Coordinator.
U.S. Department of Energy’s Office of Science Dr. Raymond Orbach February 25, 2003 Briefing for the Basic Energy Sciences Advisory Committee FY04 Budget.
Brian Matthews, CRIS 2002, 31/08/02 1 Accessing the Outputs of Scientific Projects Brian Matthews, Michael Wilson, Business & Information Technology Dept,
The Changing Face of Research Anthony Beitz DART Integration Manager.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Persistent Digital Archives and Library System (PeDALS) South Carolina Department of Archives and History.
Publication of facility investigations Brian Matthews Scientific Information Group Scientific Computing Department STFC Rutherford Appleton Laboratory.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Building Scalable Web Archives Florent Carpentier, Leïla Medjkoune Internet Memory Foundation IIPC GA, Paris, May 2014.
Shaping Future Science Introducing the National Science and Innovation Campuses.
Integrated e-Infrastructure for Scientific Facilities Kerstin Kleese van Dam STFC- e-Science Centre Daresbury Laboratory
Caring and Sharing Collaboration in Digital Curation outside North America Ross Harvey Simmons College, Boston Curation Matters: 17 June 2010.
Chipir A new neutron single event test facility at the UK’s ISIS Neutron and Muon Source.
Context and Linking in the Research Lifecycle CERIF and other standards Catherine Jones Scientific Information Group Scientific Computing Department STFC.
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
Alastair Duncan STFC Pre Coffee talk STFC July 2014 The Trials and Tribulations and ultimate success of parallelisation using Hadoop within the SCAPE project.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
SCAPE Rainer Schmidt SCAPE Training Event September 16 th – 17 th, 2013 The British Library Building Scalable Environments Technologies and SCAPE Platform.
Metadata for structural science Workshop on research metadata in context Nijmegen, 7–8 September 2010 Simon Lambert STFC e-Science UK.
Bill Roberts, PresDB 07 Database Preservation: A success story and an unsolved problem Bill Roberts 23 March 2007 PresDB, Edinburgh.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Data in Context Co-chairs: Brigitte Jörg, Keith Jeffery RDA 3rd Plenary, March, 26th - 28th, 2014 Dublin.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
PLANETS, OPF & SCAPE A summary of the tools from these preservation projects, and where their development is heading.
ATTRACT is a proposal for an EU-funded R&D programme for sensor, imaging and related computing devlopment Its purpose is to demonstrate the value of European.
How to gain access to European synchrotron facilities Liisa Porra, University of Helsinki / HUS.
Experimental Context, Publishing and Research Objects Brian Matthews STFC.
Data Preservation at Rutherford Lab David Corney 9 th July 2010 KEK.
Introduction to Research Data Management Joy Davidson and Sarah Jones Digital Curation Centre
Exploitation of ISS Scientific data EGI-Aparsen Workshop March Science Park– Amsterdam – The Netherlands Cooperative ISS Research data Conservation.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Usecases: 1.ISIS Neutron Source 2.DP for HEP Matthew Viljoen STFC, UK APARSEN-EGI workshop: preserving big data for research Amsterdam Science Park 4-6.
Setting up long term digital preservation DLM Forum Member Meeting Luxembourg 14-15th October 2015.
Archiving CAD in Archaeology: Ingest to Dissemination (or The ADS experience to date) Kieron Niven Archaeology Data Service, University of York, UK.
What is ATTRACT? A proposal has been made to the European Commission (EC) for a dedicated EC-funded program to develop new (ionizing) radiation sensor.
Virtual Repository Progress Lars Lindberg Christensen (ESA/ESO)
Capabilities and Programmes of STFC’s Accelerator Science & Technology Centre (ASTeC)
Enhancements to Galaxy for delivering on NIH Commons
An Approach to Software Preservation
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Scientific Computing Department
Building A Repository for Digital Objects
An Introduction to Tessella and The Safety Deposit Box Platform
Joseph JaJa, Mike Smorul, and Sangchul Song
A Web-enabled Approach for generating data processors
NRF Knowledge Management Corporate
VI-SEEM Data Repository
VI-SEEM Data Repository
EGI – Organisation overview and outreach
NFFA Europe.
EOSCpilot All Hands Meeting 8 March 2018 Pisa
Metadata for digital long-term preservation
ESciDoc Introduction M. Dreyer.
Brian Matthews STFC EOSCpilot Brian Matthews STFC
GISELA & CHAIN Workshop Digital Cultural Heritage Network
ESS policy for scientific data
Building a CMMI Data Infrastructure
STFC case study: PhD research graph
Presentation transcript:

Research Data Context Preservation in SCAPE Catherine Jones, Science and Technology Facilities Council, UK (STFC) IPres 2013: Lisbon

SCAPE: Scalable Digital Preservation SCAPE is an EU funded project (2011 – 2014) Exploring preservation issues with large collections of material. Three testbeds implementing the tools and Taverna workflows utilising the Hadoop platform built elsewhere in the project: Web archives Large Scale Digital Repositories Research Data Website http://www.scape-project.eu/

driving scientific research STFC Facilities – driving scientific research Neutron Sources Providing powerful insights into key areas of energy, biomedical research, climate, environment and security High Power Lasers Providing applications on bioscience and nanotechnology and demonstrating laser driven fusion as a future source of sustainable, clean energy Light Sources Providing new breakthroughs in medicine, environmental and materials science, engineering, electronics and cultural heritage We are a science-driven organisation, making it possible for a broad range of scientists to do the highest quality research tackling some of the most fundamental scientific questions. We provide access to world-class facilities in the UK including Neutron and Muon Sources Lasers Computational Science and Engineering Atmospheric, Astronomy and Space Science Synchrotron light sources and free electron lasers Materials Analysis Neutron Sources Providing powerful insights into key areas of energy, biomedical research, climate, environment and security. ISIS - Pulsed Neutron and Muon Source Target Station 2 High Power Lasers Providing applications on bioscience and nanotechnology Central Laser Facility Demonstrating laser driven fusion as a future source of sustainable, clean energy HiPER Light Sources Providing new breakthroughs in medicine, environmental and materials science, engineering, electronics and cultural heritage Diamond Light Source Limited (86%) European Synchrotron Radiation Facility (ESRF), Grenoble

Facilities Data Lifecycle Proposal Approval Scheduling Experiment Data storage Record Publication Facilities Data Lifecycle Subsequent publication registered with facility Data analysis Scientist submits application for beamtime Tools for processing made available Raw data filtered, and stored Facility committee approves application Scientists visits, facility run’s experiment Facility registers, trains, and schedules scientist’s visit http://code.google.com/p/icatproject/

Background – Research Data What are the scalability issues? STFC research data is complex rather than vast Each ISIS instrument generates files with different semantics – there are 35 different instruments.  Linking experimental data, publications and analysed data Links may to be different places for each dataset and ensuring that these remain resolvable is an intellectual challenge even at a small scale. Generating these links is a preservation activity in itself.

Investigation as a Research Object Raw Data :hasDataset :investigator Investigation #n DOI:STFC.xxx Derived Data :hasRelatedDataset :instrument :hasPublication Publications :hasPublication Own metadata format (Core Scientific Metadata Model CSMD) OAI-ORE W3C Prov ontology

Proposed architecture for Investigation Research Objects at STFC Grey: infrastructure/tools already in use Blue: tools which depend on local infrastructure Green: proposed generic tools.

Mock up of ISIS data journal showing investigation research objects

IRO builder under construction RO validator next tool for development Timetable IRO builder under construction RO validator next tool for development Hope to be able to use SCAPE Watch tool SCOUT for parts of this functionality

For more information, contact Catherine.jones@stfc.ac.uk Thanks For more information, contact Catherine.jones@stfc.ac.uk This work is funded by the EU within the SCAPE project. Other STFC staff who contributed to this work are: Alastair Duncan Vasily Bunakov Antony Wilson Shirley Crompton Brian Matthews