UKOLN is supported by: Adding value to open access research data: the eBank UK Project. Dr Liz Lyon, Director UKOLN, University of Bath, UK OAI4, CERN.

Slides:



Advertisements
Similar presentations
IATUL Porto, May 21, 2006 DOI and e-Science Dr Anne E Trefethen Oxford e-Research Centre
Advertisements

AHM, Nottingham, September eBank UK : linking research data, scholarly communication and learning. Dr Liz Lyon, UKOLN, University of Bath Dr Simon.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Crystal Structure EPrints: Source Through the Open Archive Initiative S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge.
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
Linking Data and Publications: the Chemistry Way Simon Coles School of Chemistry, University of Southampton, U.K. CLADDIER workshop.
S.J. Coles a*, J.G. Frey a, M.B. Hursthouse a, L. Carr b & C.J. Gutteridge b. a School of Chemistry, University of Southampton, UK.; b School of Electronics.
© S.J. Coles 2006 Digital Repositories as a Mechanism for the Capture, Management and Dissemination of Chemical Data Simon Coles School of Chemistry, University.
RCUK, Octiber Archiving research data and research publications. Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton Dr Simon.
© S.J. Coles 2006 eCrystals: A Route for Open Access to Small Molecule Crystal Structure Data Simon Coles School of Chemistry, University of Southampton,
UKOLN is supported by: Enhanced support for eScience: the role of Digital Libraries Digital Libraries Go eScience, ECDL, Alicante September 2006 Rachel.
A centre of expertise in digital information management UKOLN is supported by: Digital libraries and digital scholarship: changing roles.
A centre of expertise in digital information management UKOLN is supported by: Adding Value to Data and Information: Moving towards a Science.
UKOLN is supported by: Realising the scholarly knowledge cycle: The experience of eBank UK Dr Liz Lyon, UKOLN, University of Bath, UK CNI Task Force Meeting.
UKOLN is supported by: From research data to new knowledge: a lifecycle approach. Dr Liz Lyon, Director UKOLN, University of Bath, UK JISC/SURF/CNI Conference.
A centre of expertise in digital information management UKOLN is supported by: Curating the Scientific Record: The Challenges Ahead Dr.
Digital | Curation | Centre Adding value to open access research data: reflections on the process of data curation Dr Liz Lyon, DCC Associate Director.
UKOLN is supported by: Digital Repositories Roadmap: looking forward The JISC/CNI Meeting, July 2006 Rachel Heery Assistant Director R&D, UKOLN
Integrating research data into the publication workflow: eBank UK experience Rachel Heery, UKOLN, University of Bath
UKOLN is supported by: e-Research: trends, requirements and challenges Dr Liz Lyon, UKOLN, University of Bath, UK Cross Research Council ICT Conference.
UKOLN is supported by: Digital Libraries and e-Research: new horizons, new challenges? Dr Liz Lyon, Director UKOLN, University of Bath, UK 8 th International.
UKOLN is supported by: Digital Libraries and e-Research: a UK perspective on a changing landscape. Dr Liz Lyon, Director UKOLN, University of Bath, UK.
UKOLN is supported by: eBank UK : linking research data, scholarly communications and learning. Dr Liz Lyon, UKOLN, University of Bath, UK JISC CNI Conference.
UKOLN is supported by: Data, information and knowledge repositories: developing infrastructure to support the e-Research landscape. Dr Liz Lyon, Director.
JISC Joint Programmes Meeting eBank UK : linking research data, learning and scholarly communications. Dr Liz Lyon, UKOLN, University of Bath Dr.
UKOLN is supported by: Digital Library developments supporting eResearch Dr Liz Lyon, Director UKOLN, University of Bath, UK British Library, November.
A centre of expertise in digital information management UKOLN is supported by: Digital repositories as research infrastructure: a UK perspective.
Digital | Curation | Centre An Introduction to the UK Digital Curation Centre Dr Liz Lyon, DCC Associate Director Outreach Director, UKOLN, University.
A centre of expertise in digital information management UKOLN is supported by: British Academy e-Resources Policy Review: UKOLN Report.
Digital | Curation | Centre UK Digital Curation Centre An Introduction Dr Liz Lyon, Associate Director Outreach IACMST MED Forum, November 2005 Funded.
UKOLN is supported by: Emergent technologies & digitisation: the institutional impact. Liz Lyon & Kevin Edge VCs Retreat, October a.
UKOLN is supported by: Starting to explore the role of memory institutions within the social fabric of the new Web Dr Liz Lyon, UKOLN, University of Bath,
A centre of expertise in digital information management UKOLN is supported by: Data Informatics Top Ten : (for Libraries) Dr Liz Lyon,
Federation The eCrystals Federation Dr Simon Coles, University of Southampton, UK Dr Liz Lyon, UKOLN, University of Bath, UK Open Repositories 2008, University.
Federation eCrystals Federation: Open Repositories for Open Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
A centre of expertise in digital information management UKOLN is supported by: UK Perspectives on the Curation and Preservation of Scientific.
A centre of expertise in digital information management UKOLN is supported by: Changing Roles, Responsibilities and Relationships Dr Liz.
Federation eCrystals Federation: Open Repositories for Data-driven Science Dr Liz Lyon, UKOLN, University of Bath, UK Dr Simon Coles, University of Southampton,
A centre of expertise in digital information management UKOLN is supported by: Digital Futures for MLAs? A snapshot in real time. Dr Liz.
A centre of expertise in digital information management UKOLN is supported by: Memory institutions and the social fabric of the Web Dr.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
UKOLN is supported by: Enhancing access to research data: the challenge of crystallography Rachel Heery, Monica Duke, Michael Day UKOLN, University of.
© S.J. Coles 2006 Institutional Data Repositories for Chemistry Simon Coles School of Chemistry, University of Southampton, U.K.
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
A centre of expertise in digital information management UKOLN is supported by: Open Science and the Research Library: Roles, Challenges.
UKOLN is supported by: Developing e-Infrastructure to support new research and learning paradigms. Dr Liz Lyon, Director UKOLN, University of Bath, UK.
Digital | Curation | Centre Supporting Digital Curation to safeguard research data: adding value today and ensuring long-term access Dr Liz Lyon, DCC Associate.
EBank UK CCLRC Workshop February eBank and CCLRC Workshop February 2005 University of Bath.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
University of Southampton, U.K.
EPrints Workshop, January eBank UK: Dissemination of research data using EPrints Simon Coles, School of Chemistry, University of Southampton.
© S.J. Coles 2005 ACS 2005, San Diego Furthering Chemoinformatics through ‘Crystalloinformatics’ Simon J. Coles EPSRC National Crystallography Service.
© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K.
© S.J. Coles 2005 eChemInfo2005 Open Archives as a Route for Capture, Dissemination and Access to Chemical Data and Information Simon Coles School of Chemistry,
EBank UK: linking scientific data, scholarly communication and learning Michael Day and Rachel Heery UKOLN, University of Bath
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
UKOLN is supported by: Enhancing access to research data: the e-Science project eBank UK A centre of expertise in digital information management.
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
CombeDay Making Data Openly Available Simon Coles.
UKOLN is supported by: Library futures in the new research landscape. Dr Liz Lyon, UKOLN, University of Bath, UK CURL Members Meeting October 2004, London.
Joint Information Systems Committee Repositories Support Project Summer School 2008 Amber Thomas, JISC.
eCrystals Federation: Open Repositories for global Open Science
Realising the scholarly knowledge cycle:
JISC Joint Programmes Meeting 2005
Developing Institutional Data Repositories
eCrystals Federation: Open Repositories for global Open Science
Presentation transcript:

UKOLN is supported by: Adding value to open access research data: the eBank UK Project. Dr Liz Lyon, Director UKOLN, University of Bath, UK OAI4, CERN Geneva, October a centre of expertise in digital information management

OAI4, CERN Geneva, October Overview 1.e-Research & data-intensive science 2.Repository services & adding value Aggregation and linking: eBank UK Integration and workflows 3.Looking to the longer term: digital curation and preservation

1. e-Research & data-intensive science

OAI4, CERN Geneva, October Data Overload! How do we disseminate? EPSRC National Crystallography Service eScience - the data deluge

OAI4, CERN Geneva, October Diversity of data collections Very large, relatively homogeneous: Large-scale Hadron Collider (LHC) outputs from CERN Smaller, heterogeneous and richer collections: World Data Centre for Solar-terrestrial Physics CCLRC Small-scale laboratory results: jumping robots project at the University of Bath Population survey data: UK Biobank Highly sensitive, personal data: patient care records

OAI4, CERN Geneva, October Taxonomy of data collections Research collections: jumping robots Community collections: Flybase at Indiana (with UC Berkeley ) Reference collections: Protein Data Bank Source: NSF Long-Lived Digital Data Collections Draft report revised May 2005 Evolution……

OAI4, CERN Geneva, October Experience of data-sharing Large scale data sharing in the life sciences Draft Report June 2005 Sponsored by UK research funding bodies MRC, BBSRC, NERC, JISC, Wellcome Outcomes & recommendations –Importance of standards and good quality metadata –Require a data management plan –Work needed on vocabularies & ontologies –Awareness of archiving & long term preservation Position of research funders and policy makers?

OAI4, CERN Geneva, October 20058

OAI4, CERN Geneva, October Learning & Teaching workflows Research & e-Science workflows Aggregator services: national, commercial Repositories : institutional, e-prints, subject, data, learning objects Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Resource discovery, linking, embedding Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding The scholarly knowledge cycle. Liz Lyon, Ariadne, July This work is licensed under a Creative Commons License Attribution-ShareAlike 2.0Creative Commons License © Liz Lyon (UKOLN, University of Bath), 2005

OAI4, CERN Geneva, October Learning & Teaching workflows Research & e-Science workflows Aggregator services: eBank UK Repositories : institutional, e-prints, subject, data, learning objects Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Harvesting metadata Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Resource discovery, linking, embedding Deposit / self- archiving Peer-reviewed publications: journals, conference proceedings Publication Validation Data analysis, transformation, mining, modelling Resource discovery, linking, embedding Deposit / self- archiving Learning object creation, re-use Searching, harvesting, embedding Quality assurance bodies Validation Presentation services: subject, media-specific, data, commercial portals Resource discovery, linking, embedding

2. Repository services & adding value: the eBank UK Project

OAI4, CERN Geneva, October eBank UK Project Two key themes: –Open access to datasets –Linking research data to publications and to learning JISC-funded from September 2003: now in Phase 2 UKOLN at the University of Bath (lead), University of Southampton, University of Manchester Exemplar: e-Science testbed Combechem –Grid-enabled combinatorial chemistry / crystallography –National Crystallography Service Resource Discovery Network / PSIgate physical sciences portal

OAI4, CERN Geneva, October The hybrid project team UKOLN Michael Day Monica Duke Rachel Heery Traugott Koch Liz Lyon + Andy Powell Southampton Les Carr Simon Coles Jeremy Frey Chris Gutteridge Mike Hursthouse Andrew Milstead Manchester John Blunden-Ellis

OAI4, CERN Geneva, October Data Flow in eBank UK Submit Store/link Data files Metadata Present HTML Institutional repository eCrystals OAI-PMH Harvest (XML) Index and Search Present HTML eBank aggregator service Create Deposition Interface Local archive search interface Service Provider interfaces e.g. Subject Portal Deposit

OAI4, CERN Geneva, October CombeChem: An EPSRC pilot project X-Ray e-Lab Analysis Properties Properties e-Lab Simulation Video Diffractometer Grid Middleware Structures Database

OAI4, CERN Geneva, October Crystallography workflow RAW DATADERIVED DATARESULTS DATA Initialisation: mount new sample set up data collection Collection: collect data Processing: process and correct images Solution: solve structures Refinement: refine structure CIF: produce CIF (Crystallographic Information File) Validation: chemical & crystallographic checks Report: generate Crystal Structure Report

OAI4, CERN Geneva, October

OAI4, CERN Geneva, October A data repository entry

OAI4, CERN Geneva, October Access to the underlying data: complex objects ecrystals.chem.soton.ac.uk

OAI4, CERN Geneva, October Harvesting: OAIster

OAI4, CERN Geneva, October Aggregating: search & discover

OAI4, CERN Geneva, October Linking data to publications

OAI4, CERN Geneva, October Embedding in a science portal for student learners

OAI4, CERN Geneva, October Ontologies for discovery in an inter-disciplinary world Transform the list into an ontology Embed ontology into the deposition process Aggregators use keywords for linking with the broader literature Researchers use keyword ontology in search and discovery services

OAI4, CERN Geneva, October Persistent identifiers for data citation eBank use cases: depositor, author, service provider, reader, publisher, ? Schemes: DOI, Handle, ARK, PURL Global identification: express as http URIs Added value services: CrossRef, resolution service, integration (Globus), look-up service, ? Degree of trust or persistence Costs Future potential: political, ? Domain identifiers: International Chemical Identifier (InChI) codes

OAI4, CERN Geneva, October Publication & citation of scientific primary data project National Library for Science & Technology (TIB), University of Hanover, Germany STD-DOI Project DOI registry for datasets Data requirements: quality control, long-term curation, use DOI resolver Data publication agents: World Data Center Climate, GeoForschungsZentrum Potsdam Exemplar data citation: –Kamm, H; Machon, L; Donner, S (2004): Gas chromatography (KTB Field Lab), GFZ Potsdam. doi: /GFZ/ICDP/KTB/ktb-geoch-gaschr-p

OAI4, CERN Geneva, October Integration into crystallographic publishing practices Publishers seal of approval

OAI4, CERN Geneva, October Integration into chemistry research workflows R4L Repository for the Laboratory Project (JISC-funded) automated data capture from instrumentation, registration of results SMART TEA electronic Laboratory notebook + annotations Related sub-domains of chemistry: SPECTRa Project (JISC-funded) Research assessment (RAE) process?

OAI4, CERN Geneva, October Integration into the curriculum and e-Learning workflows MChem course Assess role in Undergraduate Chemical Informatics courses Pedagogic evaluation

3. Looking to the longer term: digital curation & preservation

OAI4, CERN Geneva, October For later use? In use now (and the future)? Repositories and digital curation Data preservationData curation StaticDynamic maintaining and adding value to a trusted body of digital information for current and future use

OAI4, CERN Geneva, October Assuring long term access to the research record Trusted digital repositories –Audit Checklist for Certification Draft Report –Research Libraries Group, August 2005 –RLG-NARA Taskforce –Defined criteria under 4 categories Organisation Functions, processes & procedures Designated community & usability Technologies & technical infrastructure UK Digital Curation Centre –1 st International DCC Conference presentations available –PV2005 Royal Society Edinburgh November Nov

Thank you. Questions?….. More information: UKOLN

OAI4, CERN Geneva, October ebank_dc record (XML) Crystal structure (data holding) Crystal structure report (HTML) Dataset Institutional repository eBank UK aggregator service ePrint UK aggregator service Subject service Deposit Harvesting OAI-PMH ebank_dc Harvesting OAI-PMH oai_dc Searching, linking and embedding Dataset dc:identifier dcterms:references Linking dc:type=CrystalStructure and/or Collection Model input Andy Powell, UKOLN. PSIgate portal Eprint oai_dc record (XML) dcterms:isReferencedBy dc:type=Eprint and/or Text eBank data model Eprint jump-off page (HTML) dc:identifier Eprint manifestation (e.g. PDF) Linking