A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Evolution or revolution? The changing data landscape Dr.

Presentation transcript:

Evolution or revolution? The changing data landscape. Dr Liz Lyon, Associate Director, UK Digital Curation Centre; Director, UKOLN, University of Bath, UK. 3rd DCC Regional Roadshow, Glasgow, June. This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0

Data sets are becoming the new instruments of science Dan Atkins, Univ Michigan

Digital data as the new special collections? Sayeed Choudhury, Johns Hopkins

Research data: institutional crown jewels?

Perspectives
Environmental scan
– Scale and complexity
– Infrastructure
– Open science
Policy
– Funders
– Institutions
– Ethics & IP
Practice Challenges
– Storage
– Incentives
– Costs & Sustainability

Surfing the Tsunami Science: 11 February 2011

"I worry there won't be enough people around to do the analysis." Chris Ponting, University of Oxford. The cost of sequencing DNA has taken a nosedive... and is now dropping by 50% every 5 months. A single sequencer can now generate in a day what it took 10 years to collect for the Human Genome Project. The 1000 Genomes Project generated more DNA sequence data in its first 6 months than GenBank had accumulated in its entire 21-year existence.
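To make the halving figure quoted above concrete, here is a minimal sketch of the arithmetic. It assumes the 50%-every-5-months rate holds steadily, which the slide does not claim beyond the present moment. The function name and its parameters are illustrative only.

```python
def remaining_cost_fraction(months: float, halving_period_months: float = 5.0) -> float:
    """Fraction of the starting cost left after `months`,
    assuming the cost halves once per `halving_period_months`."""
    return 0.5 ** (months / halving_period_months)


if __name__ == "__main__":
    # Print the remaining fraction at a few illustrative time points.
    for months in (5, 10, 12, 24):
        fraction = remaining_cost_fraction(months)
        print(f"{months:>2} months: {fraction:.3f} of the starting cost")
```

Under that assumption, sequencing would cost a little under a fifth of its starting price after one year.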

PDB GenBank UniProt Pfam Spreadsheets, Notebooks Local, Lost High throughput experimental methods Industrial scale Commons based production Publicly data sets Cherry picked results Preserved CATH, SCOP (Protein Structure Classification) ChemSpider Data collections Slide: Carole Goble

Complexity challenges Data pipelines Visualise: Cytoscape Workflow: Taverna

Distributed gene expression & clinical traits data Workflows capture the complex model construction process Derive large-scale bionetwork models Use to predict disease patterns

Structural Sciences Infrastructure

Infrastructure Roadmap Cross Organisations

Infrastructure Roadmap Cross Disciplines

Infrastructure Roadmap Open Science

Citizens getting involved in science

Citizen as scientist

Classify galaxies…

Working with academics

Validate results data and publish

Patients Participate! Bridging the Gap. Feasibility pilot study: Stem cell research; Develop Use Cases; Deliver advocacy, guidance; Report & Recommendations. JISC funding. Citizen-patients producing crowd-sourced lay summaries of UK PubMed Central papers.

Policy

Funder Policy

EPSRC Expectations : implications for HEIs

NSF-OCI TASK FORCE on Data and Visualization : Report

INCREMENTAL Project Institutional perspective Creating & organising data Storage and access Back-up Preservation Sharing and re-use The majority of people felt that some form of policy or guidance was needed....

Institutional Policy Article in next issue Int J Digital Curation

Institutional Policy

Policy Summary from DCC

Policy summary from ANDS

International collaboration around the DCC DMPOnline tool

"While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice... using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines." INCREMENTAL Project. Data sharing was more readily discussed by early career researchers.

Alzheimer's Disease Neuroimaging Initiative: a unique (open) $60M partnership between NIH, FDA, universities and drug companies. "It was unbelievable. It's not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately." Dr John Trojanowski, University of Pennsylvania

Data is headline news JISC FoI FAQ

P4 medicine: Predictive, Personalised, Preventive, Participatory. Leroy Hood, Institute for Systems Biology. Your genome is the basis for your medical record.

Open data and ethics Buy a DIY kit? Share your data?

Open data and ethics Bring your genes to CAL UC Berkeley personalised medicine initiative in 2010 >700 new students have submitted a genetic sample and a consent form Aggregate analyses for three genes related to nutrition Constrained by State Law Implications for UK HE students & staff?

Policy Gaps... Is Policy disconnected from Practice?
– Data Sharing
– Data Licensing
– Ethics and Privacy
– Citizen Science & Public Engagement
– Data Storage, Selection & Appraisal
– Data Citation and Attribution

"Departments don't have guidelines or norms for personal back-up and researcher procedure, knowledge and diligence varies tremendously. Many have experienced moderate to catastrophic data loss." Incremental Project Report, June

Data storage... The case for cloud computing in genome informatics. Lincoln D Stein, May 2010
– Scalable
– Cost-effective (rent on-demand)
– Secure (privacy and IPR)
– Robust and resilient
– Low entry barrier / ease-of-use
– Has data-handling / transfer / analysis capability
Cloud services?

Your data in the cloud

HEFCE UMF cloud infrastructure model: new DCC role. Diagram labels: Janet Brokerage & Connectivity Services; Common Cloud Service Bus (CSB); JISC Community Cloud Consortium; Eduserv; MIMAS; Other Public Clouds; Amazon AWS; Microsoft Azure; Private Clouds; University A to University G; Community Services; EduBox; Disaster Recovery; VM launch pad; DCC Services; Access Control; …

Incentivising data management

Beyond the PDF Workshop, January 2011 Concept of reproducibility Executable papers Data papers Links to data, workflows, analyses (GenePattern) within a document Post-publication peer review Alternative impact metrics : downloads, slide reuse, data citation, YouTube views La Jolla Manifesto : guiding principles for digital scholarship Jodi Schneider, Ariadne, Issue 66, January 2011

DataCite sagecitedemorepository Data Produces Register Generate landing page for data DOIs Mint DataCite API Google API Resolve to landing page Taverna workflow. The relationships between data via DataCite DOIs with tools are captured by the provenance (OPM) produced by Taverna. Workflow metadata: for referring to data reported in the provenance? Slide: Peter Li

KRDS

Research Outputs Citations, References User registration data; Instrument allocation data etc. Comments, annotations, ratings etc. Risk assessment data; other sample data Process & Analyse Derived Data Research Concept and/or Experiment Design Start Project Peer-review Proposal Conduct Experiment Generate, Create, & Collect Raw Data Check & Clean Raw Data Interpret & Analyse Results Data Archive, Preservation & Curation (OAIS conformant; Representation Information etc.) IPR, Embargo & Access Control Discover, Access, Validate, Reuse & Repurpose Data Publish Research Results Data Derived Data Processed Data Raw Data Documentation, Metadata & Storage (Reference, Provenance, Context, Calibration etc.) Acquire Sample Write Proposal (include DMP) Scholarly Knowledge Write Usage Report Research Activity Administrative Activity Curation Activity Information Flow KEY: Peer Review Prepare Manuscript Prepare Supplementary Data Publications Database Publication Activity An Idealised Scientific Research Activity Lifecycle Model Appraisal & Quality Control Programs (generate customised software) Papers, articles, presentations, reports An Idealised Scientific Research Data Lifecycle Model

KRDS/I2S2 Project Extending the Benefits Framework Developing Value Chain and Impact Analysis tool Applying to different domains Workshop South Bank Univ, London 12 July KRDS Activity Model Benefits & Metrics Use Case 1 : National Crystallography Service Use Case 2 : Researcher in the lab

Thank you… 7th International Digital Curation Conference, Dec 5-7, Bristol