Presentation is loading. Please wait.

Presentation is loading. Please wait.

A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Acting as Advocate? Seven steps for libraries in the data.

Similar presentations


Presentation on theme: "A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Acting as Advocate? Seven steps for libraries in the data."— Presentation transcript:

1 A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Acting as Advocate? Seven steps for libraries in the data decade Dr Liz Lyon, Director, UKOLN, University of Bath, UK Associate Director, UK Digital Curation Centre IATUL Conference, Purdue University, June 2010. This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0

2 1.Scale, Complexity, Predictive Potential 2.Continuum of Openness 3.Citizen Science 4.Credentials, Incentives, Rewards 5.Institutional Readiness & Response 6.Data Informatics Capacity & Capability http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/publications.html#november-2009 Open Science at Web-Scale Consultation: Write-To-Reply Keynote Presentations: eResearch Australasia Nov 2009 CNI, Baltimore April 2010 http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/presentations.html

3 data s c a l e Human Genome printed http://www.flickr.com/photos/johnjobby/2252981353/sizes/l/

4 Data sets are becoming the new instruments of science

5 $1000 genome in <15 minutes....by 2013?

6 ...data logistic challenges.... Large-scale data storage that is: –Cost-effective (rent on-demand) –Secure (privacy and IPR) –Robust and resilient –Low entry barrier / ease-of-use –Has data-handling / transfer / analysis capability Move sequencing out of genome centres....analyse an entire human genome in a single day sitting with a laptop at your local Starbucks....cloud services

7 Clients in the cloud

8 Library Actions 1.Provide Briefings on Cloud Data Services (in partnership with local IT Services?)

9 Workflows, Models, Tools Sage Bionetworks genomics Workflow

10 Reference Linking Research Outputs User registration data; Instrument allocation data etc. Comments, annotations, ratings etc. Risk assessment data; other sample data Analyse Derived Data Research Concept and/or Experiment Design Acquire Sample Peer-review Proposal Conduct Experiment Generate, Create, & Collect Raw Data Process Raw Data into Derived Data Interpret & Analyse Results Data Archive, Preservation & Curation IPR, Embargo & Access Control Validate, Reuse & Repurpose Data Publish Research Results DataDerived DataProcessed Data Raw, Correction & Calibration Data Papers, articles, presentations, reports An Idealised Scientific Research Data Lifecycle Model Documentation, Metadata & Storage (Reference, Provenance, Context, Calibration etc.) Start Project Write Proposal (include DMP) Scholarly Knowledge Write Usage Reports Publication Database Research ActivityResearch Admin Activity Archive Activity Information Flow KEY Prepare Supplementary Data Prepare Manuscript Peer Review Research Discover & Access Appraisal & Quality Control Programs (generate customised software) Publication Activity

11 State-of-the-Art Report : Models & Tools (Alex Ball, June 2010) Data Lifecycles Data Policies (UK) incl DMP Standards & tools Data Asset Framework (DAF) DANS Seal of Approval Preservation metadata Archive management tools Cost / benefit tools

12 Library Actions 1.Provide Briefings on Cloud Data Services (in partnership with local IT Services?) 2.Build usable Data Management Tools working in partnership with researchers

13 Data Sustainability….

14 Dimension 1 DirectIndirect (costs avoided) Dimension 2 Near-termLong-term Dimension 3 PrivatePublic Benefits Taxonomy: Summary Keeping Research Data Safe2 Report: April 2010

15 Library Actions 1.Provide Briefings on Cloud Data Services (in partnership with local IT Services?) 2.Build usable Data Management Tools working in partnership with researchers 3.Develop Data Sustainability Strategies and articulate the cost-benefits

16 Ethics, Privacy, Culture You have zero privacy anyway. Get over it Scott McNealy, CEO Sun Microsystems, 1999

17 Post-genome decade Human genomes: >24 published & almost 200 unpublished

18 P4 medicine : Predictive, Personalised, Preventive, Participatory. Leroy Hood – Institute for Systems Biology Image from Scientific American...medicine is going to become an information science...

19 P4 medicine Each patients genome sequenced Your genome is basis of your medical record New method to anonymise medical records for genomics research at Vanderbilt Univ (April 10) New predictive models of health and disease Personalised treatments focus on preventative therapies Genome scale network biology Genomic data as a commodity

20 They have shared their data….

21 Share my data?

22 While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice...... using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines. INCREMENTAL Project

23 Sage Bionetworks : Integrative genomics Open data in the Sage Commons repository Human and mouse: clinical and genetics data Develop predictive models of disease: liver / breast / colon cancer, diabetes, obesity Crowd-sourced effort : global scope Stephen Friend

24 Participatory medicine : share data & empower the patient... Sage Congress San Francisco April 2010

25 Library Actions 1.Provide Briefings on Cloud Data Services (in partnership with local IT Services?) 2.Build usable Data Management Tools working in partnership with researchers 3.Develop Data Sustainability Strategies and articulate the cost-benefits 4.Publish Case Studies on Open Science to show benefits of universal data sharing

26 Library Actions 1.Provide Briefings on Cloud Data Services (in partnership with local IT Services?) 2.Build usable Data Management Tools working in partnership with researchers 3.Develop Data Sustainability Strategies and articulate the cost-benefits 4.Publish Case Studies on Open Science to show benefits of universal data sharing 5.Present at University Ethics Committee to highlight open data issues for faculty

27 Professional Scientists Enthusiastic amateurs Training Citizen scientist Standards and ethics Local : natural history, environ. Peer-review Global : astronomy Organisational support Self-supporting

28

29 Citizen Science : validated in the professional press

30 Working with science professionals

31 Library Actions 6.Raise awareness of Citizen Science opportunities & guidelines for good practice

32 Data Publication and Attribution http://www.flickr.com/photos/digitalfemme57/3271063366 /

33 Calls for action, new metrics

34 Journal Article Workflow Visualisation Model Data Annotation Concept Macro Micro / Nano Attribution granularity What are we citing?

35 How to cite large-scale predictive network models? Multiple data sources Linked data approach Visualise : Cytoscape Workflow : Taverna Provenance issues

36 Library Actions 6.Raise awareness of Citizen Science opportunities & guidelines for good practice 7.Promote Data Citation and Attribution to embed in publication practice and influence funder policy

37 Take homes... 1.Briefings on Cloud Data Services 2.Build usable Data Management Tools 3.Develop Data Sustainability Strategies 4.Publish Case Studies on Open Science 5.Present at University Ethics Committee 6.Raise awareness of Citizen Science 7.Promote Data Citation and Attribution...Acting as Advocate

38 Chicago Mart Plaza, 6-8 December 2010 Thank you…


Download ppt "A centre of expertise in digital information management www.ukoln.ac.uk UKOLN is supported by: Acting as Advocate? Seven steps for libraries in the data."

Similar presentations


Ads by Google