Presentation is loading. Please wait.

Presentation is loading. Please wait.

© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K.

Similar presentations


Presentation on theme: "© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K."— Presentation transcript:

1 © S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K. s.j.coles@soton.ac.uk

2 © S.J. Coles 2006 Why? Funding Body Viewpoint

3 © S.J. Coles 2006 Why? Curation in the Laboratory “Data from experiments conducted as recently as six months ago might be suddenly deemed important, but those researchers may never find those numbers – or if they did might not know what those numbers meant” “Lost in some research assistant’s computer, the data are often irretrievable or an undecipherable string of digits” “To vet experiments, correct errors, or find new breakthroughs, scientists desperately need better ways to store and retrieve research data” “Data from Big Science is … easier to handle, understand and archive. Small Science is horribly heterogeneous and far more vast. In time Small Science will generate 2-3 times more data than Big Science.” ‘Lost in a Sea of Science Data’ S.Carlson, The Chronicle of Higher Education (23/06/2006)

4 © S.J. Coles 2006 Why? Publishing and the Data Deluge 30,000,000 1.5,000,000 450,000

5 © S.J. Coles 2006 Why? Publishing Data and Information Loss

6 © S.J. Coles 2006 Separating Data from Interpretations Underlying data (Institutional data repository) Intellect & Interpretation (Journal article, report, etc)

7 © S.J. Coles 2006 Data Generation in Chemistry Synthesis Characterisation

8 © S.J. Coles 2006 Smart Labs

9 © S.J. Coles 2006 Preservation and Curation by Data Centres & Institutions G bytes M bytes Lab / Institution Data Centre / Public Domain k bytes

10 © S.J. Coles 2006 Laboratory IRs and Information Management

11 © S.J. Coles 2006 The R4L Repository Deposit Search / Browse Create new compoundAdd experiment data and metadata

12 © S.J. Coles 2006 The eCrystals Public Data Archive http://ecrystals.chem.soton.ac.uk

13 © S.J. Coles 2006 Metadata Publication Using simple Dublin Core Crystal structure Title (Systematic IUPAC Name) Authors Affiliation Creation Date Additional chemical information through Qualified Dublin Core Empirical formula International Chemical Identifier (InChI) Compound Class & Keywords Specifies which ‘datasets’ are present in an entry DOI http://dx.doi.org/10.1594/ecrystals.chem.soton.ac.uk/145 Rights & Citation http://ecrystals.chem.soton.ac.uk/rights.html Application Profile http://www.ukoln.ac.uk/projects/ebank-uk/schemas/

14 © S.J. Coles 2006 Aggregator services Institutional data repositories Validation Deposit Publication Validation Data analysis, transformation, mining, modelling Search, harvest Presentation services / portals Data discovery, linking, citation Laboratory repository Deposit The eCrystals ‘Global Federation’ Model Publishers: peer- review journals, conference proceedings, etc Curation Preservation Subject Repository Institution Library & Information Services


Download ppt "© S.J. Coles 2006 Data Management in the Chemistry Domain Simon Coles School of Chemistry, University of Southampton, U.K."

Similar presentations


Ads by Google