Presentation is loading. Please wait.

Presentation is loading. Please wait.

Publishing and Cataloguing Datasets It’s time everyone got involved UKSG Conference 2009.

Similar presentations


Presentation on theme: "Publishing and Cataloguing Datasets It’s time everyone got involved UKSG Conference 2009."— Presentation transcript:

1 Publishing and Cataloguing Datasets It’s time everyone got involved UKSG Conference 2009

2 BUT FIRST, LET’S GET SERIOUS Part 1 OECD likes being cool with data.

3 TIB in Hamburg say they have archived and added DOIs to 500,000 datasets. Yet their most-cited dataset has been cited just 3 times. Some data to start with

4 Either no-one wants to cite data OR, having a DOI by itself isn’t enough So …

5 Let us imagine for a moment...

6 If an article is... A piece of data that is presented in a static, two-dimensional, form. “ “ Geoffrey Bilder, CrossRef, 2007

7 Print HTML PDF Static, two-dimensional objects http://dx.doi.org/10.1787.280675838368

8 Active, two-dimensional object

9 Another active, two-dimensional object?

10 It’s a view on a datacube

11 In fact, it’s a view on a collection of datacubes Active, multi-dimensional object!

12 OECD Article Static, two- dimensional object OECD Excel Table Active, two- dimensional object OECD Database Active, multi- dimensional object http://dx.doi.org/10.1787.280675838368

13 So, instead of imagining, let’s say we built this. We’d get something like...

14 DatasetOECD Regional Database Excel – Active two- dimensional object Dataset – Active multi- dimensional object PDF – Passive two- dimensional object... this.

15 OK – that’s cool and OECD can do this because we have all the objects in our publishing system. But how are other publishers, authors and librarians coping with data?

16 Source: OECD Chart from The Economist

17 Source: Acemoglu et al (2001), based on Curtin, 1989, Philip D. Curtin, Death by migration: Europe’s encounter with the tropical world in the nineteenth century, Cambridge University Press, New York (1989).Curtin 1989 and other sources. Tertiary school enrollment: School enrollment, tertiary (% of gross). Source: Barro and Lee (2000) and their databases. Taken from an appendix to an article published in Elsevier’s World Development You can’t fault the author for trying... but it’s not a lot of help for a reader

18 And Librarians, How many are cataloguing datasets in their OPACs in ways which are compatible with search systems for books and journals?

19 Conclusion: Datasets: Scholarly Publishing’s Black Sheep?

20 A&I & subject portals Publishers Library portals Content Aggregators EconLit RePEc Science Direct OPACs Ingenta Scholarly Publishing Sites for Journals and Books Network

21 A&I & subject portals Publishers Library portals Content Aggregators EconLit RePEc Science Direct OPACs Ingenta Scholarly Publishing Sites for Journals and Books Network

22 A&I & subject portals Publishers Library portals Content Aggregators EconLit RePEc Science Direct OPACs Ingenta Scholarly Publishing Sites for Journals and Books Network

23 U sing metadata for: Datasets In the same industry standard formats as... Book chapters and Journal articles

24 Authors will be able to cite... Publishers will be able to link... Discovery systems will be able to find... Librarians will be able to catalogue... Datasets alongside published outputs...... to the benefit of Everyone

25 A proposed example of a dataset using standard bibliographic and citation metadata. Bibliography of Books that cite this database Citation tool compatible with EndNotes et al Dataset title with ISSN, DOI (& MARC) record

26 There are still challenges: - Dynamic data - Versioning - Preservation But, let’s round the sheep up first.

27 OECD is: issuing a white paper on Publishing Standards for Datasets Speaking with CrossRef about citation standards for dynamic objects publishing OECD datasets with ‘sheepdogs’ from mid-2009: MARC records ONIX records Citation records

28 End of Part One QUESTIONS? DISCUSSION?

29 BEING COOL WITH DATA: OECD’S FIRST STEPS Part 2

30 March 2007

31 Print edition Web-book on SourceOECD USB Key Edition OECD Factbook on eXplorer (new for 2009) OECD Factbook on iPhone (new for 2009) April 2009

32 OECD Regional Statistics data using the eXplorer tool tool eXplorer tool OECD Regional Statistics using NCVA’s eXplorer toolRegional Statistics October 2008 March 2009 http://stats.oecd.org/OECDregionalstatistics/

33 Other cool visualisation stuff IMF Datamapper on www.imf.org. See also www.mappingworlds.com who provided the technology.www.imf.org www.mappingworlds.com See Gapcasts and Trendalyzer on www.gapminder.orgwww.gapminder.org The New York Times uses a lot of dynamic graphicsNew York Timesgraphics USA Today built their reputation on graphics – now they’re doing it online. We like How much is $700bn?How much is $700bn? Economist’s Chart Gallery generate a lot of comment.Chart Gallery Data sharing sites include www.swivel.com, www.many-eyes.com an newcomers www.icharts.net and www.widgenie.com.www.swivel.comwww.many-eyes.comwww.icharts.netwww.widgenie.com There are many blogs on charts or visualization such as www.flowingdata.com or www.eagereyes.org www.flowingdata.comwww.eagereyes.org


Download ppt "Publishing and Cataloguing Datasets It’s time everyone got involved UKSG Conference 2009."

Similar presentations


Ads by Google