Presentation is loading. Please wait.

Presentation is loading. Please wait.

Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, 2014-06-25 A Weekend with Nanite Large scale.

Similar presentations


Presentation on theme: "Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, 2014-06-25 A Weekend with Nanite Large scale."— Presentation transcript:

1 Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, 2014-06-25 A Weekend with Nanite Large scale characterisation of web archives

2 A short introduction to the experiment A live demonstration A look at the data for characterisation A look at the input for the job Run the job Analysis of the output and of the run itself. 2 Agenda This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

3 Performance-testing the tools SCAPE User Story: As a Web Archive I need a Digital Preservation System that can process both ARC and WARC files and identify file formats/characterize of items contained so that I can assess preservation risks and plan which tools will be required for access to those formats. 3 Task at Hand This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

4 Apache Tika DROID from The National Archive (libmagic) Not a word on FITS... 4 Tools at Hand This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

5 Created and maintained by the British Library Improved by SCAPE and sustained by Open Planets Foundation Tika and libmagic support added Advanced Tika support through a ”persistent” Tika server ARC header extraction added More to come… 5 Nanite This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

6 6 This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).

7 SCAPE User Story for web archive data: http://wiki.opf- labs.org/display/SP/File+Format+Identification+and+Ch aracterisation+of+Web+Archiveshttp://wiki.opf- labs.org/display/SP/File+Format+Identification+and+Ch aracterisation+of+Web+Archives Nanite: https://github.com/openplanets/nanitehttps://github.com/openplanets/nanite A Weekend With Nanite blog post: http://openplanetsfoundation.org/blogs/2014-05-28- weekend-nanite http://openplanetsfoundation.org/blogs/2014-05-28- weekend-nanite Open Planets Blogs: http://openplanetsfoundation.org/blog http://openplanetsfoundation.org/blog 7 References This work was partially supported by the SCAPE Project. The SCAPE project is co‐funded by the European Union under FP7 ICT‐2009.4.1 (Grant Agreement number 270137).


Download ppt "Per Møldrup-Dalum State and University Library SCAPE Information Day State and University Library, Denmark, 2014-06-25 A Weekend with Nanite Large scale."

Similar presentations


Ads by Google