Archiving and Presenting Journals with Rosetta Matthias Groß, Bavarian State Library, Munich, Germany 10th IGeLU Conference, Budapest, September 2 nd 2015.

Slides:



Advertisements
Similar presentations
EPrints Web Configuratio n Management. SQL database Web server Scripts to configure repository activities Configuration files EPrints - the Administrator's.
Advertisements

The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
JISC/BL Workshop Digital Libraries and their services March 6, 2006 Richard Boulderstone Director eStrategy, The British Library.
September 1 st 2010 Igelu Ghent The on-the-fly conversion circus Matthias Gross (Bavarian State Library)
Getting Started with EndNote X4 Dr. Christiane Holtz, 28.September
1 Building a “Virtual Library Collection” through freely-accessible web sites: ‘Select Web Sites database’ at University of Vermont Wichada SuKantarat.
OPEN RESEARCH DATA, EPFL, 28 October 2014, M. Töwe, M. Bärlocher docuteam packer: viewer and editor for file structures and metadata.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
1 Managing Legal Deposit for Online Publications in Germany Cornelia Diebel.
Depositing e-material to The National Library of Sweden.
Resource Discovery Module DigiTool Version 3.0. Resource Discovery 2 Deposit Approval Search & Index Dispatcher & Viewers Single & Bulk Web Services DigiTool.
Ingest and Loading DigiTool Version 3.0. Ingest and Loading 2 Ingest Agenda Ingest Overview and Introduction Ingest activity steps Transformers Task Chains.
Challenges for the DL and the Standards to solve them Alan Hopkinson Technical Manager (Library Systems) Learning Resources Middlesex University.
Basic Concepts Architecture Topology Protocols Basic Concepts Open e-Print Archive Open Archive -- generalization of e-print Data Provider and Service.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
1  Ex Libris Ltd., Internal and Confidential Ex Libris Rosetta Sofia University “St. Kliment Ohridski” Sofia, Bulgaria 2 nd July, 2013.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Putting it all together for Digital Assets Jon Morley Beck Locey.
Bibliography in the Digital Age - IFLA Satellite Meeting Warsaw, 9 August Online materials published in Austria collecting, archiving and metadata.
High-Speed, High Volume Document Storage, Retrieval, and Manipulation with Documentum and Snowbound March 8, 2007.
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Alma 1 year after STP: implementing batch services IGeLU Budapest Sep 2, 2015 Bart Peeters Head Operations LIBIS.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Getting Started with CONTENTdm Corey Harper, University of Oregon Terry Reese, Oregon State University OLA - April 8, 2005.
Subject To Change automatic catalog enrichment with subject headings and codes 10th IGeLU conference Budapest, Marcus Zerbst Zentralbibliothek.
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Choosing Delivery Software for a Digital Library Jody DeRidder Digital Library Center University of Tennessee.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
11 October 2015 MAVIS v “Sneak Preview”. 11 October 2015 Enhancements in the Release  Reference Material  Brief Accessioning View  Template.
Word Lesson 12 Creating Mail Merge Documents Microsoft Office 2010 Advanced Cable / Morrison 1.
A rticle L icensing I nformation A vailability S ervice IDS Project Information Delivery Services Mark Sullivan Library Systems Administrator SUNY Geneseo.
Keele Pathfinder Project CLA Reporting of Scanned Material in a Repository Pathfinder - Tim Denning - Project Leader Catering VLE Powerlink - Boyd Duffee.
September 8th, 2013 IGeLU Berlin IT staff and librarians pull together: Collaborative development of a new METS viewer Matthias Groß, Bavarian State.
ICOLC Las Vegas March 28, 2003 TDNet E-Management Services for Consortia From E-Journals to E-Resources Michael Markwith President, TDNet Inc.
FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
DOI’s, Open URL’s and Context Sensitive Linking What Are They and How Can I Make Them Work for My Library Rachel L. Frick Head, Bibliographic Access Services.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
PREMIS Implementation Fair – SF 2009 PREMIS use in Rosetta Yair Brama – Ex Libris.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
VITAL at the National Library of Wales Glen Robson
The New DRS Introduction. What is DRS? Digital repository for preservation and access – Maintains integrity of deposited content – Preserves content for.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Implementing PREMIS in DigiTool Michael Kaplan ALA 2007 Update.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
FACES General Overview ViRR (Virtueller Raum Reichsrecht) Software Solutions Kristina Büchner and Bastien Saquet Contact:Kristina Buechner:
Radoslav Pavlov, Galina Bogdanova, Desislava Paneva- Marinova, Todor Todorov, Konstantin Rangochev
Automating the Audit: Updates from the Metadata Upgrade Project at the University of Houston Libraries Andrew Weidner, Metadata Librarian Santi Thompson,
CERN Document Server 19 tth January 2006 CERN Document Server Jean-Yves Le Meur 19 th January 2006.
Preservation Functionality in a Digital Archive Erik Oltmans Koninklijke Bibliotheek Raymond J. van Diessen IBM Business Consulting Services Hilde van.
Building KFS using KNS Presented by James SmithJustin Beltran University of ArizonaUniversity of California, Irvine.
William J Nixon Setting up a Repository. Introduction Key Features to consider (and review) Wide Range of Technology Available –Best fit for purpose –Clear.
Archon: Facilitating Access to Special Collections Prepared for PACSCL Conference Something New for Something Old: Innovative Approaches.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
A Framework for Institutional Repositories José Luis Borbinha, Jorge Machado
Synchronizing data from remote digital repository to Alma
Jordan PIŠČANC, University of Trieste
Featured Enhancements to the IDE & Debugger
Building A Repository for Digital Objects
Trove Tufts Digital Image Library
Andreas Trappe Scientist of Information and Media Technologie
Malte Dreyer – Matthias Razum
EPrints Web Configuration Management
Márton Németh – László Drótos How to catalogue a web archive?
DATABASES WHAT IS A DATABASE?
Presentation transcript:

Archiving and Presenting Journals with Rosetta Matthias Groß, Bavarian State Library, Munich, Germany 10th IGeLU Conference, Budapest, September 2 nd 2015 DRAG Dresden 2014

2 Short timeline (1) - DigiTool BVB: Bavarian Library Network, regional consortia for research libraries Head Office: department of the Bavarian State Library : looking for powerful „multimedia“ software 2006-: implementing DigiTool, going live 2007/08 How to manage journals? complex objects / collections / METS objects  BVB chooses METS-objects for journals

3

4 Short timeline (2) - Rosetta 2010-: implementing Rosetta at BSB journals not included in pilot workflows How to manage journals? collections / METS-objects/… 2013/14 collection management gets better, but … 2014 … decision to follow own approach in parallel 2015 struggling with some problems, then:  Welcome, journals, to Rosetta!

5 Presenting journals with Rosetta BSB uses Rosetta as „light“ archive whenever reasonable A tree structure with several levels (unlimited depth) is powerful enough to handle most common journal structures and seems natural for end user presentation If the tree structure is represented by an „object“, this can correspond with catalogue entries / persistent identifier on the title level WANTED: (elsewhere)

6 Re-shaping our DigiTool concept for Rosetta In the „Manual Legal Deposit“ workflow, new issues are ingested as new IEs Testing collection management in Rosetta in 2014 we saw still some shortcomings (addressed in Pressure Points document) Adding new components (issues) to METS-objects would create new versions and lead to a confusing situation, obfuscating genuine preservation actions  BVB wants something that acts like METS, but is not a METS-object

7 Starting at the end … BVB developed own METS viewer for DigiTool in 2012/13 which is basically independent of the system holding the objects; display uses jquery/css. Only a few interfaces to the system needed: 1.Table of contents: from StructMap/FileSec  json (Precache) tree structure with Digitool-PIDs of components as leaves 2.Bibliographic metadata: on-the-fly from original MARC/MODS/DC data (2-layer XSLT transformation to json) 3.Request for a child object: uses delivery URL for embedded mode (provides main title and stream) 4.Thumbnail preview: based on Table of contents using special Delivery Rule

8 Facial composite of the solution (1) 1. Table of contents as „near-METS“ All components of a journal share the same bibliographic ID in dc:relation Store reference data (volume, issue, year) in dcterms:bibliographicCitation (trick: use OpenURL 1.0) Based on this information, a ToC can be created and stored in the file system as BibID.json with Rosetta‘s IE IDs as leaves.

9 Facial composite of the solution (1a) Plan: Using MARC/MODS metadata instead; OpenURL trick is not so friendly for human editing OpenURL as container

10 Facial composite of the solution (2) 2. Bibliographic metadata BibID is known (from each component); for display fetch recent MARC-XML record via Aleph SRU interface 3. Request for child object DeliveryRule „embedded“ in Rosetta 4. Thumbnail preview DeliveryFunction „thumbnail“ in Rosetta

11 Proof of concept

12 Creation of near-METS industrialized Our approach: Harvesting the OAI interface (good experience with DigiTool) However, we encountered problems to get valid XML output from Rosetta. After some months it turned out that there is a config parameter ‚dublincore_additional_namespaces‘ (see Home > Advanced > Configuration > General > General Parameters) that should be defined as [blank] – which was not the case in our installation.

13 Data processing (simplified: without deletions) ( Rosetta OAI repository filter by journal Harvest: What‘s new since …? BibID BV issue 3, vol. 2, year 2015 Found new component? add to StructMap BV json Known journal New journal create StructMap BV json get bibliographic MD from Aleph (* der Übersichtlichkeit halber ohne den Fall „Löschung“ dargestellt …)

14 Following two tracks Combining near-METS with Rosetta-Collections 1 collection equals 1 journal Metadata on journal level  URN on journal level (PP: CM 2.2.2)  AssignCMS for journal level (metadata in Rosetta // URN, ArchiveURL in ALEPH) (Collection Support – WP, 2012)  Searching monographs and journals in parallel (IEs and collections, PP: CM 2.2.3)  Manual Legal Deposit : Issue goes to correct journal „automatically“ Easy administration of IEs in Rosetta

15 They are waiting: Legal Deposit: - in DigiTool: 450 journals, issues -on heap: 100+ journals, constantly new titles arriving OA publications - finalizing collection strategy for Bavarica and special subject fields Licensed publications (E-journal backfiles): -responsibility on national, regional and local levels -for hosting and long term preservation Digitized material - from ZEND / TSM

16 Thank you very much for your interest in the most fascinating format of scientific literature!