Presentation is loading. Please wait.

Presentation is loading. Please wait.

Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.

Similar presentations


Presentation on theme: "Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS."— Presentation transcript:

1 Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS Utilities Apache Jackrabbit (or any JSR-170 compliant Repository or CMS) Tools For Acquisition, Packaging & Ingest of Web Objects into Multiple Repositories 2 3 1 Abstract Technical architecture for acquiring, packaging & ingesting web objects for archiving in multiple repositories. Part of ECHO DEPository Project, a 3- year NDIIPP-partner digital preservation project at the University of Illinois Urbana-Champaign in partnership with OCLC, the Library of Congress & others. 1 Web Archives Workbench A suite of four web archiving tools for identifying, selecting, describing & harvesting web-based content based on library and archival practice bridges gap between manual selection & automated capture by transforming collection policies into software- based rules & configurations accommodates variety of web harvesting approaches -- mass harvesting, selective harvesting, individual document harvesting packaged content ingestible into variety of repositories via Hub-and-Spoke repository architecture 2 METS Profiles Development Modular approach similar in concept to TEI Pizza Chef (http://www.tei-c.org/pizza.html) Attempts to generalize METS requirements into inter- operable modules, instead of profiles that address only particular environment or toolset Common Hub requirements apply to all objects; allows interoperability of Hub scripts and processes Additional requirements extend common requirements to provide more robust description of objects 3 Hub-and-Spoke Architecture An archival interoperability architecture in proof- of-concept implementation ‘Spokes‘ = programs that translate repository-specific formats to and from hub METS profiles support archival & preservation metadata formats (i.e.,PREMIS), including changes to metadata & digital objects themselves as moved between repositories ‘Hub‘ = family of SIP/DIP/AIP METS profiles, METS import/export and utility programs & JSR-170/283 compliant content repository content repository = temporary staging area for data as moved between repositories -- & may be used for long-term preservation store Authors: S. Rani, J. Goodkin, J. Cobb (OCLC); T. Habing, J. Eke, R. Urban (UIUC); R. Pearce-Moses (AZ State Library & Archives) Domain Discovery Tool Properties Entity Tool Analysis Tool Packager Tool Discover Domains Group & Prioritize Domains Prioritize Discovery Domains Organize Collection Space Create Metadata Content Harvest Schedule Site Analysis Review Content Package For Ingest METS Package Objects Associate Owners Associate Content Create Package W E B A R C H I V E S W O R K B E N C H 1 HUB–AND– SPOKE MODEL 3 METS Profiles Common Requirements Root MIME Type Requirements Object Structure Requirements PDF Object Common Hub requirements MIME Application requirements (application/pdf) Simple structure requirements Service Requirements Common Hub requirements MIME Text requirements. (text/html) MIME Image requirements. (image/jpeg) Web Structure requirements Web Harvest Service requirements Website Object METS PROFILES DEVELOPMENT 2 METS-packaged object package Hub Digital Archive Repository N ingest Web Archives WorkbenchTools Identify, describe, package web content


Download ppt "Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS."

Similar presentations


Ads by Google