1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.

Slides:



Advertisements
Similar presentations
Preservation by Migration to XML Dirk Roorda. work on a preservation strategy positioning of the XML preservation strategy implementing the strategy in.
Advertisements

MicroKernel Pattern Presented by Sahibzada Sami ud din Kashif Khurshid.
Remote Visualisation System (RVS) By: Anil Chandra.
Software Quality Assurance Plan
Snejina Lazarova Senior QA Engineer, Team Lead CRMTeam Dimo Mitev Senior QA Engineer, Team Lead SystemIntegrationTeam Telerik QA Academy SOAP-based Web.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
1 Introduction to SOA. 2 The Service-Oriented Enterprise eXtensible Markup Language (XML) Web services XML-based technologies for messaging, service description,
CIM2564 Introduction to Development Frameworks 1 Overview of a Development Framework Topic 1.
SOAPI: a flexible toolkit for implementing ingest and preservation workflows Mark Hedges Centre for e-Research, King’s College London Arts and Humanities.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.
July NAGARA 1 Producer-Archive Workflow Network Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University.
BitstreamFormat Renovation: DSpace Gets Real Technical Metadata.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
FOCUS: FOrmat CUration Service Advisor: Dr. Joseph JaJa Students: Sang Chul Song Muluwork Geremew.
May 23, 2007 Archiving ACE: A Novel Software Platform to Ensure the Integrity of Digital Archives Sangchul Song and Joseph JaJa Institute for Advanced.
Archiving Digital Government Data Joseph JaJa Institute for Advanced Computer Studies Department of Electrical and Computer Engineering University of Maryland.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information Principal Investigator: Joseph JaJa Lead Programmers: Mike.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information PI: Joseph JaJa Co-PIs: Allison Druin and Doug Oard Major.
Chapter 9: Moving to Design
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
FOCUS – A Scalable and Extensible Digital Format Registry Principal Investigator: Joseph JaJa Graduate Students: Sang Song and Muluwork Geremew Lead Programmers:
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
February Semantion Privately owned, founded in 2000 First commercial implementation of OASIS ebXML Registry and Repository.
Construction of efficient PDP scheme for Distributed Cloud Storage. By Manognya Reddy Kondam.
● Problem statement ● Proposed solution ● Proposed product ● Product Features ● Web Service ● Delegation ● Revocation ● Report Generation ● XACML 3.0.
A web interface for DAISY Pipeline. Introduction The PipeOnline web application serves as a web interface for DAISY Pipeline. DAISY Pipeline is a framework.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Adapting Legacy Computational Software for XMSF 1 © 2003 White & Pullen, GMU03F-SIW-112 Adapting Legacy Computational Software for XMSF Elizabeth L. White.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
International Telecommunication Union Geneva, 9(pm)-10 February 2009 ITU-T Security Standardization on Mobile Web Services Lee, Jae Seung Special Fellow,
A Domain-Specific Modeling Language for Scientific Data Composition and Interoperability Hyun ChoUniversity of Alabama at Birmingham Jeff GrayUniversity.
2004/12/02Slide Number 1 of 15 Exposure Time Calculator (ETC) as a Web Service Donald McLean 2004 Technology Open House.
Web Services based e-Commerce System Sandy Liu Jodrey School of Computer Science Acadia University July, 2002.
XMPP Concrete Implementation Updates: 1. Why XMPP 2 »XMPP protocol provides capabilities that allows realization of the NHIN Direct. Simple – Built on.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
10/25/20151 Single Sign-On Web Service Supervisors: Viktor Kulikov Alexander Sherman Liana Lipstov Pavel Bilenko.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
Global Digital Format Registry Progress Andrea Goethals, Harvard University Library NDIIPP Digital Preservation Partners’ Meeting Arlington, VA July 9,
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
SOAP-based Web Services Telerik Software Academy Software Quality Assurance.
Enterprise Solutions Chapter 10 – Enterprise Content Management.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Simple Object Access Protocol
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
IBM Global Services © 2005 IBM Corporation SAP Legacy System Migration Workbench| March-2005 ALE (Application Link Enabling)
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Transparent Format Migration of Preserved Web Content D. S. H. Rosenthal, T. Lipkis, T. S. Robertson, S. Morabito Lib Magazine, 11(1), 2005
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Joseph JaJa, Mike Smorul, and Sangchul Song
How to Implement an Institutional Repository: Part II
Presentation transcript:

1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer Science Studies Department of ECE, University of Maryland Sponsored by Library of Congress and NSF

2 Motivation Handling of digital formats is an essential part of long-term preservation. Format obsolescence –Technology evolution and the obsolescence of systems and applications software may leave users unable to access their old files. –Software developers may go out of business and no longer support the applications. Digital preservation requires –Different essential aspects of objects. –Tools for capturing the essential format characteristics of information stored as digital object and processing it.

3 Existing Methodologies Standardizing the digital contents to few common formats. –JPEG2000, OMF, and PDF/A are among the few selected open standard formats. Migration –Transforms older versions to newer formats. –Tends to be costly and prone to errors. Emulation –The original bit-streams are executed using an emulator. –Implementing such a strategy is extremely challenging and can be viewed as a transformation.

4 Our Goal A flexible framework for incorporating advances achieved through the existing approaches. Development of an efficient, scalable and platform independent prototype to enable the tracking and handling of format obsolescence. –Development of a Global Digital Format Registry (GDFR) – FOrmat CUration Service (FOCUS) –Development of enabler modules that can interface between GDFR and end-user applications.

5 FOCUS Architecture

6 FOCUS on LDAP and SOAP Interoperability –Protocols are platform independent Performance –Most operations are read-only queries. LDAP gives high performance in this environment. Extensibility –LDAP schema can be easily extended Scalabilit y –By the use of Distributed LDAP Security –SOAP can be on top SSL (https) –LDAP-based Format Registry can be easily integrated with any other LDAP-based authentication/authorization mechanisms.

7 Global Digital Format Registry GDFR serves to provide detailed information about formats. Existing Format Registries: –UPenn ’ s FRED- ( –Pronom - ( –Wotzit ’ s Format- ( Not clear how extensible, scalable, or how they can be interfaced with existing preservation systems.

8 FOCUS The registry contains information –File formats –Software tools Multiple ways to access GDFR in FOCUS are provided. –Directly through LDAP interface –Indirectly through SOAP interface Web Service Agent Global Digital Format Registry Software

9 GDFR-Internal Structure dc=umiacs, dc=umd, dc=edu ou=Format- Registry ou=Applications Adobe Acrobat v6.0 Adobe Photoshop v7.0 Jhove 1.0 ou=Formats Adobe PDF v1.4 CompuServ GIF 1989a JPEG Image Format 2000  General descriptive properties.  Processing: rendering, editing, conversion and validation services/systems.  General descriptive properties.  Processing : format taken as input and/or output.

10 Web-Service Agent Mediator between user and registry Serviced via SOAP Contains a file format identifier module, FIDER –Java module for format identification –Uses file magic number –Sequential from restrictive to general Web Service Agent Global Digital Format Registry Client Format Inquiry

11 Web-Service Agent Tailorability – Specific needs of an existing preservation system can be met by custom-tailoring Web-Service. Interoperability –Independent of OS and languages Convenience –Multiple LDAP queries can be reduced to one Web Service function call. –Any updates can be done in a single place, not having to distribute new modules to end users

12 FOCUS- Supplementary Tools Validation Software –Verifies and validates file formats of given file. Rendering Software –Interprets bit streams of files into human-friendly representation on the screen. Editing Software –Adds/Deletes/Modifies the contents of given file, keeping the correct file format. Conversion Software –Converts a file format to current or emerging formats.

13 Validation Software Conversio n Software Web Service Agent Identificatio n Service Rendering Software Rendering Software FOCUS Service Model Format Registry Identifies format of a specific DO using the internal signature Determines a verification service to verify the format of a specific DO Identifies current rendering conditions for specific digital format. Locates transformation services to convert DO from source format to format of interest.

14 Example Scenario: Digital Object Format Verification Validation Service Conversio n service Web Service Agent ID Service Rendering Service Rendering Service Format Registry Format ? Format ID / Format Info Verifier? App ID / App Info Verify this? Valid/Well-formed Step 1: User requests to identify the format a file via Web Service Step 2: Registry returns format ID and format information Step 3: User requests for information on available verifier for this format Step 4: Registry returns validation service ID and information, such as its service location Step 5: User connects to the validation service and verify the format Step 6: Validation service returns the verification result Web Service Agent Format Registry

15 Demo

16 Conclusion FOCUS design offers maximum –Flexibility – Web Service Agent can be easily tailored to meet the various needs of different preservation institutions. –Scalability – Format registry can also be distributed. FOCUS integrates current format preservation techniques and makes them available through SOAP- based web interface. In summary, we believe that the FOCUS prototype represents a significant advance towards the development of secure and scalable digital format registry.