Metadata Acquisition with XML Case studies from the Swiss Federal Archives 9. October 2002 / Stephan Heuscher.

Slides:



Advertisements
Similar presentations
Organising and Documenting Data Stuart Macdonald EDINA & Data Library DIY Research Data Management Training Kit for Librarians.
Advertisements

DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Oracle Hyperion Financial Data Quality Management Considerations for a scaled, expedited and integrated approach on data quality NCOAUG – Aug 15, 2008.
Information Management Taunya L. Kidd | Freelance Consultant  Written Documents  Excel Workbooks  Access Databases  PowerPoint Presentations  Administrative.
METS at UC Berkeley Part I: Generating METS Objects.
VIEWS / TSS Overview. End-to-end Air Quality Data and Decision Support VIEWS / TSS Vision Acquisition Import Unification Management Manipulation Retrieval.
High-level VIEWS Architecture. Data Acquisition & Import Data Acquisition System: Accepts submission of data in a variety of schemas and formats Can automatically.
Requirements Specification
16 months…. The Visibility Information Exchange Web System is a database system and set of online tools originally designed to support the Regional Haze.
The Visibility Information Exchange Web System (VIEWS): An Approach to Air Quality Data Management and Presentation In a broader sense, VIEWS facilitates.
WMS: Democratizing Data
The eXtensible Past XML As a Means for Easy Access to Historical Research Data and a Strategy for Digital Preservation.
Automatic Data Ramon Lawrence University of Manitoba
Overview of Search Engines
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
Electronic Archive Services in Lithuania Dr. Arūnas Stočkus Vilnius University Faculty of Mathematics and Informatics Lithuania EBNA,
Data Transformation for Analysis Purposes Presented By: Gregg Ravenscroft Khulisa Management Services
1 L07SoftwareDevelopmentMethod.pptCMSC 104, Version 8/06 Software Development Method Topics l Software Development Life Cycle Reading l Section 1.4 – 1.5.
The CBSO project - Experience and issues Madrid, 05 October 2006 Camille Dümm Pascal Rodrique Central Balance Sheet Office.
A summary of the report written by W. Alink, R.A.F. Bhoedjang, P.A. Boncz, and A.P. de Vries.
XML Anisha K J Jerrin Thomas. Outline  Introduction  Structure of an XML Page  Well-formed & Valid XML Documents  DTD – Elements, Attributes, Entities.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Federal Statistical Office eSTATISTIK.core - Integrating Respondents’ IT Systems into Data Collection UNECE Work Session on Statistical Data Editing Bonn,
Web-Enabled Decision Support Systems
Introduction to MDA (Model Driven Architecture) CYT.
© ITEDO Software 2001 From 3D CAD to Web catalogs Dieter Weidenbrück.
Technical Aspects of SIARD “SIARD under the hood” 10. April 2003 / Stephan Heuscher.
XML Data Storage Joe Carroll Russell Gibbons. Agenda What is XML Storage of XML Benefits of XML Databases Problems with XML Databases Discussion.
METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.
1 Reviewing Data Warehouse Basics. Lessons 1.Reviewing Data Warehouse Basics 2.Defining the Business and Logical Models 3.Creating the Dimensional Model.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
CMSC 1041 Algorithms II Software Development Life-Cycle.
Power Designer Sybase.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.
XML Databases by Sebastian Graf Hier beginnt mein toller Vortrag.
Data resource management
Fachstelle ARELDA Schweizerisches Bundesarchiv 1 SIARD: Software Invariant Archiving of Relational Databases at the Swiss Federal Archives Contents: 
2-1 A Federation of Information Systems. 2-2 Information System Applications.
DATAWHERE - MIAB1 DATAWHERE The MIAB Solution to Information Support and Data Conversion/Migration 통신망 연구실 석사 3 학기 임 수 정.
What is HTTP? - the underlying communication protocol used by the www - common HTTP headers?
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Nikola Tesla Museum Clipping Library Saša Malkov Nenad Mitić Žarko Mijajlović 3 rd SEEDI Int.Conf. Cetinje, Montenegro 14. September 2007.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
ESRI Education User Conference – July 6-8, 2001 ESRI Education User Conference – July 6-8, 2001 Introducing ArcCatalog: Tools for Metadata and Data Management.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Robert Aydelotte ExxonMobil - Upstream Technical Computing 13 May 2004 Standardizing Fluid Property Reporting.
Open Planets Foundation Hackathon Database Archiving Event Implementation of SIARD at the Danish National Archives.
Managing Semi-Structured Data. Is the web a database?
Database Development Indra Budi
The OAIS model SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
1 How to move test data from existing files into a satellite database Jacek Wojcieszuk Jacek Wojcieszuk Warsaw University of Technology.
Database Overview What is a database? What types of databases are there? How are databases more powerful than spreadsheets?
Capture and Storage of Tabular Data Leveraging Ephesoft and Alfresco W. Gary Cox Senior Consultant Blue Fish Development Group.
Eurostat May 2016 Eurostat, Unit B3 – IT solutions for statistical production Test Client Jean-Francois LEBLANC Christian SEBASTIAN.
European Archival Records and Knowledge Preservation Database preservation Format and toolkit Jan Dalsten Sørensen Danish National Archives DLM Forum Riga.
May 2011DLM Forum, Budapest1 The First OAIS-compliant Ingest of Digital Records Zoltán Lux The National Archives of Hungary web:
Session 2b, 25 November 2015 eChallenges e-2015 Copyright 2015 The National Archives of Estonia Current lack of interoperability among submission information.
KEEPS – a system for UELMA preservation and security
KEEPS – a system for UELMA preservation and security
Metadata and XML <xmlpresentation>
Approaches to database archiving at the Danish National Archives
VIEWS / TSS Overview.
Data validation at DESTATIS
CSE591: Data Mining by H. Liu
Jean-Francois LEBLANC Christian SEBASTIAN
Presentation transcript:

Metadata Acquisition with XML Case studies from the Swiss Federal Archives 9. October 2002 / Stephan Heuscher

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives2 Overview  Problems acquiring metadata  Why XML?  Featured Projects  Lessons learned  Conclusions

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives3 Problems acquiring metadata  Documentation  Data format  Data consistency  System borders  Money  Communication with stakeholders

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives4 Why XML? XML …  … is an open standard  … is self-explanatory  … is human-readable  … can be validated automatically  … has a broad software support  Most products feature XML support

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives5 Featured Projects SIARD  Archiving of relational databases  Manual generation of additional metadata  Metadata and content is stored in XML files AMDA  Manages metadata for audio data from the Swiss Parliament  Does not manage audio data  Import of XML metadata  Must provide a variety of export formats

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives6 Data and low-level metadata extraction SIARD (System Independent Archiving of Relational Databases) OracleMS-SQL???-DB Additional high-level descriptive metadata Digital Archive (to be built) Database regeneration

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives7 XML use in SIARD  SQL-99 (ISO/IEC 9075)  Low-level data description  Structure  Datatypes  Constraints  XML  High level metadata  Table content (thin wrapper)

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives8 Data Logic (SQL) CREATE TABLE "FLUGLE"."CLASS" ( "CLASS_ID" NATIONAL CHARACTER VARYING(20) NOT NULL, "SCHEDULE_ID" NATIONAL CHARACTER VARYING(20), "CLASS_BUILDING" NATIONAL CHARACTER VARYING(25), "CLASS_ROOM" NATIONAL CHARACTER VARYING(25), "COURSE_ID" NATIONAL CHARACTER VARYING(5), "DEPARTMENT_ID" NATIONAL CHARACTER VARYING(20), "INSTRUCTOR_ID" NATIONAL CHARACTER VARYING(20), "SEMESTER" NATIONAL CHARACTER VARYING(6), "SCHOOL_YEAR" TIMESTAMP(0) ) CREATE TABLE "FLUGLE"."CLASS_LOCATION" ( "CLASS_BUILDING" NATIONAL CHARACTER VARYING(25) NOT NULL, "CLASS_ROOM" NATIONAL CHARACTER VARYING(25) NOT NULL...

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives9... SIARD Metadata XML

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives10 SIARD Data XML... 6,104200;4,S180;9,POCO HALL;3,150;3,198;5,PHILO;4,E491;6,SPRING;19, :00:00; 6,104500;3,T15;11,NARROW HALL;3,200;3,184;4,HIST;4,D944;6,SPRING;19, :00:00;...

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives11 AMDA (Audio MetaData Acquisition) Online parliament session metadata (XML) AMDA Webinterface Metadata Audio data Digital Archive Unified XML import Access DB (to be built)

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives12 XML use in AMDA  Import  XSLT transformation to common format  Online metadata  Legacy data (Access database)  Export  Raw XML output transformed using XSLT

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives13 AMDA Import XML (raw) ; Mitteilungen des Präsidenten Der Beginn dieser Herbstsession ist schmerzlich getrübt von unseren Gedanken an das...

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives14 AMDA Import XML (transformed)... Der Beginn dieser Herbstsession ist schmerzlich getrübt von unseren Gedanken...

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives15 Lessons learned  Transforming and reformatting of XML data is easy  Documentation and data integrity are crucial  Agree on rules and standards for XML formats early  Stakeholders’ uses of XML differ greatly

Urbino2002.ppt; Stephan Heuscher; Swiss Federal Archives16 Conclusions  XML  is not a preservation strategy  is only a technology  is too new for a common understanding  XML provides tools and techniques for a concise metadata management  Working solutions need both XML and non-XML experience  Most problems are still of human nature