Introducing the ELAR information system architecture

Slides:



Advertisements
Similar presentations
Current design issues for digital archives Robert Munro (presented by David Nathan) Endangered Languages Archive (ELAR), School of Oriental and African.
Advertisements

OLAC Metadata Steven Bird University of Melbourne / University of Pennsylvania OLAC Workshop 10 December 2002.
IRCS Workshop on Open Language Archives IMDI & Endangered Languages Archives Heidi Johnson / AILLA.
OLAC Process and OLAC Protocol: A Guided Tour Gary F. Simons SIL International ___________________________ OLAC Workshop 10 Dec 2002, Philadelphia.
LSA Archiving Tutorial January 2005 Archives, linguists, and language speakers.
Endangered Languages and Web-Based Archiving Megan J. Crowhurst The University of Texas at Austin & CELP Contributors: Chris Beier, Heidi Johnson, Lev.
FAIR – Focus on Access to Institutional Resources William J Nixon DAEDALUS Project, University of Glasgow e-libraries for e-learning.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
CESSDA Question Databank Tender, results and future Maarten Hoogerwerf, CESSDA expert seminar 2009.
DRS 2 one in a series of periodic updates Harvard University Library Andrea Goethals October 21, 2009 DRS = Digital Repository Service.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Depositing e-material to The National Library of Sweden.
ASSDA: A Trusted Digital Repository or a trusted digital repository? Sophie Holloway The Australian Social Science Data Archive Taking the Shock Out of.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Data format translation and migration Future possibilities Alasdair Crockett, Data Standards Manager UK Data Archive.
Methodology Conceptual Database Design
The British Library’s METS Experience The Cost of METS Carl Wilson
Current Trends in Language Documentation and the Hans Rausing Endangered Languages Project Lenore A. Grenoble Dartmouth College Lenore A. Grenoble Linguistics.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
Metadata Harvesting The Hague, 13 & 14 January 2009 Julie Verleyen Scientific Coordinator, Europeana Office EuropeanaLocal Knowledge Sharing Workshop.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
The Archive of the Indigenous Languages of Latin America Goals and Visions.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
David Nathan Endangered Languages Archive SOAS University of London 3L Summer School, Conference, 6 July 2012 Training for language documentation: trends.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
A disaggregated model for preservation of E-Prints Gareth Knight SHERPA DP Project Arts and Humanities Data Service.
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson / The University of Texas at Austin.
Data Management David Nathan & Peter Austin & Robert Munro.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
CAVA: a human Communication Audio-Visual Archive Matt Mahon [1], Suzanne Beeke [1], Merle Mahon [2] and Martin Moyle [3] UCL Departments of Language and.
“This presentation is for informational purposes only and may not be incorporated into a contract or agreement.”
Technology – Broad View Aspects that play a role when integrating archives leave the details of some core topics to the 2. day Bernhard Neumair:Base Technologies.
Database Management Systems (DBMS)
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Archiving microdata Standards and good practices United Nations Statistics Commission New York, February 26, 2009 Olivier Dupriez World Bank, Development.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
The NLW Digital Asset Management System Paul Bevan DAMS Implementation Manager
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
The ELAR Metadata Set David Evans, ELAR 3 November 2006.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
NDIIPP Access Project Building on Metadata NDIIPP Partner Meeting June 25, 2009.
The OAIS model SEEDS meeting May 5 th, 2015, Lausanne Bojana Tasic.
1 February 2012 ILCAA, TUFS, Tokyo program David Nathan and Peter Austin Hans Rausing Endangered Languages Project SOAS, University of London Language.
1 Options Clearing Corporation Encore Data Distribution Services April 22, 2004.
Preservation Functionality in a Digital Archive Erik Oltmans Koninklijke Bibliotheek Raymond J. van Diessen IBM Business Consulting Services Hilde van.
XML 2002 Annotation Management in an XML CMS A Case Study.
Thinking Long Term - Archive Strategies for Alfresco Nathan McMinn Remote Service Engineer Alfresco Chetan Lalye Senior Software Architect Agilent Technologies.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
The Multi-Faceted Use of the OAI-PMH in the LANL Repository Written By: Henry, Xiaoming,Patrick Henry, Xiaoming,Patrick and Herbert. Presented By: Shashi.
XML Databases Presented By: Pardeep MT15042 Anurag Goel MT15006.
Data Management and Archival Storage Bojana Tasić FORS SEEDS Workshop I Belgrade, October.
Metadata Issues in Long-term Management of Data and Metadata
Ingest and Dissemination with DAITSS
Building A Repository for Digital Objects
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Archiving and Delivery of Student Portfolios
Introducing the ELAR information system architecture
Presentation transcript:

Introducing the ELAR information system architecture Robert Munro & David Nathan Endangered Languages Archive (ELAR), School of Oriental and African Studies, London

Outline Introduction The ELAR architecture User Requirements Ingestion Archive & dissemination Conclusions

Introduction – who we are Part of the Hans Rausing Endangered Languages Project (HRELP), based at the School of Oriental and African Studies (SOAS), University of London. Funded by the Lisbet Rausing Charitable fund The other two parts are: Academic Programme (ELAP) runs postgraduate courses, seminars and workshops Documentation Programme (ELDP) funds endangered language documentation projects

ELAR – current state In the process of designing and implementing key systems: accession system (ingestion system) archive information system catalogue serving system archive access system data storage long-term backup system

ELAR – current state Source of materials supporting the systems analysis and design: literature review review of exemplar materials interaction with associated archives interaction with ELDP grantees interaction with members of ELAP departmental seminars on language documentation seminars focused on archiving

ELAR – architecture Strongly informed by the Open Archive Information System (OAIS) Reference Model (CCSDS, 2002)

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities Identify the nature of the materials (content, format and structures) that data producers will create

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities Identify the intended users of the archive, and their user requirements

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities Define dissemination formats, data structures and procedures that support the user requirements of the designated communities

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities Design an archive information system able to store all the information and produce the required dissemination packages.

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities Define ingestion (accession) formats and structures that minimise the conversion cost

Designated communities The OAIS model afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds afd_34 dfa dfadf fds fdafds Producers Ingestion Archive Dissemination Designated communities The archive needs to define three types of ‘packages’: ingestion, archive and dissemination.

User requirements EL speakers and communities: continuation of ownership of language and materials depositors: preserve deposit structure; update material; be correctly attributed researchers: search (broad, narrow, domain specific); add materials; add relationships publisher– repurposing: obtain high quality data for repurposing publisher– public heritage: archive to act as mediator public: browse long-term preserver: obtain clearly structured data

Ingestion A set of formats & structures that can be converted to archive formats with minimal effort: file formats conforming to the 7 + 1 dimensions of portability (Simons and Bird, 2003; Johnson 2004) support incremental assembly of the deposit well-documented structures: XML with schema ideal ELAR preferences: uncompressed, nonpropriety formats well-documented structures: (OLAC, IMDI, custom)

Ingestion Filenames and structure of deposit: we convert deposits to formats / structures appropriate for the archive information system …but, we record the filenames and directory structures of the deposit, allowing depositors to navigate the materials via them

Ingestion Access protocols … tomorrow

Archive and dissemination Granularity: archive objects can be bundles archive objects can be a subsection of a file the types of related materials and their relationships should play a part in the search options

Archive and dissemination Version control: modeling versions of materials are required multiple types of versioning might be required (migration / dissemination / content update) versions will be ‘invisible’ to most dissemination packages

Archive and dissemination Adding materials and metadata: users can add comments to data users can add metadata values not provided by a depositor users can make relationships between items, including mapping users can supplement the kinds of metadata and relationships in the archive. note: all the above require moderation and supporting architecture

Archive and dissemination Language support: users should be able to add comments / metadata in any language users should be able to navigate the archive access system via the language preference(s) of their choice the archive architecture needs to support translations of metadata and comments

Archive and dissemination Archive services advice and conversion services to depositors response to requests for information supporting communications between individuals associated with the archive

Archive and dissemination Archive information system: separate metadata from materials avoid redundancy Dissemination packages: favour embedding metadata redundancy ok if an aid interpretation Technical solutions: we use MySQL to support the archive for dissemination, we favour XML and formats allowing metadata to be embedded (PDF, BWF)

Conclusions ELAR is newly opened for deposits Key systems are in the process of development Significant features include: modelling archive objects at different granularities modelling relationships between objects users can enter/define their own metadata users can translate information into the language of their choice users can navigate via the language(s) of choice