Linking and Exploring Authority Files
TEL-ME-MOR/M-CAST Seminar, Prague, November 23rd 2006
Hans-Jörg Lieder, Berlin State Library

Subject access seminar, Prague

Overview
- Authority Control / Context Information
- Limitations
- LEAF Objectives
- Innovations
- LEAF System Architecture
- LEAF and other Systems

Authority Control
Libraries
- Separate data records (authority records) describing persons, corporate bodies, subjects etc. are maintained independently from bibliographic records, either as an independent database (e.g. PND) or as a table or group of tables within the library system
- Authority records can be linked to a variety of resources
- Purpose of a person name authority record: disambiguation of a name, i.e. establishing a 1:1 relationship between a name and a person

Context Information
Archives
- Traditionally, biographical information is part of an archival record
- Purpose of biographical context information: providing context for the specific archival record

Present Limitations
- Access to authority data is limited to some institutions only
- Access to authority data depends on the cataloguing rules and data formats employed
- Cross-domain sharing of authority data (e.g. between libraries, archives, museums and other 'memory institutions') does not exist
- Public users do not have access to authority data

Examples
- Minimal level library authority record: Smith, John, (LOC)
- Biographical context information in archives comes in all shapes and sizes
- Richer library authority record:

LEAF
- European project within the 5th Framework of the European Commission, Programme "Information Society Technology"
- March 2001 to February 2004
- Partners in 10 countries

LEAF Objectives 1
LEAF (Linking and Exploring Authority Files) developed a prototype system through which (internationally) distributed person name authority records are gathered, automatically linked in meaningful ways, made available to a variety of operations and opened up for multiple analysis.

LEAF Objectives 2
The following steps are included:
- New or updated local name authority records are fetched/harvested by or uploaded to the LEAF system on a regular basis
- All records in the LEAF system are converted into one common exchange format (EAC) and inserted into a central database
- Records describing the same person are automatically linked
- All records in the LEAF database are available for search and retrieval

LEAF Objectives 3
- Retrieved search results are stored in a Central Name Authority File
- Registered users can annotate records
- External systems can query the LEAF service
- LEAF can query external systems
- External resources can link to LEAF records
- Results retrieved in LEAF can be used as search arguments in other applications

LEAF Innovations
- Common exchange format for libraries and archives
- Linking process
- Usage impact
- Addition of annotations
- Integration into a distributed search service

Exchange format
- EAC, Encoded Archival Context
- XML DTD, parallel to EAD (Encoded Archival Description)
- Describes the circumstances of creation and use of records, including the identification of persons, corporate bodies and families, their roles and relationships
- Compatible with the MARC family, MAB and the archival standard ISAAR(CPF) (International Standard Archival Authority Record for Corporate Bodies, Persons, and Families)

EAC structure

EAC example (partial)
- Pi Sunyer Charles
- Pi i Sunyer Charles
- Pi Sunyer Carlos

Linking process
- Local records are harvested via FTP, OAI or Z39.50 and converted into EAC to form LEAF Authority Records (LARs)
- When LARs are found to describe the same person according to linking rules (based on names, dates, IDs), they are merged into a Shared LEAF Authority Record (SLAR)
- Records and links are regularly updated
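The linking step can be illustrated with a minimal Python sketch. It is not the LEAF implementation: the `LAR` fields, the `name_key` folding and the `same_person` rule are assumptions made for illustration, and the ID-based part of the linking rules is left out. Records whose folded names agree and whose life dates do not conflict are clustered, and each cluster with more than one member stands for a SLAR.

```python
import unicodedata
from dataclasses import dataclass
from itertools import combinations
from typing import Dict, List, Optional, Set


@dataclass(frozen=True)
class LAR:
    """A LEAF Authority Record reduced to hypothetical linking fields."""
    record_id: str
    name: str
    birth_year: Optional[int] = None
    death_year: Optional[int] = None


def name_key(name: str) -> str:
    """Fold case, accents and punctuation so trivial spelling differences vanish."""
    decomposed = unicodedata.normalize("NFKD", name)
    kept = "".join(c for c in decomposed if c.isalnum() or c.isspace())
    return " ".join(kept.lower().split())


def same_person(a: LAR, b: LAR) -> bool:
    """Assumed rule: equal folded names and no conflicting life dates."""
    if name_key(a.name) != name_key(b.name):
        return False
    pairs = ((a.birth_year, b.birth_year), (a.death_year, b.death_year))
    return all(x is None or y is None or x == y for x, y in pairs)


def link(records: List[LAR]) -> List[Set[str]]:
    """Cluster record ids with union-find; multi-member clusters stand for SLARs."""
    parent: Dict[str, str] = {r.record_id: r.record_id for r in records}

    def find(x: str) -> str:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(records, 2):
        if same_person(a, b):
            parent[find(a.record_id)] = find(b.record_id)

    clusters: Dict[str, Set[str]] = {}
    for r in records:
        clusters.setdefault(find(r.record_id), set()).add(r.record_id)
    return list(clusters.values())
```

Because clusters are built transitively, two records that never match each other directly can still end up together through a third record; a production rule set would need to guard against that.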

Usage impact
- When a user retrieves a LEAF record, its status is changed to Central Name Authority Record (CNAR)
- Data providers can see which of their records have been used and target the improvements they make to their data
- Records never used may be removed from the central system after an expiry date
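The usage-impact idea can be sketched in a few lines of Python. The class and field names (`Repository`, `StoredRecord`, `loaded_on`) and the retention period are hypothetical; only the status change to CNAR on retrieval, the per-provider usage view and the removal of never-used records come from the slide.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Dict, List, Optional


@dataclass
class StoredRecord:
    record_id: str
    provider: str
    loaded_on: date
    status: str = "LAR"               # becomes "CNAR" on first retrieval
    last_used: Optional[date] = None


class Repository:
    def __init__(self, expiry: timedelta) -> None:
        self.expiry = expiry          # assumed retention period for unused records
        self.records: Dict[str, StoredRecord] = {}

    def add(self, record: StoredRecord) -> None:
        self.records[record.record_id] = record

    def retrieve(self, record_id: str, today: date) -> StoredRecord:
        """Return a record and mark it as a Central Name Authority Record."""
        record = self.records[record_id]
        record.status = "CNAR"
        record.last_used = today
        return record

    def used_by_provider(self, provider: str) -> List[str]:
        """Let a data provider see which of its records have been used."""
        return sorted(r.record_id for r in self.records.values()
                      if r.provider == provider and r.status == "CNAR")

    def purge_unused(self, today: date) -> List[str]:
        """Remove records that were never retrieved before the expiry date."""
        expired = [rid for rid, r in self.records.items()
                   if r.status != "CNAR" and today - r.loaded_on > self.expiry]
        for rid in expired:
            del self.records[rid]
        return sorted(expired)
```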

Public annotations
- Registered users can make temporary annotations to (shared) LEAF records, e.g. to suggest a correction
- Data providers are alerted when an annotation is made to one of their records
- Registered users can make persistent annotations to central LEAF records, e.g. to provide a piece of information of general interest

Private annotations
- A private workspace is available to registered users
- They can save central LEAF records into it
- They can make private annotations to the saved records
- They receive a warning if a central LEAF record they saved is modified

System Architecture
[Diagram. Labels: Offline components; Online components; REPOSITORY; Database Access unit; LEAF database; ACQUISITION; Harvesting unit; Local Data Base; Data import unit; Conversion unit; Linking unit; Presentation unit; User Workspace unit; Export unit; Maintenance Suite; External Z39.50 system; External Services; User Interface; Logging unit; Admin. unit; Registration unit; Annotation unit; Search unit; MALVINE User Interface; MALVINE Search Engine]

Possible problem with a distributed search
Search argument: Ludwig
- System whose record contains 100 Ludwig $b 14 $c franz. König and 400 Louis $b XIV $c Roi de France: returns records
- System whose record contains only 100 Louis $b XIV $c Roi de France: returns no records
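A small Python sketch makes the problem and one possible remedy concrete. The two record lists stand in for the two systems above; the dictionary layout and the `search`, `name_forms` and `expanded_search` functions are illustrative assumptions, not part of LEAF or MALVINE. The idea is that a linked authority record supplies the other name forms before the query is sent on.

```python
from typing import Dict, List, Set

Record = Dict[str, List[str]]  # MARC-like tag -> heading strings

GERMAN_FILE: List[Record] = [
    {"100": ["Ludwig $b 14 $c franz. König"],
     "400": ["Louis $b XIV $c Roi de France"]},
]
FRENCH_FILE: List[Record] = [
    {"100": ["Louis $b XIV $c Roi de France"]},
]


def headings(record: Record) -> List[str]:
    """All heading strings of a record, regardless of tag."""
    return [h for values in record.values() for h in values]


def search(records: List[Record], term: str) -> List[Record]:
    """Naive case-insensitive substring search over all headings."""
    needle = term.lower()
    return [r for r in records
            if any(needle in h.lower() for h in headings(r))]


def name_forms(record: Record) -> Set[str]:
    """Text before the first subfield marker of every heading."""
    return {h.split("$")[0].strip() for h in headings(record)}


def expanded_search(target: List[Record], term: str,
                    authority: List[Record]) -> List[Record]:
    """Add the name forms of matching authority records to the query first."""
    forms = {term}
    for record in search(authority, term):
        forms |= name_forms(record)
    return [r for r in target if any(search([r], form) for form in forms)]
```

With these definitions, `search(FRENCH_FILE, "Ludwig")` finds nothing, while `expanded_search(FRENCH_FILE, "Ludwig", GERMAN_FILE)` also tries "Louis" and finds the record. Expanding on the bare name alone would also match other persons named Louis, so a real service would compare the full heading.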

The Issue of Names
- Turgenev, Ivan S.
- Turgenjew, Iwan S.
- Turgenev, Ivan Sergeevich
- Turgenev, Ivan Sergeevic
- Turgenev, Ivan
- Turgenjew, Iwan
- Turgenjew, Iwan Sergejewitsch
- Turgenjew, I. S.
- Turgenjew, I.
- Turgenew, Iwan
- Turgenew, Iwan S.
- Turgenew, Iwan Sergejewitsch
- Turgenjew, Iwan Sergejewic
- Turjenjew, Iwan S.
- Turgenjeff, Iwan
- Turgenjeff, Iwan S.
- Turgenjeff, I. S.
- Turgeniew, Iwan S.
- Turgeniew, Iwan
- Turgenjev, Iwan
- Turghenew, Iwan
- Turgenew, Johann von
- Turgenew, I. S.
- Turgeneff, Johann von
- Tourguénev, Ivan
- Turgeneff, Sergei
- Turgénjew, Iwan Sergejewitsch
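To give a feel for how far apart these forms are as plain strings, the following Python sketch ranks headings by surname similarity using the standard library's `difflib`. This is a generic string-similarity measure chosen for illustration, not the matching method used in LEAF, and the `fold`, `surname` and `rank` helpers are hypothetical.

```python
import unicodedata
from difflib import SequenceMatcher
from typing import List, Tuple


def fold(name: str) -> str:
    """Lower-case a name and drop accents, spaces and punctuation."""
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(c for c in decomposed if c.isalpha()).lower()


def surname(heading: str) -> str:
    """Part of an inverted heading before the first comma."""
    return heading.split(",")[0]


def similarity(a: str, b: str) -> float:
    """Ratio between 0.0 and 1.0; 1.0 means the folded strings are identical."""
    return SequenceMatcher(None, fold(a), fold(b)).ratio()


def rank(reference: str, candidates: List[str]) -> List[Tuple[float, str]]:
    """Sort candidate headings by surname similarity to the reference heading."""
    scored = [(similarity(surname(reference), surname(c)), c) for c in candidates]
    return sorted(scored, reverse=True)
```

Calling `rank("Turgenev, Ivan", [...])` on the list above puts identical surnames first and transliterations such as "Turgenjew" or "Tourguénev" lower down. No single threshold separates all variants of one person from other people, which is why explicitly linked authority records remain necessary.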

Integration
- LEAF functionalities were tested in combination with the MALVINE service, a distributed search service for manuscript descriptions
- How does it work?

Next steps / Failed Plans
- After the end of the project (February 2004), the consortium planned to scale up to a full LEAF service
- So far there has been no follow-up, and the project website has disappeared
- However, the prototype remains
- No integration yet with MALVINE
- Current deliberations: Kalliope, MALVINE

Follow-up?
- Reactivate?
- Study add-ons to VIAF? E.g. annotations
- Any experiences with LEAF to share?
- Depends on the future of TEL!

Contacts
- Technical questions:
- All other questions: