Subject To Change automatic catalog enrichment with subject headings and codes 10th IGeLU conference Budapest, 3.9.2015 Marcus Zerbst Zentralbibliothek.

Slides:



Advertisements
Similar presentations
The creation of "Yaolan.com" A Site for Pre-natal and Parenting Education in Chinese by James Caldwell DAE Interactive Marketing a Web Connection Company.
Advertisements

Auto-Graphics Update Mary E. Jackson Product Manager, Resource Sharing October 20, 2010.
Opening the Door: using Endeca for a faceted catalog Emily Lynema NCSU Libraries MLC: Discovery & Access March 2, 2007.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
EBooks in the Online Catalog: Challenges and Opportunities Gary Moore, Susannah Benedetti University of North Carolina Wilmington OLAC Conference 2006.
Cataloging: Millennium Silver and Beyond Claudia Conrad Product Manager, Cataloging ALA Annual 2004.
Converging parallel universes Library services as building blocks of digital humanities research 42nd LIBER Annual Conference Munich June 2013 Gregor Horstkemper.
Virtual Library Slavistics Its modules & new technologies COSEELIS conference 2009 Cambridge, April 6th, 2009.
5 th September 2003Diane Tough Content Creation at the NHM or The evolving catalogue!
Cambridge University Library From Data To Discovery Pete Girling – Systems Librarian, Huw Jones - Systems Librarian,
Library integrated system -Aleph Fang Peng Stony Brook University.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen Cornell University May 16, 2006
LSTA Digital Imaging Grants Presentation Projects Workshop September 13, 2002 Wendy Sistrunk Music Catalog Librarian University of Missouri—Kansas City.
ERIN STALBERG NCSU LIBRARIES SEPTEMBER 16, 2009 Cool Tools – More Connexion.
Introduction to MARC Cataloguing Part 2 Presenters: Irma Sauvola: Part 1 Dan Smith: Part 2.
Microsoft Office Word 2013 Expert Microsoft Office Word 2013 Expert Courseware # 3251 Lesson 4: Working with Forms.
Is Cataloging Dead: Advocacy for Bibliographic Control Randy Roeder and Rebecca Routh ILA/ACRL Spring Conference Davenport, Iowa March 3, 2008.
Vended Authority Control --Procedures and issues.
What’s New in VRS? GUGM May 15, 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
Updated :02 Hong Kong University of Science & Technology Library XML Name Access Control Repository at the Hong Kong University of Science.
1 DATABASES By: Hanna Ben-Or Phone: October 2011.
ALLIANCE Administration 20 Oct 2009 (Based on Release 2.2) Michaël Petit.
Alma 1 year after STP: implementing batch services IGeLU Budapest Sep 2, 2015 Bart Peeters Head Operations LIBIS.
Let VRS Work for You! ELUNA Conference 2008 Presenter: Kelly P. Robinson GIL Service Georgia State University
Library.dk Integration of National and Local library services ELAG Rome 18 April 2002 Leif Andresen Danish National Library Authority.
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
Cataloging 12.3 to 14.2 Seminar. Cataloging 2 -New check routines -Cataloging authorizations -Other innovations -Fix and expand routines -Floating keyboard.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Relational Databases Melton, Beth “Databases: Access Terminology and Relational Database Concepts.” 09/LPMArticle.asp?ID=73http://pubs.logicalexpressions.com/Pub00.
NCSU Libraries Andrew Pace & Emily Lynema NCSU Libraries May 24, 2006.
DACS Describing Archives: A Content Standard. The Background  Archives, Personal Papers & Manuscripts, 1980s –New Technologies with Web, XML, EAD –Revision.
Library needs and workflows Diane Boehr Head of Cataloging National Library of Medicine, NIH, DHHS
From AACR2 to RDA: An Evolution Kathy Glennan University of Maryland.
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
The Dewey Consortium in Iceland and the use of Dewey in the National Union Catalogue Ragna Steinarsdóttir Director of Acquisitons and Cataloging National.
Endeca: a faceted search solution for the library catalog Kristin Antelman & Emily Lynema UNC University Library Advisory Council June 15, 2006.
Planning for Life after OCLC Passport for Cataloging An overview of the new OCLC cataloging service Revised April 2002.
Information for Scotland 816 Nov 2001 The potential of CORC Gordon Dunsire presented at Information for Scotland 8 16 November 2001, Edinburgh.
Cataloging from A (authorities) to C (Connexion) Authority control in Millennium Presented by: Lynn Whittenberger Catalog/Database Management Librarian.
The physical parts of a computer are called hardware.
Web Discovery and Millennium Integrating Millennium with Summon Helen Bronleigh Library Systems Coordinator.
ARABIC SCRIPT CATALOGUING at Georgetown University in Qatar Stefan Seeger MENA-IUG 5 th Annual Conference, Dubai 2010.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Basic ALEPH ( Library Online Catalog) Anne Bardolph, Acquisitions Librarian FSU College of Law Library Fall 2006.
Migration of Physical to Electronic (P2E) Resources in Alma
Loading Bibliographic Records Online and in Batch Pat Riva Romance Languages Cataloguer/ Bibliographic Database Specialist McGill University
Cataloging/Acquisitions Workflow William Rainey Harper College James Edstrom Michele Ukleja.
1 Introduction to Metadata: The Role of the Metadata Editor Institutional Repository Workshop 1-3 April 2009 Marguerite Nel Metadata editor
The world’s libraries. Connected. RDA & OCLC Glenn Patton Director, WorldCat Quality Management.
Cataloging v.16 eSeminar September 2003 Judith Fraenkel.
| Barbara Pfeifer | VIAF workshop Strasbourg | VIAF partners: Deutsche Nationalbibliothek (DNB) Barbara Pfeifer.
5 Copyright © 2008, Oracle. All rights reserved. Testing and Validating a Repository.
How to create and use authority records Version 16 and up Yoel Kortick.
The ___ is a global network of computer networks Internet.
Lihong Zhu Interim Cataloging Manager/Monographic Cataloging Librarian Washington State University Libraries
| 29 | Machine-based issuing of DNB Subject Categories and DDC Short Numbers for Medicine | 25. April Machine-based issuing of DNB Subject Categories.
NC LIVE Titles Common Problems Ralph Kaplan 3 April 2003.
Information Literacy University of Namibia Library 2006.
Merge Rules and Routines
FAST at the British Library
Metadata Editor Introduction
Tools and Techniques to Clean Up your Database
Tools and Techniques to Clean Up your Database
Working the A to Z List enhance journal access in the OPAC
EDS Discovery Health & EBSCO eBooks Workflow Optimization
Decisions, Decisions: How to Determine the Appropriate Method of Cataloging Special Collections in the 21st Century Presented by Patricia Falk, Music Catalog/Metadata.
Metadata - Catalogues and Digitised works
Onboarding Webinar 13 April 2019 Presented by and.
Everything Union Catalog
Using FAST (Faceted Application of Subject Headings) in CONTENTdm
Presentation transcript:

Subject To Change automatic catalog enrichment with subject headings and codes 10th IGeLU conference Budapest, Marcus Zerbst Zentralbibliothek Zürich, Systems Librarian

Overview Background → Desire to align legacy data with GND and LCSH and DDC data → The Digital Assistant Automatic catalog enrichment with subject headings and codes → Parameters, technical workflow, numbers and statistics Prospect → Plan to use more subject cataloging data from external sources Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 20152

Excuse me, what is GND? 3Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

What‘s GND? GND → The Integrated Authority File (GND) is an authority file for Persons, Corporate bodies, Conferences and Events, Geographic Information, Topics and Works. It is used above all for the cataloguing of literature by libraries […] It is operated cooperatively by the German National Library, all German-speaking library networks […] → In April 2012 the GND replaced [various] previously separate authority files. (from Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 20154

Background 5Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Desire: Subject Indexing of older records with GND/LCSH data → Since Autumn 2012: Cease homegrown system for subject indexing in favour of GND → Standards in German speaking market instead of local peculiarities → Desire for better searchability of all titles by consistent indexing per GND data → Idea: Enrich old records with GND data – manual cataloging not an option Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 20156

The Digital Assistant → In production in the parameters of computer-aided subject indexing since October 2013 → Helps subject librarians with indexing by suggesting subject entries according GND → Generates suggestions based on external database entries, translation and statistical analysis of table of contents → Is used in an intuitive web client → Daily import of fields for processed titles to Aleph Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 20157

DA – Flow Chart Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept ILS Aleph Import in DA Load TOC Load metadata (Z39.50) Enrichment Subject heading from WorldCat,… and Autocat Editing in DA Choice / addition of headings Suggestions, headings lists Export from DA Daily in Aleph seq format GND/S headings, scope 072, loc. fields Import Daily by FTP Info

ToC Technology Components DA Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Project Rekat automatic catalog enrichment with subject headings and codes 10Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Project Rekat Parameters Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept → Enrichment of all our records from publication years From Autumn 2012 on: subject indexing with GND → Around 1.8 million titles → Matching records against WorldCat → Enrichment with GND, LCSH and DDC

Retrospektive Erschliessung– Ablaufschema Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept Aleph / EBI01 Export One-time marcXML Match WorldCat GND, LCSH, DDC Import Consortium cataloging rules Avoid duplicates

Technical Workflow 13Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Find match in WorldCat Order of precedence: ISBN Author & title all terms from 245$$a/$$b and 100$$a Both to exist mandatory As 2., but no stop words Author a & title a/b/c in full record, no stop words. Same no of terms Search=Find Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Import of Data: Adaption I → Transfer of records found in WorldCat → Received: All subjects of all matches All DDC codes → Avoid duplicates: → Check against data in NEBIS catalog → On text level, no Aleph routine Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Import of Data: Adaption II → Local routine adjusts MARC21 to KIDS (Swiss MARC rules) → Avoid duplicates Received: _ $$a $$a501 Adapted and deduplicated: _ $$a501 Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Import of data: Adaption III Technical difference after adaption In 082 all 2nd indicators are stripped during import, afterwards identical fields are purged. But if 1st, valid indicator differs, then fields with identical content are technically different, and will be imported. Received: _4$$a $$a $$a Adapted: __ $$a _ $$a _ $$a Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Testing 18Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Testing → Every 5th year is checked by a subject cataloger (12 years). Inspection of correct attribution of subject terms to title Inspection of correct and complete import to Aleph → By these measures several problems could be detected and eliminated. Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

→ Form/type heading «Online Publication» might be wrong and so was eliminated in general → LCSH: 650_0 not identical to 65000: Import of non LCSH subjects in various languages – not intended → Music material (notes etc.) without ISBN tend to bring bad results, so were eliminated. Problems identified Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

→ L $$aPhysics$$xEarly works to 1800 → L $$aPhysics$$yEarly works to 1800 → L $$aSwitzerland$$vBibliography → L $$aSwitzerland$$xBibliography → L $$aAdministrative law$$vCases$$zSwitzerland → L $$aAdministrative law$$zSwitzerland$$vCases → L $$aAdministrative law$$zSwitzerland$$xCases Assumed issues Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

→ L $$aPhysics$$xEarly works to 1800 → L $$aPhysics$$yEarly works to 1800 → L $$aSwitzerland$$vBibliography → L $$aSwitzerland$$xBibliography → L $$aAdministrative law$$vCases$$zSwitzerland → L $$aAdministrative law$$zSwitzerland$$vCases → L $$aAdministrative law$$zSwitzerland$$xCases Assumed issues Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Numbers and Statistics 23Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Statistics → Records sent: 2.5 million → Records enriched: 1.8 m Matches by ISBN: 1.45 m → Enrichment: GND: 1.12 m DDC: 1.33 m LCSH: 1.21 m Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Processed / Enriched Titles Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Match Types from Total Match

Subject Types from Success

Prospect 28Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept 2015

Project FRED – automatic daily enrichment → Trigger On item creation or On item arrival → Continuous search for external data until final edit by subject specialist or item status is «loanable» → Stop flag by subject specialists or during final item handling → Subjects are imported to Aleph, CAT field → Different start and stop flags for ebooks Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept

Conclusion → Just started to load enrichment → Success and acceptance yet to be discovered → High demand of automatic processes → Changes in tasks and function of subject specialists → Reduced demand in larger community environment expected Subject To Change, Marcus Zerbst, IGeLU conference, 3. Sept