Florian Betz Automatic indexing at the German National Library (DNB). Experiences and results.

Slides:



Advertisements
Similar presentations
The OAIS experience at the British Library Deborah Woodyard Digital Preservation Coordinator ERPANET OAIS Training Seminar, Nov 2002.
Advertisements

Subject Analysis: An Introduction Based on BASIC SUBJECT CATALOGING USING LCSH edited by Lori Robare.
METS: An Introduction Structuring Digital Content.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Lecture №2 State System of Scientific and Technical Information.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
1 Managing Legal Deposit for Online Publications in Germany Cornelia Diebel.
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History 2 University of California, Berkeley School of Information IS 245: Organization.
Contents and Formats Existing Digital Sources Gertraud Griepke Cornell University, July 26th 2002.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Thesaurus Design and Development
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
AACR3: Resource Description and Access Presented by Dr. Barbara Tillett Chief, CPSO Library of Congress 2004.
The Library Cataloging Tradition
Monguz Kft. Various aspects of metadata structuring, construction and the business logic characteristics of integrated library systems. The highest level.
Grey Literature, E-Repositories and Evaluation of Academic & Research Institutes. The case study of BPI e-repository Maria V. Kitsiou - Head Librarian,
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
June Overview of Operations & the INIS Record INIS Training Seminar 2-6 June 2003 Vienna, Austria Seyda RIEDER INIS Section Supervisor, Bibliographic.
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
Inside the DDC Dewey goes Europe: On the use and development of the Dewey Decimal Classification (DDC) in European libraries Austrian National Library.
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
Lecture Four: Steps 3 and 4 INST 250/4.  Does one look for facts, or opinions, or both when conducting a literature search?  What is the difference.
DINI „Electronic Publishing Group“ DINI – Certificate Document and Publication Repositories “Electronic Publishing Group“
Subject To Change automatic catalog enrichment with subject headings and codes 10th IGeLU conference Budapest, Marcus Zerbst Zentralbibliothek.
1. 2 Content The Romanische Bibliographie Online is the only comprehensive specialist bibliography for Romance language and literature studies –available.
The Library Cataloging Tradition Marty Kurth CS 431 February 9, 2005 [slides stolen from Diane Hillmann]
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
Digital Archiving in the Hungarian Széchényi Library The story and the plans of the Hungarian Electronic Library Rome, 21. Oct István Moldován OSZK,
P. Schirmbacher Humboldt-Universität zu Berlin The Changing Process of Scholarly Publishing or the Necessity of a New Culture of Electronic.
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
Developing Databases and Selecting an Appropriate Library System.
Current Events and Issues Using Index Databases for Finding Answers.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Research library of the National Aerospace University Kharkiv Aviation Institute.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Resource Description and Access Deirdre Kiorgaard ACOC Seminar, September 2007.
Translingual Retrieval Moving between vocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability.
AACR2 Pt. 1, Monographic Description LIS Session 2.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
The physical parts of a computer are called hardware.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Improving Description through Collaboration: The Ethnomusicological Video for Instruction & Analysis Digital Archive Music Library Association, February.
AACR 2 –Rules for Descriptive Cataloguing
Information Retrieval
8/28/97Information Organization and Retrieval Introduction University of California, Berkeley School of Information Management and Systems SIMS 245: Organization.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
1 Shelflisting and Filing Rules and Subject Authority Control May 11, 2005.
| 29 | Machine-based issuing of DNB Subject Categories and DDC Short Numbers for Medicine | 25. April Machine-based issuing of DNB Subject Categories.
1 CS 430: Information Discovery Lecture 7 Automatic Generation of Catalog Records.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
DESIGN AND IMPLEMENTATION OF LIBRARY AUTOMATION USING KOHA (Open Source Software) AT BHARATHIDASAN UNIVERSITY COLLEGE, PERAMBALUR R.VENUS MLISc- Final.
0 A model for direct cataloguing in WorldCat Reiner Diedrichs Verbundzentrale des GBV (VZG) CBS-Partner Meeting, September 15th, 2015 Hamburg.
Theory, Tools, History: A Brief Introduction August 17, 2016.
Subject Analysis: An Introduction
New features in DDC applications
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Categories of APTs Dr. Dania Bilal IS 530 Fall 2005.
From the old to the new… Towards better resource discoverability
Professional development training on cataloging at the University Wisconsin-Madison Memorial Library, USA 14th October -24th October, 2016 Aigerim Shurshenova.
Introduction to Metadata
Cataloging Tips and Tricks
MARC: Beyond the Basics 11/24/2018 (C) 2006, Tom Kaun.
Introduction of KNS55 Platform
MUMT611: Music Information Acquisition, Preservation, and Retrieval
SupportCenter Plus Product Overview.
FRBR and FRAD as Implemented in RDA
Presentation transcript:

Florian Betz Automatic indexing at the German National Library (DNB). Experiences and results

Development of automatic indexing with the English-language LCSH 2 Table of contents Chronological summary of efforts on automatic indexing at the German National Library (DNB) Automatic indexing with the German- language Integrated Authority File (GND) Development of automatic indexing with the English-language LCSH Outlook | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

1. Chronological summary of efforts on automatic indexing at the DNB 3 1. Chronological summary of efforts on automatic indexing at the DNB | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Mandate of the German National Library (DNB) 4 Mandate of the German National Library (DNB) “to collect in the original, to inventory, to catalogue and bibliographically index, to permanently safeguard and to prepare for use by the general public media works published in Germany from 1913 on and b) German-language media works, translations of German-language media works into other languages and foreign-language media works about Germany published abroad from 1913 on and to provide central library and national bibliographic service” Quelle: inofficial translation of DNBG - http://www.dnb.de/SharedDocs/Downloads/EN/DNB/wir/dnbg.pdf?__blob=publicationFile - DNBG § 2 Functions, powers (PS: and to provide central library and national bibliographic service = v.a. Deutsche Nationalbibliografie + Integrated Authority File (GND)). - DNBG §3 Media works: Definition of „media works“ = „all representations in text, image and sound that are distributed in material form or made accessible to the public in immaterial form”. Definition of „media works in immaterial form” = “all representations in public networks.” = online publications | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Collection of online publications 5 Collection of online publications 1996-2006: Collecting of online publications but no legal obligation 2006: Official collection mandate for online publications by the „Law regarding the German National Library“ (DNBG) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Background I: Increase of incoming publications 6 Background I: Increase of incoming publications Quelle: Zuarbeit Sandro Uhlmann (Zahlen + leicht abgeändertes Diagramm) _______ Dr. Peter Leinen, MAV 2017, Folien: Year| stock| acquisitions 2016: 3.500.000; 1.300.000 expected 2017: 5.800.000; 2.300.000 2018: 11.700.000; 5.900.000 ____ NP-Ordner-Statistik, nach Gattungen: Monographs ca. 37,9% University publications 5,3% BoD-titles 13,3% Miscellanous monographs 19,3% Perdiodicals ca. 61,2% E-paper (newspaper editions) 38,9% Miscellanous periodicals (issues and articles) 22,3% Total sum: 98,1% (rest miscellanous; websites = 0,3%). | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Background II: Intellectual subject cataloguing at the DNB 7 Background II: Intellectual subject cataloguing at the DNB Subject categorisation: assignment of about 100 DNB subject categories based on DDC numbers (level of Hundreds, e.g. 150 Psychology, 330 Economics) obligatory for all publications used as classificatory arrangement for the Deutsche Nationalbibliografie Classificatory indexing: assignment of full DDC numbers Subject indexing: assignment of subject headings of the Integrated Authority File (GND) syntactical indexing according to the RSWK (Rules for Subject Cataloguing) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

8 Example: Title announcement in the Deutsche Nationalbibliografie (series A) Quelle: http://d-nb.info/1125537922 = Deutsche Nationalbibliografie 2017 A 07 (pdf). | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Automatic cataloguing: How it began? 9 Automatic cataloguing: How it began? | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

10 2010: Introduction of new series „O“ of the Deutsche Nationalbibliografie Series O comprises all online publications (formerly media-independent sorted). At the same time termination of intellectual subject cataloguing of online publications announced in series O Quelle: http://www.dnb.de/EN/Service/DigitaleDienste/DNBBibliografie/reiheO.html Series O is issued monthly and comprises: all online publications, including journals print-on-demand publications based on a digital master online publications in the field of music data carriers and media combinations are not included | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

11 The Project PETRUS (2009-2011) PETRUS = Process-supporting software for the digital German National Library Large inter-divisional DNB-project for developing and establishing automatic cataloguing procedures Current automatic subject indexing procedures at the DNB originally go back to scenario 4 of this project. Subsequently institutionalisation and further development of the different scenarios to productive automatic cataloguing procedures PS: Automatic metadata exchange between parallel editions or different editions of the same work as second example originally go back to scenario 2 of the PETRUS project. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

12 Further chronology of productive automatic subject cataloguing procedures 2012: Start of automatic assignment of DNB subject categories for German- and English-language titles 2014: Start of automatic subject indexing of German- language university publications 2015: Start of automatic assignment of DDC short numbers for medicine (automatic classificatory indexing) for German- and English-language university publications 2017: Start of automatic subject indexing of German- language BoD-titles (without fiction) and of scientific articles Latest: Start of automatic subject indexing of print publications by digitised ToCs (university publications; publications outside the publishers‘ booktrade) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

2014: Organisational reform 13 2014: Organisational reform Subject area Automatic indexing was prior to 2014 part of the Departement of Subject Cataloguing. 2014: Formation of the „Section AEN: Automatic indexing; Online publications“ by centralisation of formerly dispersed subject areas. Section AEN comprises automatic subject cataloguing as well as big parts of aquisition and descriptive cataloguing of online publications (last only for periodicals). Overview of the DNB organisation: http://www.dnb.de/EN/Wir/Organisationsuebersicht/organisationsuebersicht_node.html | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

14 Table of contents Chronological summary of efforts on automatic indexing at the German National Library (DNB) Automatic indexing with the German- language Integrated Authority File (GND) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Objectives and principles 15 Objectives and principles Method: Computational linguistic approach combined with the use of a dictionary Basis: bibliographic metadata; fulltext; digitised ToCs Document formats: PDF; Epub Terminology: Integrated Authority File (GND) Output: currently maximum 10 subject headings per document or all subject headings above a defined threshold Software: Averbis GmbH (Freiburg im Breisgau) Averbis Extraction Platform (AEP) Averbis Terminology Platform (ATP) Quelle: Konfigurationen-Einstellungen unter: V:\\03_FB_EE\11_AEN\03_Erschliessungsverfahren\AEP-DNB\01_Produktiv-Stage aktuell: S_ART2_WB41_20170313_de_Release222.xlsx = 7 mit Thresholds aktuell: S_BOD2_WB41_20170313_de_Release222.xlsx = 5 mit Thresholds aktuell: S_TOC1_WB41_20170313_de_Release222.xlsx = 10 mit minimalen Thresholds aktuell: S_WA6_WB41_20170313_de_Release222.xlsx = 7 mit Thresholds | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Objectives and principles 16 Objectives and principles Goal is a homogeneous coverage with subject headings of the Integrated Authority File (GND) for printed and online publications, by intellectual and automated indexing. Dictionary care is indispensable and time-consuming. Not in all cases the GND includes the needed concept. Automated subject indexing results are of lower quality than the results of intellectual indexing. Criterion for the quality of machine-generated subject headings is the usefulness for retrieval purposes. Machine-generated subject heading strings are non syntactical. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Integrated Authority File (GND) 17 Integrated Authority File (GND) The Integrated Authority File contains headings for descriptive as well as for subject cataloguing (so-called partial stock “s”). The copy of this partial stock „s“ as currently used for automatic indexing counts: Zahlen: Produktivstage GND_1420 (2017-04-19), ohne Hinweissätze. GND-History: - authority file for descriptive and subject cataloguing in the German-language area - collective maintenance and use - central editorial function at the DNB -- stablished in 2012 by merging and replacing four predecessor authority files: German Personal Name Authority File (PND) Corporate Body Authority File (GKD) Subject Headings Authority File (SWD) Uniform Title File of the Deutsches Musikarchiv (DMA-EST file) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

18 Workflow and Routine 6 1 2 4 5 Quelle: Sandro Uhlmann (Folien Schweden-Vortrag 2015-10-07), Aufzählung Dokumentgruppen bearbeitet be. Automatic processes each night – in case of subject indexing: German-language scientific works German-language book-on-demand titles (without fiction) German-language scientific articles German-language digitised ToCs of series B and H (print publications) 3 | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Averbis Software: processing steps 19 Averbis Software: processing steps Ambiguity Bank [bank -> Bank] [bank -> Bank <Möbel>] Quelle: Sandro Uhlmann (Folien Schweden-Vortrag 2015-10-07). Finnisch: Bank als Kreditinstitut = p-a-nkki Finnisch: Bank <Möbel> = p-e-nkki | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Averbis Software: Dictionary 20 Averbis Software: Dictionary Terminologies are cared on a profile basis. Profile = set of single or systematic manipulations of a terminology for adjusting it to the needs of automatic processes For systematic and extensive modifications complex filters can be defined using regular expressions. Levels of terminology: Hierarchies of concepts (leaf nodes) Concepts Terms Mappingforms | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Averbis Software: Configuration 21 Averbis Software: Configuration Configuration = combination of parameter settings for regulating the indexing process The used dictionary profile is one parameter of a configuration. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Entry of machine-generated subject cataloguing data in title record 22 Entry of machine-generated subject cataloguing data in title record title: Theater ist eine Volkssauna. Politisches Gegenwartstheater aus Finnland in der Tradition von Bertolt Brecht automatic subject cataloguing entries: 5050 DNB subject categories $E p=from parallel edition| a=deliverer $H origin (wbf=webform) $D date 5051 $L configuration used for automatic subject indexing 510X intellectually assigned GND subject headings Quelle: PPN 1058675222 = http://d-nb.info/1058675222 (Niklas Füllner; Bochum, Diss. 2014). – Intellektuelles Indexat deshalb keine Portalanzeige mSW. 540X intellectually assigned DDC numbers 5540 machine-generated subject headings $K confidential value $D date | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Entry of machine-generated subject cataloguing data in title record 23 Entry of machine-generated subject cataloguing data in title record title: Symposium: Computational Chemistry, Physics and Biology foreign data LCSH: automatic subject cataloguing entries: Quelle: http://d-nb.info/1077042000. Ulm = Verlagsort und Veranstaltungsort. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Display of machine-generated subject headings in the DNB catalogue 24 Display of machine-generated subject headings in the DNB catalogue Quelle: http://d-nb.info/1077042000 subject headings (*machine-generated) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Development of automatic indexing with the English-language LCSH 25 Table of contents Chronological summary of efforts on automatic indexing at the German National Library (DNB) Automatic indexing with the German- language Integrated Authority File (GND) Development of automatic indexing with the English-language LCSH | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

26 Project MAEN MAEN = automatic indexing of English-language online publications Initial position: 2010 stop of intellectual indexing of online publications (series O) English-language online publications are not covered by current productive automatic indexing routines with the Integrated Authority File (GND). English-language online publications are by far the second largest group of online publications in the entire stock. Project term: 04/2016-06/2018 | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Objectives and principles 27 Objectives and principles Subsequent use of the Averbis Software after adjustments and additions for English Primary goal: automatic assignment of English-language subject headings taken from a controlled vocabulary Use of the Library of Congress Subject Headings (LCSH) as indexing terminology and output terminology | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Library of Congress Subject Headings (LCSH) 28 Library of Congress Subject Headings (LCSH) Universal English-language subject headings authority file of the Library of Congress Approximately 415.000 concepts and 350.000 synonyms Structural principle: precoordination Low coverage of LCSH by LCC numbers (about 25%) For subject indexing used in combination with the Name Authority File (NAF) of the LoC. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

LCSH as imported terminology in the Averbis Software 29 LCSH as imported terminology in the Averbis Software Import: SKOS/RDF-XML dump of the LoC Besides preferred and alternative labels creation of additional mappingforms for each of them according to specific rules, for example: label: Phototropism (Chemistry)  additional mapping forms: Chemistry Phototropism Chemistry Zusätzliches Beispiel (Painting, Finnish—French influences) nach 03/2017 neu spezifizierten Mappingregeln (Komma + Permutation), d.h. 2017-05-04 noch nicht umgesetzt in ATP. label: Painting, Finnish--French influences additional mappingforms: Finnish painting French influences French influences Finnish painting | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Positive indexing example of current development status: 30 Positive indexing example of current development status: title: Positive Psychology and Change : How Leadership, Collaboration, and Appreciative Inquiry Create Transformational Results machine-generated LCSH: Leadership Change (Psychology) Management Appreciative inquiry Positive psychology Organizational change Quelle: AQM-Appr-Projekt S_QM-Test_su_2017-03-31, HSG 650 (Konfiguration + WB-Profil ggf. bei su erfragbar, IDN aus Korpus NP_en_iLCSH_2_en (153). Snippet: http://d-nb.info/1099430216 (pdf) preferred label: Organizational change text matches with synonym: Organizational Development | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Negative indexing example of current development status: 31 Negative indexing example of current development status: title: The contribution of non-formal schools in enhancing the provision of basic education in Kenya intellectually determined keywords: Alternative Schools Disadvantaged Children Basic Education Kenya machine-generated LCSH: Basic education Schools Boys Girls Kenya (Southeast Asian People) Learning Teachers Match by mappingform „Kenya“. No disambiguation, no counter candidate. Dictionary care measure: to set on ignore this label/term, here the preferred label „Kenya (Southeast Asian People)“. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

First findings Dictionary care: 32 First findings Dictionary care: Filtering terms with unwanted qualifier-patterns causing systematical errors is possible. – Another example: „Einstein (Horse)“. Terminology: Without further entities, especially personal and geographical names, there is an unwanted shift towards false homonymous names as substitutes. Solution for terminology level: a) GND-addition; b) NAF-addition; c) FAST. Einstein (Horse) – Match über „Einstein“, keine Disambiguierung. weiteres Beispiel: Chip (Fictitious character : Disney) – Computerchip in LCSH nur im Plural vorhanden. Durch kein Stemming, keine Auswahl des Headings. Hier keine Disambiguierung bei Match über „Chip“. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Development of automatic indexing with the English-language LCSH 33 Table of contents Chronological summary of efforts on automatic indexing at the German National Library (DNB) Automatic indexing with the German language Integrated Authority File (GND) Development of automatic indexing with the English-language LCSH Outlook | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Outlook Further development of indexing procedure 34 Outlook Further development of indexing procedure Developing of a procedure and workflow of text structure analysis, recognition and segmentation Retrospective indexing of online publications Data delivery of machine generated subject heading strings Further development and extension of automatic indexing of digitised ToCs taken from print publications Project TeMa (= Terminology management for support of intellectual and automatic subject cataloguing) MARC-Datenauslieferung mSW: Herbst 2017, möglicherweise erst Frühjahr 2018. RDF noch nicht konkret, möglicherweise Frühjahr 2018. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Thank you for your attention. Questions? 35 Thank you for your attention. Questions? Further information: DNB/Automatic cataloguing Literature: Ulrike Junger (2014) Can Indexing Be Automated? The Example of the Deutsche Nationalbibliothek, Cataloging & Classification Quarterly, 52:1, 102-109, DOI: 10.1080/01639374.2013.854127 / http://dx.doi.org/10.1080/01639374.2013.854127 Contact: Dr. Florian Betz Deutsche Nationalbibliothek / German National Library Automatic indexing; Online Publications Phone: +49–341–2271–588 mailto:f.betz@dnb.de | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Strategic Priorities 2017-2020 36 Strategic Priorities 2017-2020 Chapter: 2 Document| Disseminate. (Quelle: http://d-nb.info/1126595101/34) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Strategic Priorities 2017-2020 37 Strategic Priorities 2017-2020 Chapter: 2 Document| Disseminate. (Quelle: http://d-nb.info/1126595101/34) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Strategic Priorities 2017-2020 38 Strategic Priorities 2017-2020 Chapter: 2 Document| Disseminate. (Quelle: http://d-nb.info/1126595101/34) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

39 DNB Strategic Compass 2025 Chapter: 2 Document| Disseminate. (Quelle: http://d-nb.info/1112299556/34) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Data delivery and metadata provenance 40 Data delivery and metadata provenance Currently machine generated subject headings are only used for the DNB catalogue and are not delivered via data services to data recipients. Start of delivery is expected for 2018. For traceability of data origins „Machine-generated Metadata Provenance“ (= data about metadata) will be expressed by use of MARC 21 field 883. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

41 Linguistic analysis Quelle: Sandro Uhlmann + Jan-Helge Jacobs (Folien Lunch-Talk 2014-09-30| 2014-10-02); Titel geändert be von „Linguistics“ zu „Linguistic analysis“. | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017

Quality management Statistical and error analysis 42 Quality management Statistical and error analysis to identify systematic errors to recognise trends to identify gaps of terminology to remove weak points from workflow to exploit technical progress Quelle: Sandro Uhlmann (Folien Schweden-Vortrag 2015-10-07) | Automaattisesti parempi? - Automatic indexing at the DNB | 4th of May 2017