Vocabulary Matching for Book Indexing Suggestion in Linked Libraries – A Prototype Implementation & Evaluation Antoine Isaac, Dirk Kramer, Lourens van.

Slides:



Advertisements
Similar presentations
Using SKOS in practice, with examples from the classification domain
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Thesaurus speed dating conclusions. The ideal thesaurus… …is tailor-made for the special needs of its user community. In other words, it is different.
The Ontology Construction Problem Ontology construction requires the active engagement of domain experts Existing ontology authoring tools are not tailored.
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
OAEI 2007: Library Track Results Antoine Isaac, Lourens van der Meij, Shenghui Wang, Henk Matthezing Claus Zinn, Stefan Schlobach, Frank van Harmelen Ontology.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Interoperability in the Cultural Heritage Domain Lourens van der Meij VU Amsterdam – KB (part of sheets by A.Isaac) October 3 rd, 2008.
STITCH final event KB July Agenda Brief presentation of STITCH main achievements Demo: annotation suggestion at KB The future use of STITCH results.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features INIS Training Seminar 7-11 October 2013, Vienna Domenico.
A web-based repository service for vocabularies and alignments in the Cultural Heritage domain Lourens van der Meij Antoine Isaac Claus Zinn.
Using quantitative aspects of alignment generation for argumentation on mappings Antoine Isaac, Cassia Trojahn, Shenghui Wang, Paulo Quaresma Vrije Universteit.
Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
Senior Project Database: Design and Usability Evaluation Stephanie Cheng Rachelle Hom Ronald Mg Hoang Bao CSC 484 – Winter 2005.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
The Value of Usage Scenarios for Thesaurus Alignment in Cultural Heritage Context Antoine Isaac, Claus Zinn, Henk Matthezing, Lourens van der Meij, Stefan.
An Empirical Study of Instance-Based Ontology Mapping Antoine Isaac, Lourens van der Meij, Stefan Schlobach, Shenghui Wang funded by NWO Vrije.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. WSMX Data Mediation Adrian Mocan
Multi-Concept Alignment and Evaluation Shenghui Wang, Antoine Isaac, Lourens van der Meij, Stefan Schlobach Ontology Matching Workshop Oct. 11 th, 2007.
Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case Antoine Isaac Henk Matthezing Lourens van der Meij.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Click to highlight each section of the article one by one Read the section, then click once to view the description of it If you want to read it, you.
Left click or use the forward arrows to advance through the PowerPoint Upon clicking, each section of the article will be highlighted one by one Read.
Topics Covered Abstract Headings/Subheadings Introduction/Literature Review Methods Goal Discussion Hypothesis References.
Accessing Cultural Heritage using Semantic Web Techniques Antoine ISAAC VU Amsterdam - KB Digital Access to Cultural Heritage Master March 20 th, 2008.
Left click or use the forward arrows to advance through the PowerPoint Upon clicking, each section of the article will be highlighted one by one Read.
ROI & Impact: Quantitative & Qualitative Measures for Taxonomies Wednesday, 11 February :00 – 12:30 PM MST Presented by Jay Ven Eman, Ph.D., CEO.
Carlos Lamsfus. ISWDS 2005 Galway, November 7th 2005 CENTRO DE TECNOLOGÍAS DE INTERACCIÓN VISUAL Y COMUNICACIONES VISUAL INTERACTION AND COMMUNICATIONS.
Mantova 18/10/2002 "A Roadmap to New Product Development" Supporting Innovation Through The NPD Process and the Creation of Spin-off Companies.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
TDT4252/DT8802 Exam 2013 Guidelines to answers
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
Linked Data & Europeana Antoine Isaac DARIAH Linked Data workshop Nov 24, 2010.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
Europeana and semantic alignment of vocabularies Antoine Isaac Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber Europeana & Vrije.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Multilingual Information Exchange APAN, Bangkok 27 January 2005
Leveraging Reusability: Cost-effective Lexical Acquisition for Large-scale Ontology Translation G. Craig Murray et al. COLING 2006 Reporter Yong-Xiang.
DDI-RDF Leveraging the DDI Model for the Linked Data Web.
ESS-net DWH ESSnet DWH - Metadata in the S-DWH Harry Goossens – Statistics Netherlands Head Data Service Centre / ESSnet Coordinator
1 Everyday Requirements for an Open Ontology Repository Denise Bedford Ontolog Community Panel Presentation April 3, 2008.
GREGORY SILVER KUSHEL RIA BELLPADY JOHN MILLER KRYS KOCHUT WILLIAM YORK Supporting Interoperability Using the Discrete-event Modeling Ontology (DeMO)
Conceptual Data Modelling for Digital Preservation Planets and PREMIS Angela Dappert.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Learning Object Metadata Application Profiles: Lithuanian Approach E. Kurilovas S. Kubilinskienė Centre for IT in Education, MoE Lithuania.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
EConnect WP1 & semantic issues VU members –Guus Schreiber, Antoine Isaac, Jacco van Ossenbruggen, Jan Wielemaker.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
KB subject prediction tool. STITCH final event KB subject prediction prototype Introduction Subject prediction is a special case of book reindexing What.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
Using Bayesian Belief Networks in Assessing Software Architectures Jilles van Gurp & Jan Bosch.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
B usiness- C entric M ethodology For Enterprise Agility & Interoperability Lubash Pyramid Challenges Today’s Approach Doctrine Operations Information Architecture.
Technische Universität München © Prof. Dr. H. Krcmar An Ontology-based Platform to Collaboratively Manage Supply Chains Tobias Engel, Manoj Bhat, Vasudhara.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Document, Index, Discover, Access
Accommodating local cataloguing traditions in a global context
Antoine Isaac SEMIC conference
Presentation transcript:

Vocabulary Matching for Book Indexing Suggestion in Linked Libraries – A Prototype Implementation & Evaluation Antoine Isaac, Dirk Kramer, Lourens van der Meij, Shenghui Wang, Stefan Schlobach, Johan Stapel

Problem: subject indexing Describing subjects of books Using concepts from vocabularies (e.g. thesauri)

Problem: re-indexing Describing a book that has already be described With a new vocabulary –Fitting a different context (e.g., different libraries)

Why re-indexing at KB? The Dutch National Library (KB) holds many books that are also in other Dutch public libraries KB deposit uses Brinkman thesaurus for indexing Public Libraries use Biblion thesaurus

A wider issue KB shares books with many other libraries All having their own description practices

Room for improvement? Libraries devote large resources to indexing –20 people at KB –About 20,000 books per year Leveraging already existing descriptions for re- indexing can be beneficial for both sides

Alignment and re-indexing STITCH project –Tackling semantic interoperability in Cultural Heritage –Using ontology alignment Mappings between concepts from different vocabularies can be used for re-indexing Basic idea: replace concepts in descriptions by conceptually equivalent concepts

Goal: a re-indexing prototype Past: preliminary experiments with KB data Now: building a prototype and –plugging it onto the KB production system –having it evaluated by its potential users (indexers) Prototype case: Dutch public libraries / KB Suggesting Brinkman subjects based on Biblion ones

Alignment and re-indexing: requirements Subjects can be complex Mappings between groups of concepts "Travel guides" + "Spain" → "Spain; travel guides" Concepts are used in descriptions Mappings taking into account extensional semantics "Building engineering" → "Learning material ; building engineering"

Obtaining re-indexing rules Lexical alignments are not good enough Probabilistic rules are calculated –Using extension of concepts: existing indexing –Simple probabilities, with adhoc adjustment "Travel guides","Spain"→"Spain; travel guides", Not only based on Biblion subjects –AUT – main authors of books –KAR – “characteristic” –DGP – intellectual level/target group

Demo Doesn't work?

User study Quantitative aspect –How well does the tool compare to human subject indexing? Qualitative aspect –User satisfaction –Improvement suggestion

Evaluation setting 6 indexers 6 weeks 284 books Evaluation integrated in daily indexing work Pre-evaluation briefing Questionnaire during evaluation Post-evaluation de-briefing & questionnaire

User study results Top ranked mappings are indeed much better Individual book satisfaction level > 70% Suggestion class# suggestionsprecisionrecall blue %47.9% purple1, %27.1% red2, %5.98% non suggested8919.0%

User study results (1) But the general satisfaction is lower –Only two out of six would use the tool as such Quality of suggestions –Lower-level suggestions are often not meaningful Perception of suggestions' quality –Long lists with wrong suggestions ad the end are bad –Ranking is appreciated, but it is not enough

User study results (2) Suggestions were found promising Bridging the indexing gap between collections –Different indexing strategies "Persian language" (Biblion) vs. "Iranian language and literature" (Brinkman) Lots of suggestions for improvement More re-indexing! –Suggesting concepts from other vocabularies –More context metadata as input

Conclusions Shows the potential of re-using data in a library network Alignment approach fitting indexing practice Concrete demonstration, in KB production environment Technology transfer: KB wants to continue efforts Flexibility: architecture ready to exploit other vocabularies –Linked data & SKOS

Prototype components

Linked libraries?

Thank you! Questions?

Screenshots

WinIBW production tool

STITCH suggestion tool

Original metadata

Concept suggestions

Comparing with human re-indexing

Complement: lexical alignments

Adding subjects using thesaurus access

Concept suggestions

Saving and back to WinIBW

Screenshots Back