Presentation is loading. Please wait.

Presentation is loading. Please wait.

Translingual Retrieval Moving between vocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability.

Similar presentations


Presentation on theme: "Translingual Retrieval Moving between vocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability."— Presentation transcript:

1 Translingual Retrieval Moving between vocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability and Semantics in Knowledge Organization 2010-07-19

2

3 1997 CENL-MACS project started 1997-1999 feasibility study sports & theatre 2000-2001 prototype of a web application 2002-2004 LMI (link management interface) was established 2005 RAMEAU-LCSH-Links were loaded by BnF 2006 CrissCross started MACS Project history

4 2007 SNB started editing links on a regular basis 2008 SNB and DNB prepared work division 2008 update and installation of an update routine for the three indexing languages 2008-2009 recruiting freelancers, training 2009 translators work phase 2009 TEL prototype of a multilingual search interface Project progress MACS 58.907 EN-FR-GER links

5 MACS Project scope 70,000 Tuple RAMEAU - LCSH from BnF 70,000 Tuple RAMEAU - LCSH from BnF 50,000 SWD concepts to connect with 50,000 SWD concepts to connect with SWD-topical headings of ALL disciplines SWD-topical headings of ALL disciplines 12 work packages for neighbouring disciplines 12 work packages for neighbouring disciplines

6 MACS Working area – LMI https://macs.hoppie.nl/lmi/ Searching and listing subject headings Searching and listing subject headings

7 Working area – LMI Update mechanism and identifier Update mechanism and identifier

8 Working area – LMI Searching links Searching links

9 Working area – LMI Editing links / Creating new links Editing links / Creating new links

10 Working area – LMI Working area – LMI Filtering DDC-domains Filtering DDC-domains

11 No language is used as a pivot No language is used as a pivot Headings are not translated Headings are not translated Mapping is done on the basis of concepts Mapping is done on the basis of concepts Linking is checked upon retrieval results Linking is checked upon retrieval results Set of practical rules and guidelines (BS 8723) Set of practical rules and guidelines (BS 8723) Methodology

12 LCSH, RAMEAU pre-coordinated strings / plural LCSH, RAMEAU pre-coordinated strings / plural SWD more flexible; post-coordination / singular SWD more flexible; post-coordination / singular LCSH, RAMEAU subdivision practice LCSH, RAMEAU subdivision practice Language differences Frauenarbeit Femmes -- travail Women -- employment 160,000280,000 96,000 Mörder AND Intensivtäter Tueurs en série Serial murderers

13 Equivalence 1 :1, 1 : n (including AND) Equivalence 1 :1, 1 : n (including AND) Null: no mapping Null: no mapping Types of links TheologieTheologieTheology Athlétisme AND EntraîneursLeichtathletiktrainer Track and field coaches TraducteursÜbersetzerTranslators TraducteursDolmetscherTranslators 0Erdalkalichloride Alkaline earth chlorides

14 exact exact inexact = overlapping inexact = overlapping partial equivalent partial equivalent Equivalence Ètudiants japonaisJapanischer Studentjapanese students ImpérialismeNeokolonialismusImperialism Cas (linguistique)Dativ0

15 Different coverage Different coverage Different semantics, semantic relations Different semantics, semantic relations Language skills Language skills Mapping problems Apfelwein Cidre Cider Antike? Altertum! Civilsation antique Civilisation, Ancient

16 Access to mappings Mapped terms in LMI -> XML-Download Mapped terms in LMI -> XML-Download Linked Open Data Linked Open Data Multilingual equivalent relations in authority records Multilingual equivalent relations in authority records

17 Linked Data: Publishing MACS in the web Linked Data: Datasets on the web linked together semantically, forming a publicly accessible network of structured information Linked Data: Datasets on the web linked together semantically, forming a publicly accessible network of structured information

18 Why Linked Data? Has become the de-facto standard for: Has become the de-facto standard for: publishing publishing exchanging data on the Internet exchanging data on the Internet Reaches more collaboration partners and customers Reaches more collaboration partners and customers Research institutions Research institutions Cultural heritage organizations Cultural heritage organizations Internet companies Internet companies online communities online communities Library data can play a major role in the semantic web Library data can play a major role in the semantic web provides reliability and stability provides reliability and stability can become a „backbone of trust“ can become a „backbone of trust“

19 MACS as Linked Data Linked Data published by the German National Library (as of May 30th, 2010) Linked Data published by the German National Library (as of May 30th, 2010) 37.547 links from SWD to LCSH 37.547 links from SWD to LCSH 28.249 links from SWD to RAMEAU 28.249 links from SWD to RAMEAU RAMEAU and LCSH also publish MACS relations as part of their Linked Data RAMEAU and LCSH also publish MACS relations as part of their Linked Data Our data has been used to update the RAMEAU  SWD links Our data has been used to update the RAMEAU  SWD links LCSH  SWD links will follow LCSH  SWD links will follow

20 Applications of MACS Data Improved Multilingual Search Functionalities Improved Multilingual Search Functionalities Europeana Europeana Deutsche Digitale Bibliothek Deutsche Digitale Bibliothek Website of the German National Library Website of the German National Library

21 Cataloguers benefit Automatic Indexing: 5050 610 5100 !040537242! |s|Schwangerschaft ; 5101 !040153320! |s|Ernährung

22 Users benefit Search with the native languageSearch with the native language Enlargement of recallEnlargement of recall Additional recall in the own catalogue Additional recall in the own catalogue Enlargement of the query to foreign catalogues and portals Enlargement of the query to foreign catalogues and portals Precision of recallPrecision of recall

23 11 titles with the heading SWD Erneuerbare Energien 25 titles with the LCSH as keyword not indexed with SWD LCSH Renewable energy sources ----- RAMEAU Energies renouvables 2.445 titles with the heading SWD Führung 818 titles with the LCSH as keyword not indexed with SWD LCSH Leadership ----- RAMEAU Leadership

24 SWD Ärztlicher Kunstfehler used for Ärztlicher Behandlungsfehler LCSH Medical errors RAMEAU Erreurs médicales search term Ärztlicher Behandlungsfehler Dokuments: in DNB SNL LoC BL BnF and so on.... Users benefit

25 Subject search results

26 Multilingual meta-searching One-stop-searching for heterogeneously indexed documents One-stop-searching for heterogeneously indexed documents Authority data (mapping) displayed to enhance searching and understanding Authority data (mapping) displayed to enhance searching and understanding Getting results from different libraries Getting results from different libraries Europe comes together: TEL, Europeana Europe comes together: TEL, Europeana

27 Library users of multicultural societies Searching for subjects in the preferred language of the user Searching for subjects in the preferred language of the user Crossing language barriers in academic and public libraries Crossing language barriers in academic and public libraries 12 % foreign students in Germany 12 % foreign students in Germany

28 Thank you! MerciDanke h.karg@d-nb.de y.jahns@d-nb.de


Download ppt "Translingual Retrieval Moving between vocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability."

Similar presentations


Ads by Google