EHRI Vocabularies and Linked Open Data : An enrichment?


EHRI Vocabularies and Linked Open Data: An enrichment?
Annelies van Nispen
15/05/2018
CONNECTING COLLECTIONS

The EHRI Portal: connecting archives and users
- Online inventory of institutions and collections about the Holocaust
- Making sources visible in a systematic fashion in order to counteract the fragmentation of the sources
- Reveal interconnections (e.g. through a multilingual thesaurus; collation of authority files; relationships between originals and copies)
- EHRI focuses on collection descriptions

EHRI Portal https://portal.ehri-project.eu/

EHRI Vocabularies
- EHRI Thesaurus (subject terms)
- Camps
- Ghettos
- Administrative districts
- Places (GeoNames)
- Persons
- Corporate bodies

EHRI Vocabularies
- Main tool for multilingual information retrieval and search functionality
- Cataloguing and integration tool for incoming data
- Holocaust-related knowledge base, useful for further developments, e.g. NER, LOD or …

EHRI Vocabularies and Linked Open Data
Experiments with EHRI Vocabularies and LOD:
- Places: GeoNames
- Persons: VIAF
- Camps & Ghettos: Wikidata
Aim: enrich EHRI's vocabularies and, where possible, publish them as LOD

EHRI Places & Geonames

GeoNames reconciliation: problematic cases
- Places not listed in GeoNames (e.g. Altreich)
- Places listed in GeoNames but missing spelling variants (e.g. Babyn Iar)
- More than one location per place name, e.g. "Berlin" from "1(Berlin, sowjetischer Sektor)" mapped to 176 different locations
- Access points that are difficult to disambiguate without context (e.g. "Bauer" can be the German word for "peasant", a German family name, or a German town)
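One possible preprocessing step for access points such as "1(Berlin, sowjetischer Sektor)" is to strip the wrapper and separate the place name from its qualifier before reconciliation. The following is a minimal Python sketch. The assumed pattern (an optional leading number, parentheses, an optional comma-separated qualifier) is inferred only from the examples on the slides, and `split_access_point` is a hypothetical helper, not part of EHRI's tooling.

```python
import re
from typing import NamedTuple, Optional


class AccessPoint(NamedTuple):
    name: str                  # candidate place name to send to reconciliation
    qualifier: Optional[str]   # remaining context, if any


# Assumed shape: optional leading digits, then "(name[, qualifier])".
_WRAPPED = re.compile(r"^\s*\d*\s*\((?P<body>.*)\)\s*$")


def split_access_point(raw: str) -> AccessPoint:
    """Split a raw access point into a place name and an optional qualifier."""
    match = _WRAPPED.match(raw)
    # Strings without the wrapper are passed through unchanged.
    body = match.group("body") if match else raw
    name, _, qualifier = body.partition(",")
    return AccessPoint(name.strip(), qualifier.strip() or None)


print(split_access_point("1(Berlin, sowjetischer Sektor)"))
```

Keeping the qualifier rather than discarding it may help with the ambiguous cases, since it is the only context available for choosing among many candidate locations.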

GeoNames: more issues
- Access points with typos not clustered by OpenRefine (e.g. "Aushwitz" instead of "Auschwitz")
- Access points wrongly filtered out as person names (e.g. "Amsterdam, Landsmeer")
- Common nouns sometimes give false positives, e.g. "Artillerie" from "1(Artillerie)" mapped to a part of a town in New Caledonia
- Historical states, such as Yugoslavia or Czechoslovakia, are not properly linked to parents / children in the GeoNames dataset
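Typos such as "Aushwitz" can in principle be caught by comparing each access point against a list of already reconciled names. The sketch below uses Python's standard `difflib` module as a stand-in for such a check; it is not the clustering method OpenRefine uses. The `KNOWN_PLACES` list, the `suggest_place` helper, and the 0.85 cutoff are illustrative assumptions.

```python
from difflib import get_close_matches
from typing import Optional

# Illustrative list only; a real run would use the reconciled place names.
KNOWN_PLACES = ["Auschwitz", "Amsterdam", "Berlin"]


def suggest_place(access_point: str, cutoff: float = 0.85) -> Optional[str]:
    """Return the closest known place name, or None if nothing is similar enough."""
    matches = get_close_matches(access_point, KNOWN_PLACES, n=1, cutoff=cutoff)
    return matches[0] if matches else None


print(suggest_place("Aushwitz"))
print(suggest_place("Artillerie"))
```

A high cutoff keeps common nouns like "Artillerie" from being pulled towards an unrelated place name, at the cost of missing more heavily misspelled entries, so suggestions would still need a manual check.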

EHRI Persons & VIAF

EHRI Personalities and VIAF
Experiment: automatic matching to VIAF of person data from Yad Vashem, CDEC and Cegesoma, with a manual quality check on the matching results.
Issues:
- Many people carry the same name
- Not enough information on birth/death dates, places or profession to distinguish individuals
- Spelling variants and mistakes

Outcome of experiment
- Of 100 Yad Vashem (YV) names, 68 were matched against entries in VIAF
- High ambiguity in matching: a total of 234 matches, so each matched name was matched 3.44 times on average
- Of the 68 matched names, 31 were correct and 37 were false positives
- The ambiguity in cases of a correct match was sometimes higher, e.g. the correct one in a set of 5 or 6 matches
- Cegesoma and CDEC data give similar results, with CDEC data showing an even higher share of false positives
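The figures for the Yad Vashem sample can be checked with a few lines of Python. The variable names are illustrative, and reading 3.44 as an average over the 68 matched names is an inference from the numbers (234 / 68), not something the slide states explicitly.

```python
# Figures as reported on the slide for the Yad Vashem sample.
names_tested = 100
names_matched = 68
candidate_matches = 234
correct = 31
false_positives = 37

match_rate = names_matched / names_tested               # share of names with any VIAF match
candidates_per_name = candidate_matches / names_matched  # average ambiguity per matched name
precision = correct / names_matched                      # share of matched names that were right
false_positive_share = false_positives / names_matched
correct_overall = correct / names_tested                 # correct matches over the whole sample

print(f"match rate:           {match_rate:.0%}")
print(f"candidates per name:  {candidates_per_name:.2f}")
print(f"precision:            {precision:.1%}")
print(f"false positive share: {false_positive_share:.1%}")
print(f"correct overall:      {correct_overall:.0%}")
```

Under this reading, fewer than half of the matched names were linked correctly, and under a third of the full sample ended up with a correct VIAF link.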

Ghettos, Camps and Wikidata

Importing ghettos into Wikidata
- Name of the ghetto in different languages
- Unique EHRI identifier for the ghetto
- Associated place name and its unique identifier in Wikidata
- Coordinates from Yad Vashem and/or USHMM
- Unique identifiers from online resources, including The Yad Vashem Encyclopedia of the Ghettos During the Holocaust and the USHMM Holocaust Encyclopedia
- Added statement qualifying the entry as a "ghetto in Nazi-occupied Europe"
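The fields listed for the import can be pictured as one record per ghetto. The following Python sketch is only an illustration of that shape: the class, its field names, and the completeness check are assumptions, the sample values are placeholders rather than real identifiers, and nothing here reflects how EHRI actually prepared or uploaded the data.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class GhettoRecord:
    """Hypothetical container for the fields listed on the slide."""
    ehri_id: str                       # unique EHRI identifier
    labels: Dict[str, str]             # language code -> ghetto name
    place_label: str                   # associated place name
    place_wikidata_id: str             # Wikidata identifier of that place
    coordinates: Optional[Tuple[float, float]] = None   # (latitude, longitude)
    external_ids: Dict[str, str] = field(default_factory=dict)  # resource -> identifier
    instance_of: str = "ghetto in Nazi-occupied Europe"

    def missing_fields(self) -> List[str]:
        """List the fields that are still empty, as a pre-import completeness check."""
        missing = []
        if not self.ehri_id:
            missing.append("ehri_id")
        if not self.labels:
            missing.append("labels")
        if not self.place_wikidata_id:
            missing.append("place_wikidata_id")
        if self.coordinates is None:
            missing.append("coordinates")
        return missing


# Placeholder values only; these are not real identifiers.
record = GhettoRecord(
    ehri_id="EXAMPLE-ID",
    labels={"en": "Example ghetto"},
    place_label="Example place",
    place_wikidata_id="",
)
print(record.missing_fields())
```

A check like `missing_fields` would flag records that lack a place identifier or coordinates before anything is sent to Wikidata.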

EHRI Ghettos & Wikidata

Wikidata to EHRI Portal
- English name of the ghetto
- Place where the ghetto was located
- Coordinates for the location
- EHRI-assigned unique identifier for the ghetto
- Associated unique identifiers from online resources
- Multilingual labels generated from the name of the places
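If the coordinates are retrieved through the Wikidata Query Service (an assumption; the slides do not say how the data was exported), they arrive as WKT literals of the form "Point(longitude latitude)", with longitude first. The sketch below converts such a literal into a (latitude, longitude) pair; `parse_wkt_point` is a hypothetical helper and the sample values are made up for illustration.

```python
import re
from typing import Tuple

# Matches e.g. "Point(21.0 52.25)": longitude first, then latitude.
_WKT_POINT = re.compile(
    r"^\s*POINT\s*\(\s*(-?\d+(?:\.\d+)?)\s+(-?\d+(?:\.\d+)?)\s*\)\s*$",
    re.IGNORECASE,
)


def parse_wkt_point(literal: str) -> Tuple[float, float]:
    """Convert a WKT point literal into a (latitude, longitude) tuple."""
    match = _WKT_POINT.match(literal)
    if match is None:
        raise ValueError(f"not a WKT point: {literal!r}")
    lon, lat = float(match.group(1)), float(match.group(2))
    # Reject values outside the valid coordinate ranges.
    if not (-90.0 <= lat <= 90.0 and -180.0 <= lon <= 180.0):
        raise ValueError(f"coordinates out of range: {literal!r}")
    return lat, lon


print(parse_wkt_point("Point(21.0 52.25)"))
```

Swapping the order explicitly in one place avoids the common mistake of plotting points with latitude and longitude reversed.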

EHRI Vocabularies & LOD: An enrichment?
Mixed results:
- GeoNames set has problems, but we will use it for further development
- Personalities: too many errors, and a sensitive vocabulary
- Ghettos, Camps and Wikidata: a positive experience

CONNECTING KNOWLEDGE
- The Jewish Museum of Greece (GR)
- Jewish Historical Institute (PL)
- King’s College London (UK)
- Ontotext AD (BG)
- Elie Wiesel National Institute for the Study of Holocaust in Romania (RO)
- DANS Data Archiving and Networked Services (NL)
- Shoah Memorial, Museum, Center for Contemporary Jewish Documentation (FR)
- ITS International Tracing Service (DE)
- Hungarian Jewish Archives (HU)
- INRIA Institute for Research in Computer Science and Automation (FR)
- Vilna Gaon State Jewish Museum (LT)
- VWI Vienna Wiesenthal Institute for Holocaust Studies (AT)
- Foundation Jewish Contemporary Documentation Center (IT)
- NIOD Institute for War, Holocaust and Genocide Studies (NL)
- CEGESOMA Centre for Historical Research and Documentation on War and Contemporary Society (BE)
- Jewish Museum in Prague (CZ)
- Center for Holocaust Studies at the Institute for Contemporary History in Munich (DE)
- YAD VASHEM The Holocaust Martyrs’ and Heroes’ Remembrance Authority (IL)
- United States Holocaust Memorial Museum (USA)
- Bundesarchiv (DE)
- The Wiener Library Institute for the Study of the Holocaust & Genocide (UK)
- Holocaust Documentation Centre (SK)
- Polish Center for Holocaust Research (PL)
EHRI is funded by the European Union