Reflections from the FACET Project Doug Tudhope Hypermedia Research Unit University of Glamorgan NKOS Workshop, JCDL 2005.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Towards Terminology Services: Reflections from the FACET Project Doug Tudhope Hypermedia Research Unit University of Glamorgan OCLC seminar, April, 2006.
CONCEPTUAL WEB-BASED FRAMEWORK IN AN INTERACTIVE VIRTUAL ENVIRONMENT FOR DISTANCE LEARNING Amal Oraifige, Graham Oakes, Anthony Felton, David Heesom, Kevin.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Applications of NKOS: some examples and questions Doug Tudhope Hypermedia Research Unit University of Glamorgan DC-2005 NKOS Special Session.
Delivering HILT as a shared service Rachel Heery UKOLN, University of Bath
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
Mapping domain thesauri to the CRM to assist the semantic interoperability of data archives Doug Tudhope Hypermedia Research Unit University of Glamorgan.
Towards Adaptive Web-Based Learning Systems Katerina Georgouli, MSc, PhD Associate Professor T.E.I. of Athens Dept. of Informatics Tempus.
Associative and Spatial Relationships in Thesaurus-based Retrieval Harith Alani 1, Christopher Jones 2, Douglas Tudhope 1 1 School of Computing, University.
Scoping study of KOS registries Doug Tudhope Hypermedia Research Unit University of Glamorgan NKOS workshop 2007.
Vocabulary registries and services Doug Tudhope Hypermedia Research Unit University of Glamorgan Ecoterm, FAO, Rome, Oct 2009.
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Multilingual multimedia thesaurus for conservation and restoration collaborative networked model of construction Lucijana Leoni University of Dubrovnik.
Ontology Notes are from:
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
1 ALiSS Adaptive Links Suggestion Service Antonio De Marinis, Stefan Jensen (EEA) Alec Ghica (Finsiel RO), Sasha Vinčić (Systemvaruhuset) Ecoterm III FAO.
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
Classroom User Training June 29, 2005 Presented by:
KOS-based tools for archaeological dataset interoperability: NKOS Workshop, ECDL 2010 C. Binding, K. May 1, D. Tudhope, A. Vlachidis Hypermedia Research.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Problems of interoperability involving Knowledge Organization Systems (KOS) Doug Tudhope Hypermedia Research Unit University of Glamorgan Helsinki, November.
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
A J Miles Rutherford Appleton Laboratory SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web NKOS workshop ECDL.
Semantic Terminology Services: Experiences from the FACET Project Doug Tudhope Hypermedia Research Unit University of Glamorgan DELOS Workshop, Lund, June.
Automatic Subject Classification and Topic Specific Search Engines -- Research at KnowLib Anders Ardö and Koraljka Golub DELOS Workshop, Lund, 23 June.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Chemical Toxicity and Safety Information System Shuanghui Luo Ying Li Jin Xu.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
1 Issues in Reusing and Sharing the Content of Thesauri and Taxonomies in OOR Marcia Zeng NKOS (Networked Knowledge Organization Systems/Services) My participating.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
NERC DataGrid NERC DataGrid Vocabulary Server Use Cases Vocabulary Workshop, RAL, February 25, 2009.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Semantic Web, Web Services and Museums: Mapping the Road to Implementation John Perkins “MESMUSES Workshop” Florence, June 16-17, 2003.
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Working with Ontologies Introduction to DOGMA and related research.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Information Retrieval
6 th ECDL NKOS Workshop Organisers: Doug Tudhope Traugott Koch Marianne Lykke Nielsen NKOS Workshop, Budapest, 2007.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
STAR, STELLAR and SKOS Ceri Binding, Phil Carlisle, Keith May, Doug Tudhope, Andreas Vlachidis University of Glamorgan and English Heritage.
LE:NOTRE Spring Workshop The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
TRSS Terminology Registry Scoping Study
The Role of Ontologies for Mapping the Domain of Landscape Architecture An introduction.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Doug Tudhope Hypermedia Research Unit University of Glamorgan
C. Binding, K. May1, R. Souza, D. Tudhope, A. Vlachidis
Presentation transcript:

Reflections from the FACET Project Doug Tudhope Hypermedia Research Unit University of Glamorgan NKOS Workshop, JCDL 2005

Presentation FACET Project –Faceted Knowledge Organisation Systems (KOS) –Semantic expansion –Web Demonstrator Reflections / Current work –Need for standard representations and API –Pilot Terminology Services –KOS and Semantic Web –Cost/Benefit issues

FACET - Faceted Access to Cultural hEritage Terminology FACET - a collaborative project investigating the potential of semantic term expansion in retrieval Aims: Integration of thesaurus into the interface Semantic term expansion and matching function taking advantage of facet structure

FACET Collaborators Research Council Funding: EPSRC 3 years National Museum of Science and Industry (NMSI): National Railway Museum and Science Museum Collections Database J. Paul Getty Trust Art and Architecture Thesaurus (AAT) Museum Documentation Association (MDA) Railway Thesaurus Canadian Heritage Information Network (CHIN) Advisors

NRM Collection examples of free text object descriptor fields Chair, London Midland & Scottish Railway, straight wooden back initials carved on back, green leatherette seat. Chair, Railway Clearing House, Curved back with blue leather inset & blue leather seat. R. C.H. carved on back Chair, M.S. & L.R., Straight back, blue leather seat with M.S. & L.R. carved across back Armchair, Pullman, green plush, fringed from Pullman section. Carver chair, Oak with oval brocade seat. Prince of Wales crest on back from Royal Saloon of 1876 Armchair, Upholstered in blue maquette with curved, buttoned back & scroll arms. Wooden legs Occasional table, Oak with drawer, ornately carved. From Royal Saloon of 1876 Set of 4 chairs, High-backed carver chairs upholstered in floral maquette Clock, made by Jno Walker, 250 Regent Street. Metal face/Roman numerals. Carved wooden square case. 20"x18"x10"

Semantic Term Expansion Reasoning over thesaurus semantic relationships allows the system to play an active role Ranking of matching items in a result set Automatic suggestion of terms to be considered for query Query reformulation and ‘more like this’ option Augmented Browsing tools – semantic expansion Underpinning technologies: Measures of distance over the semantic index space Matching Function for sets of terms

FACET Prototype SQLServer database: collections DB and Thesaurus C++ thesaurus term expansion engine Dual thesaurus representations –database –in-memory data structure Visual Basic and Web client interfaces –‘Find Term’ mapping to terms, alternates, scope notes –Browse hierarchies –Semantic browsing –Query Builder –Ranked results

Faceted Knowledge Organisation Systems Faceted classifications based on primary division into fundamental, high-level categories (facets) Compound descriptors (multi-concept headings) are synthesised by combination of terms from limited number of fundamental facets In constructing AAT, adjectival noun phrases very common: e.g. painted oak furniture “Rather than enumerate the nearly infinite number of object and subject descriptions needed by thesaurus users, the AAT decided to pursue the building blocks of these descriptors in the form of a faceted vocabulary” (Guide to Indexing and Cataloging with the Art & Architecture Thesaurus)

Matching Problem “The major problem lies in developing a system whereby individual parts of subject headings containing multiple AAT terms are broken apart, individually exploded hierarchically, and then reintegrated to answer a query with relevance” (Toni Petersen, AAT Director) Query: mahogany, dark yellow, brocading, Edwardian, armchair Descriptor: oak, light yellow, crests, ovals, brocade, Victorian, Carver chair Potentially extra / missing / partially and non-matching terms

System Architecture

FACET standalone system

FACET Web Demonstrator illustrates thesaurus content and semantic expansion in a fairly realistic Web prototype application Intended more as an exploration of FACET research outcomes as dynamically generated Web components than a general interface but suggestive of possible interface components Not rely on pre-built static HTML pages - thesaurus content is generated dynamically

FACET Web Demonstrator implementation Browser-based interface (ASP application), using a combination of server-side scripting and compiled components Persistence of state information between page requests a problematic issue - HTTP protocol is (by design) stateless Solution adopted for current demonstrator involved small 'scriptlet' interface components to communicate with server without causing a browser to refresh the entire page. But side effect of introducing some (IE) platform dependence

FACET Web Demonstator

Some lessons learned Results from FACET show potential of faceted KOS for –Query expansion (ranked results based on semantic closeness) –Semantic expansion as a browsing tool when wishing to use KOS behind the scenes Web demonstrator first step –Based on custom API –KOS and database on same server (but need not be) –How to generalise these techniques?  need for Common KOS representations and APIs for general terminology (KOS) services

KOS integration into DL services from Hill et al Research Agenda (SigCR Workshop 2002) Taxonomy of KOS - KOS types linked to DL service protocols Registries of KOS and KOS-level metadata to represent them RDF/XML KOS representations - customisable Core set of relationship types across all KOS General KOS service protocol from which protocols for specific types of KOS can be derived Robust linking model in which DL entities (collections, objects, and services) can refer to KOS entities (concepts, labels, and relationships) Visualization tools that fully use and display the rich semantics embedded in KOS

Towards Terminology Services KOS-based services as elements of applications with some form of search/indexing component Next phase of work looks at common KOS representation formats and API protocols - making content available via programmatic interfaces Eg SKOS Core (RDF/XML) Schema and SKOS API deliverables of SWAD-Europe Thesaurus Activity Experiments with XPATH-based KOS interfaces (using XML and SKOS schemas) promising for relatively small KOS held within the web browserXPATH-based KOS interfaces

Pilot KOS Browser Client Web Service SKOS API designed to provide programmatic access to thesauri and related KOS via the web –Builds on Zthes, ADL Protocols DREFT demonstration web services server based on SKOS API available(?) at ILRT Only a subset of SKOS API calls were available at time of work we investigated possibilities with just 2 API calls – pilot SKOS API browsing client demonstrates browsing of online thesaurus (GEMET - GEneral Multilingual Environmental Thesaurus) via web service calls. Also GEMET thesaurus own work on web service APIweb service API

Pilot SKOS API Web Service Browser getConcept getAllConceptRelatives show semantically connected concepts but not relationships Navigation history and local cache of retrieved concepts implemented API needs more work but is a basis for web services

Semantic Expansion Service API should reflect use patterns and include composite calls in addition to returning atomic KOS data elements Ongoing work - semantic expansion as a service –as an API protocol element would yield different configurations KOS interface displays by single call novel interfaces, such as navigation via semantic expansion Query expansion for various ranked result query services Term suggestion to assist indexing/annotation More details: KOS at your Service: Programmatic Access to Knowledge Organisation Systems

Future work - KOS and Semantic Web? Important to provide a bridge/migration between KOS and Ontologies. KOS can be an element of higher level ontologies and schemas and can help leverage them. Eg utilising SKOS RDF/XML Schemas Eg DELOS JPA semantic interoperability project mapping a thesaurus to CRM Upper Ontology Ontologies as formal precise definition of relationships can be combined with inference rules and automated systems many useful applications (eg e-Science) where well defined objects and operations but also Take advantage of existing KOS in Semantic Web Some confusion as to how KOS intended to be used Need for education as to KOS design context/purpose

The ‘ontological ideology’ (Adorno) Assumption that allocation of instances to categories is unproblematic (in everyday life) –tendency to make invisible the ‘interpretive work’ in assigning objects to concepts, the bending of categories and evolution of the meaning of concepts through use DL application of concepts to ‘documents’ in indexing/search is also not unproblematic –Related via “aboutness” not clear-cut instance relationship –Indexer - Searcher (and Indexer) variation in concept selection –Use of results based on probable relevance judgements

KOS (intellectual) usually Designed in order to assist generalised retrieval Basis of construction is perceived assistance in indexing/ searching/browsing as much as logical properties of attributes Recognition that the semantic structure is to some extent ‘conventional’ with different possible cognitive viewpoints but that users can be assisted to explore a given structure and make use of it for own purposes

How to apply KOS? Domain dependent level of precision in concept use Important to take into account how applications will process concepts Current KOS relationships at a useful level of generality for many applications (with some specialisation?) where results are based on probable relevance judgements Eg Thesaurus pragmatic tool includes semantics, domain lexicon (UF/ALTs, Scope Notes) Cost/benefit issues for KOS applications in granularity of relationships and degree of formalisation Role for knowledge-based interactive tools in semantic web –old debates on Expert Systems Vs Systems for Experts

NKOS Workshop at ECDL 2005 on related theme to this workshop NKOS Workshop – Mapping Knowledge Organisation Systems: User-centred Strategies EDCL2005, September 22nd, Vienna see Selected papers from the NKOS workshop will be considered for forthcoming special issue of journal New Review of Hypermedia and MultimediaNew Review of Hypermedia and Multimedia along with an open call for papers.

References Binding C., Tudhope D KOS at your Service: Programmatic Access to Knowledge Organisation Systems. JoDI 4(4), FACET Case Study, DigiCult Thematic Issue 6: Resource Discovery Technologies for the Heritage Sector, [pdf] FACET website. FACET Web demonstrator FACET Xpath work Hill et al Integration of Knowledge Organization Systems into Digital Library Architectures. ASIST SigCR - final.dochttp:// final.doc Tudhope D., Binding C., Blocks D., Cunliffe D Compound Descriptors in Context: A Matching Function for Classifications and Thesauri. JCDL 2002, full paper (pdf)full paper (pdf)

Contact Information Doug Tudhope School of Computing University of Glamorgan Pontypridd CF37 1DL Wales, UK