Design of a Pilot SRW-compliant Terminologies Mapping Service (HILT) Terminologies Workshop, ECDL, Alicante, 2006 Dennis Nicholson, CDLR, Strathclyde University.

Slides:



Advertisements
Similar presentations
Terminology Services Ralph LeVan Senior Research Scientist OCLC.
Advertisements

Searching Options and Result Sets Sara Randall Endeavor Information Systems October 30, 2003.
Cultural Heritage in REGional NETworks REGNET. October 2001Project presentation REGNET 2 T1.3. IDENTIFICATION OF STANDARDS TO BE USED 1. OBJECTIVES 2.
OCLC Research April 2008 Terminology Services Experimental Services for Controlled Vocabularies.
Intute Repository Search Project An iterative approach to developing a national search service to support scholarly communication, teaching and learning.
The HILT Pilot Terminologies Server Dennis Nicholson: Centre for Digital Library Research, Strathclyde University.
A centre of expertise in digital information management UKOLN is supported by: SRU: An overview of the SRU protocol and how it can be used.
Names Project Web Services and repositories workshop Daniel Needham.
Metadata workshop, June The Workshop Workshop Timetable introduction to the Go-Geo! project metadata overview Go-Geo! portal hands on session.
Report on progress Stakeholder workshop, 29 Jan 2003.
OCLC Online Computer Library Center Terminology Services Diane Vizine-Goetz OCLC Research.
Cross-browsing subject gateways with the Dewey Decimal Classification in the Renardus Service Michael Day UKOLN, University of Bath JISC.
Delivering HILT as a shared service Rachel Heery UKOLN, University of Bath
1 Leonard Will Willpower Information Evaluation of HILT 2.
HILT II: Towards Interoperable Subject Descriptions Report to the JISC Terminologies Workshop, February Dennis Nicholson: Centre for Digital Library.
WikiD (Wiki/Data) Jeffrey A. Young OCLC Office of Research Distributed Service Registry Workshop Warwick, UK 14 July 2005.
HILT IV Pilot Toolkit Demonstration Emma McCulloch Centre for Digital Library Research CIG 2008, Glasgow.
Not just numbers on shelves: using the DDC for information retrieval Gordon Dunsire Presented at the Symposium “Bridging the class(ification) divide: the.
The OCLC Metadata Switch Project Jean Godby, Thomas Hickey, Diane Vizine-Goetz OCLC Office of Research Digital Library Federation May 14, 2003.
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
University of Adelaide Library Life Impact The University of Adelaide The well connected catalogue Patricia Scott, Denise Tobin and Helen Attar.
The KB on its way to Web 2.0 Lower the barrier for users to remix the output of services. Theo van Veen, ELAG 2006, April 26.
Case study - usability evaluation Howell Istance.
Joan S. Mitchell Executive Director & Editor in Chief Dewey Decimal Classification OCLC WebDewey.
21 21 Web Content Management Architectures Vagan Terziyan MIT Department, University of Jyvaskyla, AI Department, Kharkov National University of Radioelectronics.
DoW text: Task and WP leaders will prepare syntheses reports of the project progress, its results and its implications. These synthesis reports will be.
WISER: Newspapers online : an introduction to the scope and range of recent and current newspapers available on Oxlip, including hints on effective search.
Z39 Intro DigiTool Version 3.0. Z39 Intro 2 Overview What is z39.50? “A network protocol which specifies rules that allow searching of a range of different.
A Globally Interoperable Scottish Subject Landscape? HILT, SPEIR and a Scottish Terminologies Server CIGS Scottish Terminologies Day, September Dennis.
Workshops in Information Skills and Electronic Resources Oxford University Library Services WISER Social Sciences: Finding Journal Articles Angela Carritt:
Presented By: Product Activation Group Syndication.
1 Web Server Concepts Dr. Awad Khalil Computer Science Department AUC.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
Terminology services and the DDC: the High-Level Thesaurus and beyond Presented to the symposium Dewey goes Europe: on the use and development of the Dewey.
Using the SAS® Information Delivery Portal
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
 2001 Prentice Hall, Inc. All rights reserved. 1 Chapter 21 - Web Servers (IIS, PWS and Apache) Outline 21.1 Introduction 21.2 HTTP Request Types 21.3.
CINAHL DATABASE FOR HINARI USERS: nursing and allied health information (Module 7.1)
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
IESR Interfaces: Current Services and Future Plans Ann Apps MIMAS, The University of Manchester, UK.
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
1 © Netskills Quality Internet Training, University of Newcastle HTML Forms © Netskills, Quality Internet Training, University of Newcastle Netskills is.
The UNESCO Thesaurus Meeting for Managers of UNESCO Documentation Networks Meron Ewketu UNESCO Library June
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
Computing Ontology Part II. So far, We have seen the history of the ACM computing classification system – What have you observed? – What topics from CS2013.
MetaLib 4 User Guide. 2 MetaLib 4 Access MetaLib at: – MetaLib may be used at two different levels –
JISC Information Environment Service Registry (IESR) Ann Apps MIMAS, The University of Manchester, UK.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
CNI, 4th April 2006 Slide 1 Key Standards Update: SRU (“Technical” Details) Dr. Robert Sanderson Dept. of Computer Science University of Liverpool
SRW/U: Re-Introduction SRW is a Web Services based Information Retrieval Protocol Motivations: Create an easy to implement protocol with the power of Z39.50.
Optimising Interoperability in Multi-KOS Subject Searching: Framework for a Collaborative Approach? Dennis Nicholson, Centre for Digital Library Research.
ALA Annual Meeting Claire Cocco Global Product Manager CONTENTdm Users Group June 30th, 2008.
PDS4 Demonstration Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Diane Vizine-Goetz Senior Research Scientist, OCLC Research Joan S. Mitchell Editor in Chief, DDC Michael Panzer Assistant Editor, DDC Publisher and Librarian.
HILT High Level Thesaurus Project Report to the JISC/NSF Conference on HILT Phase I (completed) and HILT Phase II (just starting) Dennis Nicholson: Centre.
Discovery and Metadata March 9, 2004 John Weatherley
TRSS Terminology Registry Scoping Study
WEB SERVICES From Chapter 19 of Distributed Systems Concepts and Design,4th Edition, By G. Coulouris, J. Dollimore and T. Kindberg Published by Addison.
High-Level Thesaurus (HILT) project: Recent work
CINAHL DATABASE FOR HINARI USERS
Service-centric Software Engineering
Health On-Line Patient Education Web Site
JISC Information Environment Service Registry (IESR)
MEDLINE with Full Text Searching
WEB SERVICES From Chapter 19, Distributed Systems
Presentation transcript:

Design of a Pilot SRW-compliant Terminologies Mapping Service (HILT) Terminologies Workshop, ECDL, Alicante, 2006 Dennis Nicholson, CDLR, Strathclyde University

HILT: Background and Overview Funded JISC; Support: OCLC; Collaborative Aim: provide subject interoperability in a multi- scheme environment via inter-scheme mapping Ideally by identifying a generic approach, able to be built up through distributed collaborative action… Originally: intellectual mapping, but now see how model can include range of interoperability services Phase I, II, M2M FS; Now: Phase III (main focus)

HILT Phase III [Nov 05 – Jan 07] Aim: an M2M pilot that: Offers terminology services via SRW but is open to later extension to ( Z39.50; SRU) Uses SKOS-Core as the mark-up for sending out terminology sets and classification data but open to later extension to other formats (MARC; Zthes) Is open to the possibility of a distributed approach to building a full service up via wide collaboration Extends the user-accessible (non-M2M) Phase II pilot beyond inter-scheme mapping

Phase III: How Phase II Pilot works Offers mapping based subject interoperability via a DDC spine, and works like this: The user enters a subject term, which is used to search the database for DDC captions that may fit the users topic Captions / numbers returned; user chooses best match The DDC number chosen is used to find collections covering the users subject and the subject schemes they use in a collections database, and the best term for the users topic in any given scheme Sample retrieval is provided where possible Screen shots to illustrate

Description Top levels browse hierarchy Search box – common - teeth

System responds by finding term Identifying possible DDC captions Returning to user as shown User then chooses best fit for topic Number 3, dental diseases

Dental diseases in DDC, used for 3 things Truncation used to find these relevant collections at DDC 610 This identifies the subject schemes they use (MeSH here) Best term in scheme via mapping to DDC (dentition)

Finally, dentition used to search last of these collections via OpenURL; send back relevant hits

Diagram: architecture SRW version; Blue (II): Users, browsers, HILT RH PHP/web service, screens Grey: additional SRW elements: users, services, embedded clients, Two: HILT Phase II +, GoGeo service specific; Collections/services dbase via client/ RH SOAP server queries database; SKOS

Sample SKOS wrapped record URL later

Database structure

Data; 6 Functions; Emulate Phase II Get_DDC_records Returns DDC captions and numbers related to an input term; list for user to choose best fit caption/no. Get_collections Returns collections classified under a specified DDC number or its stem, including subject scheme used Get_non_DDC_records Returns mappings to other schemes from a specified (untruncated) DDC number

Functions: Phase II and beyond Get_all_records Combines the functions of get_DDC_records and get_non_DDC_records seen above Get_explain Provides information to feed SRW Explain requests Get_filtered_set Allows specified fields from specific terminologies or combinations of terminologies to be searched – enabling functionality beyond phase II to be added

Next: how SRW clients use functions to emulate pilot Top:SOAP get_DDC_records indexed under teeth Middle: DDC captions returned (with numbers) User picks best fit caption: dental diseases; Bottom: SOAP get_collections ; stem (610)

Lower middle: get_non_ddc_records for terms mapped to Bottom: one returned term (dentition best term in MeSH) Top: repeat of request; get_collections&DDC= Upper middle: collections classified at 610; schemes used (Schemes)

Dentition used to search last collection for relevant hits

Switch to M2M: Advantages Services can use a standard web services protocol and query language to interact with HILT and other terminology services They can use HILT services selectively They can offer enhanced services that are transparent to their users They can utilise what they know about their users and their behaviour to interact more usefully with HILT

Phase III pilot: Not Just Phase II Extended Features: A baseline SRW open source client that includes the DDC collections-finding code More schemes: DDC, LCSH, IPSV, AAT, GCMD HASSET, MeSH, NMR, JACS, UNESCO Additional – but still illustrative - mappings Detailed data on terms in schemes: BT, NT,RT, synonyms, scope note etc. Last s hould allow clients to generate – and allow users to navigate - scheme hierarchies

Future Possibility: A Baseline Service? Basis of generic collaborative distributed solution? If clients can generate scheme hierarchies; initial but extendable service based on top level mappings and hierarchy based collection retrieval looks feasible Deeper levels of mapping as and when possible Distributed approach could allow faster progress on scheme expansion and deeper mapping Model open to external interoperability/terminology services, not just local intellectual mappings (CSD)

Beyond Phase III – A Possible Path? Could be used for distributed collaborative work; research into retrieval effectiveness; user needs Open question; Obvious practical issues Local hits can improve user topic identification But can hierarchies offer effective retrieval? Is DDC practical for deeper mapping? (costs) If not, is mapping via SKOS concept URIs? How can services database best categorise mapping and terminology services, schemes and versions for clients?

Other Issues Impacting on Futures Currently a probable path, but too early to be sure Within Phase III, still have to: Complete illustrative mappings, SOAP, SRW server/clients Look at the feasibility and design of a distributed approach Baseline service possibility looks most attractive if distributed, but implications of this yet to be explored Unknowns: JISC review of shared services (like HILT) and a report to JISC on the terminologies area generally

Further Information Website (all HILT phases): s: HILT (SOAP) Demonstrator: