Potential of freely faceted classification for knowledge retrieval and browsing Claudio Gnoli University of Pavia, Italy 7th NKOS workshop, Aarhus, 19.

Slides:



Advertisements
Similar presentations
8th ISKO Spain conference León: April 2007 A new relationship for multidisciplinary knowledge organization systems: dependence Claudio Gnoli (University.
Advertisements

Solving the Information Needs of Transdisciplinarians through Classification? Rick Szostak, University of Alberta Claudio Gnoli, University of Pavia TD-Net.
Ranganathan revisited: facets for the future ISKO UK meeting, London, November 5th, 2007 Classic vs. freely faceted classification Claudio Gnoli (ISKO.
Universal and Domain-specific Classifications from an Interdisciplinary Perspective Rick Szostak University of Alberta, Canada.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Thesaurus speed dating conclusions. The ideal thesaurus… …is tailor-made for the special needs of its user community. In other words, it is different.
The Biosafety Clearing-House of the Cartagena Protocol on Biosafety Tutorial – BCH Resources.
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Semantic Access to Data from the Web Raquel Trillo *, Laura Po +, Sergio Ilarri *, Sonia Bergamaschi + and E. Mena * 1st International Workshop on Interoperability.
Not just numbers on shelves: using the DDC for information retrieval Gordon Dunsire Presented at the Symposium “Bridging the class(ification) divide: the.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
A New Learning Tools. Topic Maps is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
SCULPTEUR: Multimedia Retrieval for Museums S. Goodall, P. H. Lewis, K. Martinez, P. A. S. Sinclair, F. Giorgini, M. J. Addis, M. J. Boniface, C. Lahanier,
Introduction to Library Research Gabriela Scherrer Reference Librarian for English Languages and Literatures, University Library of Bern.
Introduction to Databases CIS 5.2. Where would you find info about yourself stored in a computer? College Physician’s office Library Grocery Store Dentist’s.
Antonella De Robbio, Dario Maguolo Mathematics Library – University Library System University of Padova – ITALY Mathematics Subject Classification and.
Search engines. The number of Internet hosts exceeded in in in in in
“A successful man is usually a classifier and a chartmaker. This applies as much to modern business as to science or libraries… A large business or work.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Why classification matters The foundations of bibliographic classification.
Developing facets in UDC for online retrieval Claudio Gnoli (University of Pavia) Aida Slavic (UDC Consortium) 8th NKOS Workshop, Corfu, 1 Oct 2009.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
On the Razor’s Edge Between Local and Overall Needs in Knowledge Organization.
In The Name Of God. Jhaleh Narimisaei By Guide: Dr. Shadgar Implementation of Web Ontology and Semantic Application for Electronic Journal Citation System.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
LIS618 lecture 1 Thomas Krichel economic rational for traditional model In olden days the cost of telecommunication was high. database use.
IFLA Classification and indexing satellite pre-conference Florence, August 20-21, 2009 Animals belonging to the emperor: enabling viewpoint warrant in.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Food and Agriculture Organization of the UN Library and Documentation Systems Division July 2005 Ontologies creation, extraction and maintenance 6 th AOS.
ISKO UK 2011, London Vickery’s late ideas on classification by phenomena and activities Claudio Gnoli (University of Pavia. Science and Technology Library)
8th ISKO France conference, Lille, June Metadata about what ? Distinguishing between ontic, epistemic, and documental dimensions in KO Claudio.
Search Engine Architecture
ISKO 2010, Rome, Feb 2010 Workshop on Levels of reality as a KO paradigm Levels, types, facets: three structural principles for KO Claudio Gnoli.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
ARD Prasad Indian Statistical Institute, Bangalore.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Domain Modeling In FREMA Yvonne Howard David Millard Hugh Davis Gary Wills Lester Gilbert Learning Societies Lab University of Southampton, UK.
Judit Tóvári PhD Eszterházy Károly College, Eger (Hungary) Institute of Media Informatics From librarian to information manager.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
CIW Lesson 6MBSH Mr. Schmidt1.  Define databases and database components  Explain relational database concepts  Define Web search engines and explain.
Ontological foundations in knowledge organization The theory of integrative levels applied in citation order Claudio Gnoli University of Pavia. Science.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
FSR, Feb 2014, Rome Which knowledge organization systems for conceptual interoperability? Claudio Gnoli ISKO Italy.
Topic Maps for Cultural Heritage Collections Conal Tuohy Senior Developer New Zealand Electronic Text Centre
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
CS276B Text Information Retrieval, Mining, and Exploitation Practical 1 Jan 14, 2003.
SciGator a DDC-based browsing library interface
Search Engine Architecture
User-Adaptive Systems
Eric Sieverts University Library Utrecht Institute for Media &
From Knowledge Organization (KO) to Knowledge Representation (KR)
Information Retrieval
Introduction to Information Retrieval
Search Engine Architecture
Information Retrieval and Web Design
Presentation transcript:

Potential of freely faceted classification for knowledge retrieval and browsing Claudio Gnoli University of Pavia, Italy 7th NKOS workshop, Aarhus, 19 September 2008

Classification has a reputation of old KOS, aimed at pre-digital applications... Mr. Dewey BNCF Florence...but can be applied to indexing and extracting any information from a database !

Classification unlike other KOSs, allows for systematic browsing (“helpful sequences”). FAT-HUM Ontology researchers don’t look much interested in sorting, though this is essential to find relevant information (e.g. Google Page Rank)

Classified browsing Systematic sorting is functional... A engagement B marriage C separation D divorce...as compared to alphabetical divorce engagement marriage separation (from a real case while designing a government website)

Frequent limitations to digital exploitment. 1 Notation not expressive of hierarchy: A animals B birds C mammals CA carnivores CB rodents...  truncated queries are problematic: how to search for “any animal” ?

Frequent limitations to digital exploitment. 2 Single concept = different notations depending on discipline  multidisciplinary search is difficult: how to search for “anything related to iron” ? Current efforts in UDC to improve this 546 inorganic chemistry iron 553 economic geology iron and manganese ores 669 metallurgy ferrous metallurgy 67 industries, trades and crafts 672 articles of iron and steel

Integrative Level Classification Developing and testing a non-disciplinary, freely faceted classification scheme

Integrative Level Classification Each concept has constant notation, combinable with any other  Notation acts as unique concept identifier recorded in MySQL with: verbal caption synonyms facets semantic factors

Integrative Level Classification Retrieval and sorting are made through notation, although users can interact through verbal captions 

Applications different domains chemistry bioacoustics government traditional culture facet analysis different resource types bibliographies web directory website architecture

Search by class  User browses classes and selects one:

Classified retrieval Occurrences of the class combined with any other are displayed and sorted by class

Global vs. local But, within a domain, specialized concepts with long notations can be frequent: jUe Europe jUei Italian peninsula jUeip Apennine jUeipg Ligurian Apennine jUeipg Antola chain jUeipgh Curone valley Universal KOSs help interoperability, as each concept is uniquely identified: mpf flowering plants

Global vs. local To conceal the two, we need to specify both global and local meanings by an AR-complex [Wåhlin 1971] We state: H = jUeipgh Curone valley  mpf H flowering plants : Curone valley (free cl.) mpf2H flowering plants, in Curone valley (freely faceted cl.)

AR-complex mpf H flowering plants : Curone valley lower-case letters are from the Reference system upper-case letters are from the Adapted system (deictics: meaning changes according to context) In ASCII, upper-case letters file before, thus local concepts will be listed before (favoured host class [Ranganathan 1967] )

AR-complex Local KOSs can interoperate with a global KOS by mapping their deictics on it: H = jUeipgh Curone valley Mapping can be more complex if the two KOSs have different hierarchies: O = jUeipgo + jUeilgau +... Trebbia valley, being partly in Genova province (Liguria), partly in Piacenza province (Emilia), and a little section in Pavia province (Lombardia)

Mapping different hierarchies H Trebbia valley

Retrieval of a facet IR systems can retrieve a facet from any position within a string: mpf H flowering plants : Curone valley Still, citation order is crucial for browsing: H Curone valley mpf H flowering plants : Curone valley nyr H woods : Curone valley nyr mpf H woods : flowering plants : Curone valley

Freely faceted classification is powerful for retrieval, sorting, browsing......so why is it poorly used? Most Internet directories prefer alphabetical sorting:

Faceted ontologies? Integration between ontology and FC is rare (ongoing research in DRTC, GFO) implying a high risk of reinventing the wheel.

Reasons for under-exploitment of FC poorly known, nor taken seriously enough in LIS schools; not considered by designers of interfaces, even if known by librarians cooperating in the same projects; more complex than other KOSs, thus requiring initial investment; convincing examples hardly available on the Internet...

Complex KOSs need several layers working together : conceptual structure local schemes (deictics) notation verbal captions in natural languages database management indexing interface user interface...  indexer-friendly!

Indexing interface The indexer can edit the classmark and dynamically see the caption she is producing

Indexing interface She can be helped by automatic suggestions generated by matching title with DB thesaurus document title edited notation automatic caption suggested classes

to produce the convincing examples (FACET docet). Strategies: produce user-friendly interfaces through open source software (MySQL, PHP) make results freely available on the Internet provide links to very popular resources (eg Wikipedia) ? join FFC with some Web 2.0 resource? Work in progress

ILC people: Claudio Gnoli, Mela Bosch, Enzo Cesanelli, Philippe Cousson, Viviana Doldi, Hong Mei, Gabriele Merli, Marcella Patania, Roberto Poli, Rick Szostak, Lorena Zuccolo Published reports: Gnoli & Poli 2004, Levels of reality and levels of representation, Knowl org 31, 3, Gnoli & Merli 2005, Notazione e interfaccia di ricerca per una classificazione a livelli, AIDA informazioni, 23, 1-2, Hong 2005, A phenomenon approach to faceted classification, 53th conf Japan Soc LIS Gnoli 2006, The meaning of facets in nondisciplinary classifications, proc 9th ISKO conf, Vienna, Gnoli & Hong 2006, Freely faceted classification for Web-based information retrieval, New rev hypermedia & multimedia, 12, 1, Gnoli, Bosch & Mazzocchi 2007, A new relationship for multidisciplinary knowledge organization systems: dependence, proc 8th ISKO Spain conf, León, Gnoli 2007, “Classic” vs. “freely” faceted classification, ISKO UK meeting Ranganathan revisited, London Gnoli 2008, Freely faceted classification for a Web-based bibliographic archive: the BioAcoustic Reference Database, proc ISKO D conf, Konstanz Szostak & Gnoli 2008, Classifying by phenomena, theories, and methods, proc 10th ISKO conf, Montréal

Thank you! Comments welcome: