Interfaces for Selecting and Understanding Collections.

Slides:



Advertisements
Similar presentations
Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
Advertisements

Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Jane Reid, AMSc IRIC, QMUL, 13/11/01 1 IR interfaces Purpose: to support users in information-seeking tasks Issues: –Functionality –Usability Motivations.
MULTIMEDIA Development Team.
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
SIMS 296a-3: Aids for Source Selection Carol Butler Fall ‘98.
Managing Distributed Collections: Evaluating Web Page Change, Movement, and Replacement Richard Furuta and Frank Shipman Center for the Study of Digital.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
1 Discussion Class 12 User Interfaces and Visualization.
Search engines. The number of Internet hosts exceeded in in in in in
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Information Retrieval
ISP 433/633 Week 12 User Interface in IR. Why care about User Interface in IR Human Search using IR depends on –Search in IR and search in human memory.
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
Projects in the Intelligent User Interfaces Group Frank Shipman Associate Director, Center for the Study of Digital Libraries.
Overview of Search Engines
Internet Research Search Engines & Subject Directories.
Vienna University of Economics and Business Administration User-centered Navigation Re-Design for Web-based Information Systems Michael Hahsler, Department.
Blaz Fortuna, Marko Grobelnik, Dunja Mladenic Jozef Stefan Institute ONTOGEN SEMI-AUTOMATIC ONTOLOGY EDITOR.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
AuthorLink: Instant Author Co-Citation Mapping for Online Searching Xia Lin Howard D. White Jan Buzydlowski Drexel University Philadelphia,
Internet Research Fourth Edition Unit C. Internet Research – Illustrated, Fourth Edition 2 Internet Research: Unit C Browsing Subject Guides.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Which of the two appears simple to you? 1 2.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Fourth Edition Discovering the Internet Discovering the Internet Complete Concepts and Techniques, Second Edition Chapter 3 Searching the Web.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
Personalized Search Xiao Liu
Introduction to Search Tools
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
The Savvy Cyber Teacher ® Using the Internet Effectively in the K-12 Classroom 1 Copyright © 2003 Stevens Institute of Technology, CIESE, All Rights Reserved.
Algorithmic Detection of Semantic Similarity WWW 2005.
Translating Dialects in Search: Mapping between Specialized Languages of Discourse and Documentary Languages Vivien Petras UC Berkeley School of Information.
Working Memory and Learning Underlying Website Structure
Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.
Searching for NZ Information in the Virtual Library Alastair G Smith School of Information Management Victoria University of Wellington.
Internet Network of networks Mother of all networks
LIR 10: Week 10 Advanced WWW Topics. Class Announcements New features on Section 2904 Schedule Missing Homework Online Quiz due 11/16 Another WWW directory.
1 SEARCHING FOR TRUTH Locating Information on the WWW chapter 5.
Information Visualization, Human-Computer Interaction, and Cognitive Psychology: Domain Visualizations Kevin W. Boyack Sandia National Laboratories.
Web 2.0: Making the Web Work for You, Illustrated Unit A: Research 2.0.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Use Google Scholar! What the experts say: Use Google Scholar Use simple search for articles on library homepage Better: in the digital library main screen.
Smart Web Search Agents Data Search Engines >> Information Search Agents - Traditional searching on the Web is done using one of the following three: -
Types Pros & cons.  A program for the retrieval of data, files, or documents from a database or network, esp. the Internet.  Search engines usually.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Searching the Web for academic information Ruth Stubbings.
Information Organization: Overview
Federated & Meta Search
PubMed Database Interface (Basic Course Module 4 Part A)
Search Engines & Subject Directories
Computer Literacy BASICS: A Comprehensive Guide to IC3, 3rd Edition
Data Mining Chapter 6 Search Engines
Magnet & /facet Zheng Liang
Search Engines & Subject Directories
Search Engines & Subject Directories
CS/INFO 430 Information Retrieval
Information Retrieval and Web Design
Information Organization: Overview
Information Retrieval
Presentation transcript:

Interfaces for Selecting and Understanding Collections

Selecting from Collections Collections are sets of documents that have been coalesced by a human or system. Traditional collections: –NLM’s MedLine –ACM Digital Library –LEXIS-NEXIS –Library/museum resources from a particular donor How do people with information needs locate and identify the appropriate collections?

Does it Matter? Web search engines (e.g. Google) get us the information we need … –well maybe Web search drops users into the middle of a collection without any understanding of the collection and its overall characteristics. Web search misses –Lots of more structured materials “the hidden web” –Subscription-based content Which is likely the best edited, most accurate, and most valuable in specialized domains

Interfaces over Multiple Collections Interfaces for Selecting and Understanding Collections –Lists –Overviews –Examples –Automated source selection

Lists of Collections Usually just provides a list of collection names. –Difficult to select from if user does not know the collections beforehand –Over time people bookmark collections of value Need tools for helping users who are outside of their areas of expertise

Example

Examples

Overviews of Collections Overviews provide a sense of what is in a collection Overviews can be –Based on a category or directory structure –Automatically derived from the collection Presentation of an overview is often a form of information visualization

Category-based Overviews MedLine – biomedical collection –Medical Subject Headings (MeSH) consists of 18,000 categories in a directed acyclic graph ACM Digital Library – computer science collection –Hierarchy of 1200 catgory (keyword) labels Yahoo – the Web –Graph of directories (probably a DAG) Humans have to place documents in categories –Author for ACM DL, subject experts for MedLine, surfers for Yahoo

MeSH Browser

HiBrowse Browser

ConeTrees

Radial Views

Hyperbolic Views

MediaMetro

Automatically Derived Overviews Apply clustering algorithms to document collection –Remember Automatic Global Analysis Use of co-occurrance and co-citation Use of distance-based clustering approaches like hierarchic agglomerative clustering Need methods to determine labels for clusters –Could be a document identification of centroid (document most similar to all others) Identification of hubs (document most mentioned by cluster) –Could be one or more terms Use most common / best differentiator (using TF-IDF) No human intervention required –but people are likely to be valuable as editors

Scatter Gather

Evaluation of Scatter-Gather Scatter-Gather –Scatter-Gather conveyed overview of collection contents –Scatter-Gather without search was less effective than a basic search –Need to combine clustering with search

Themescapes

More Themescapes

Kohonen Maps

Evaluation of Graphical Overviews One study found that non-experts found the clustering results difficult to use (worse than text-based views like Scatter-Gather) Comparison of Kohonen map and Yahoo –11 of 15 subjects found “interesting” page using Kohonen 8 were able to find same page using Yahoo –14 of 16 subjects found “interesting” page using Yahoo 2 were able to find same page using Kohonen –Subjects liked ability to jump between categories without backing out of current category Unsupervised thematic overviews probably better for giving a gist of what is in a collection than for search.

Examples, Dialogs, Wizards Retrieval by reformulation –Start with example queries Rabbit, Helgon –Can be difficult to find appropriate starting query Wizards –Found to be helpful for users without necessary domain knowledge get through many step processes –Not helpful when wizard not accompanied with help –Not useful when goal is teaching how to use the interface. Guided tours –Presents a logical sequence of navigation choices for accomplishing a goal (e.g. Waldens Paths) –Not evaluated with regards to information access

Automated Source Selection Selecting collection automatically (but explicitly) Need a model of each collection –What it covers, need model of topics –What it is good at, need metric for good Develop a model of the user’s information need Match the information need to the most valuable collections for that topic Used in meta-search – interesting area of research Could be starting point for interactive collection selection.