ISP 433/633 Week 12 User Interface in IR. Why care about User Interface in IR Human Search using IR depends on –Search in IR and search in human memory.

Slides:



Advertisements
Similar presentations
Recuperação de Informação B Cap. 10: User Interfaces and Visualization 10.1,10.2,10.3 November 17, 1999.
Advertisements

Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , , November 29, 1999.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
10/4/01 IS202: Information Organization & Retrieval Interfaces for Information Retrieval Ray Larson & Warren Sack IS202: Information Organization and Retrieval.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
Jane Reid, AMSc IRIC, QMUL, 13/11/01 1 IR interfaces Purpose: to support users in information-seeking tasks Issues: –Functionality –Usability Motivations.
Information Retrieval Visualization CPSC 533c Class Presentation Qixing Zheng March 22, 2004.
Information Retrieval in Practice
Search Engines and Information Retrieval
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Interfaces for Retrieval Results. Information Retrieval Activities Selecting a collection –Talked about last class –Lists, overviews, wizards, automatic.
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
SIMS 213: User Interface Design & Development Marti Hearst Thurs, March 3, 2005.
More Interfaces for Retrieval. Information Retrieval Activities Selecting a collection –Lists, overviews, wizards, automatic selection Submitting a request.
Database Management Systems, R. Ramakrishnan1 Computing Relevance, Similarity: The Vector Space Model Chapter 27, Part B Based on Larson and Hearst’s slides.
9/18/2001Information Organization and Retrieval Vector Representation, Term Weights and Clustering (continued) Ray Larson & Warren Sack University of California,
1 CS 430 / INFO 430 Information Retrieval Lecture 15 Usability 3.
SIMS 296a-3: Aids for Source Selection Carol Butler Fall ‘98.
Interfaces for Selecting and Understanding Collections.
INFM 700: Session 7 Unstructured Information (Part II) Jimmy Lin The iSchool University of Maryland Monday, March 10, 2008 This work is licensed under.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
SIMS 296a-3: UI Background Marti Hearst Fall ‘98.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
SLIDE 1IS 202 – FALL 2003 Lecture 23: Interfaces for Information Retrieval II Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday.
1 Discussion Class 12 User Interfaces and Visualization.
WMES3103: INFORMATION RETRIEVAL WEEK 10 : USER INTERFACES AND VISUALIZATION.
SLIDE 1IS 202 – FALL 2002 Lecture 24: Interfaces for Information Retrieval Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and.
Overview of Search Engines
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
Ch 6 - Menu-Based and Form Fill-In Interactions Yonglei Tao School of Computing & Info Systems GVSU.
JASS 2005 Next-Generation User-Centered Information Management Information visualization Alexander S. Babaev Faculty of Applied Mathematics.
1 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Adapting the TileBar Interface for Visualizing Resource Usage Session 602 Larry.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
User Interfaces & IR Readings Review - Googling Further - HomeFinder Live Demos!
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
1 Computing Relevance, Similarity: The Vector Space Model.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
Document Collections cs5984: Information Visualization Chris North.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets  Selection part 2: Result sets Examination.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Translating Dialects in Search: Mapping between Specialized Languages of Discourse and Documentary Languages Vivien Petras UC Berkeley School of Information.
Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets Selection part 2: Result sets  Examination.
User Interfaces for Information Access Prof. Marti Hearst SIMS 202, Lecture 26.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
What Happens After the Search? User Interface Ideas for Information Retrieval Results Marti A. Hearst Xerox PARC.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
WHIM- Spring ‘10 By:-Enza Desai. What is HCIR? Study of IR techniques that brings human intelligence into search process. Coined by Gary Marchionini.
SIMS 202, Marti Hearst Content Analysis Prof. Marti Hearst SIMS 202, Lecture 15.
Information Retrieval in Practice
Information Organization: Overview
Search Engine Architecture
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
User Interfaces & IR Readings Review Live Demos!
Augmenting (personal) IR
Professor John Canny Fall 2001 Nov 29, 2001
Professor John Canny Spring 2003
SIMS 202 Information Organization and Retrieval
Visualization of Web Search Results in 3D
Visualizing Document Collections
Document Clustering Matt Hughes.
Information Organization: Overview
Introduction to Search Engines
Presentation transcript:

ISP 433/633 Week 12 User Interface in IR

Why care about User Interface in IR Human Search using IR depends on –Search in IR and search in human memory One search in service of another Lexical Memory -> ComputerIR -> Visual Design problem is really to make them work together better –Don’t just speed up search engine! –Provide help for Lexical Memory Problem –Provide help for Visual Search of Results

Slide by James Landay What is HCI? HumansTechnology Task Design Organizational & Social Issues

Goal of HCI Well-designed interactive computer systems –Promote Positive feelings of success Competence Mastery –Allow users to concentrate on their work, exploration, or pleasure, rather than on the system or the interface

Interaction Paradigms for IR Direct manipulation –Query specification –Query refinement –Result selection Delegation –Agents –Recommender systems –Filtering

HCI Questions for IR Where does a user start? –Faced with a large set of collections, how can a user choose one to begin with? How will a user formulate a query? How will a user scan, evaluate, and interpret the results? How can a user reformulate a query?

Starting Points Faced with a prompt or an empty entry form … how to start? –Lists of sources –Overviews Clusters Category Hierarchies/Subject Codes Co-citation links –Examples, Wizards, and Guided Tours –Automatic source selection

List of Sources Have to guess based on the name Requires prior exposure/experience

Overviews Supervised (manual) category overviews –Yahoo! –MeSHBrowse Unsupervised (automated) groupings –Clustering –Kohonen feature maps

Example: MeSH and MedLine MeSH category hierarchy –Medical Subject Headings –~18,000 labels –Manually assigned –~8 labels/article on average –Average depth: 4.5 –Max depth: 9 Top level categories: anatomydiagnosisrelated disc animalspsychtechnology diseasebiologyhumanities drugsphysics

MeshBrowse (Korn & Shneiderman 95)

Pro and con of Category Labels Advantages: –Interpretable –Capture summary information –Describe multiple facets of content –Domain dependent, and so descriptive Disadvantages –Do not scale well (for organizing documents) –Domain dependent, so costly to acquire –Vocabulary Problem!

Text Clustering What clustering does: –Finds overall similarities among groups of documents –Finds overall similarities among groups of tokens –Picks out some themes, ignores others How clustering works: –Cluster entire collection –Find cluster centroid that best matches the query –Problems with clustering It is expensive It doesn’t work well

Scatter/Gather How it works –Cluster sets of documents into general “themes”, like a table of contents –Display the contents of the clusters by showing topical terms and typical titles –User chooses subsets of the clusters and re- clusters the documents within –Resulting new groups have different “themes” Originally used to give collection overview Evidence suggests more appropriate for displaying retrieval results in context

S/G Example: Query on “star”

Another Use of Clustering Use clustering to map the entire huge multidimensional document space into a number of small clusters “Project” these onto a 2D graphical representation

“ThemeScapes” Clustering

Kohonen Feature Maps on Text

Pro and con of Clustering Advantages: –Get an overview of main themes –Domain independent Disadvantages: –Many of the ways documents could group together are not shown –Not always easy to understand what they mean –Can’t see what documents are about –Documents forced into one position in semantic space –Hard to view titles Perhaps more suited for pattern discovery –Not good for search

Query Specification Interaction styles (Shneiderman 97) –Command language –Form fill –Menu selection –Direct manipulation –Natural language What about gesture, eye-tracking, or implicit inputs like reading habits?

Direct Manipulation Query Specification

Menu-Based Query Specification

Display of Retrieval Results Goal: –Minimize time/effort for deciding which documents to examine in detail Idea: –Show the roles of the query terms in the retrieved documents, making use of document structure

Putting Results in Context Interfaces should –Give hints about the roles terms play in the collection –Give hints about what will happen if various terms are combined –Show explicitly why documents are retrieved in response to the query –Summarize compactly the subset of interest To support both Query and Navigation

{Query v. Nav} x {Search v. Browse} QueryNavigate Browse Search Tasks Strategies

Hybrid strategies: Query + Navigation 1. Query Guided Navigation E.g., superbook –Query hits posted against structure –Pattern of hits gives additional navigational cues for your target 2. Query Initiated Navigation E.g., typical use of WWW query engine –Query to neighborhood, then navigate to target –Your vocabulary choice just has to occur near your target

Superbook Interface to very large hierarchically structured texts. Full text searching Posting hits against the table of contents Fisheye View of table of contents Highlighting hits in the text

TileBars Graphical representation of term distribution and overlap Simultaneously indicate: –Relative document length –Query term frequencies –Query term distributions –Query term overlap

TileBars Example Mainly about both DBMS & reliability Mainly about DBMS, discusses reliability Mainly about, say, banking, with a subtopic discussion on DBMS/Reliability Mainly about high-tech layoffs Query terms: DBMS and Reliability

VIBE (Olson et al. 93, Korfhage 93)

VR-VIBE

Query Reformulation Thesaurus expansion –Suggest terms similar to query terms Relevance feedback –Suggest terms (and documents) similar to retrieved documents that have been judged to be relevant –“More like this” interaction

Letizia (Lieberman) Watches what you like –did you save page –use it as a jumping off place –what search terms have you used –passed-over link Extracts keywords etc. to profile user’s interest Looks ahead in your neighborhood evaluating what it finds