Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS-9984741.

Slides:



Advertisements
Similar presentations
Research Methods and Usability Guidelines for Ecommerce Web Sites Mary Czerwinski Microsoft Research Note: Many of these slides came from a Keynote address.
Advertisements

Content Metadata and Search Remarks to the Dublin Core Workshop Marti Hearst SIMS, UC Berkeley September 28, 2003.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
ORGANIZING THE CONTENT Physical Structure
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
UCB Computer Vision Animals on the Web Tamara L. Berg CSE 595 Words & Pictures.
Automating Creation of Hierarchical Faceted Metadata Structures Emilia Stoica, Marti Hearst and Megan Richardson* School of Information, Berkeley *Dept.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
1 Using Words to Search a Thousand Images Hierarchical Faceted Metadata in Search & Browsing Marti Hearst SIMS, UC Berkeley Research funded by: NSF CAREER.
Semi-Automated Creation of Facet Hierarchies Marti Hearst School of Information, UC Berkeley Joint work with Dr. Emilia Stoica.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Castanet: Using WordNet to Build Facet Hierarchies Emilia Stoica and Marti Hearst School of Information, Berkeley.
MaNIS Interface Project Mayjane Co Denise Green Jane Lee Rebecca Shapley.
Measuring Information Architecture CHI 01 Panel Position Statement Marti Hearst UC Berkeley.
1 CS 430 / INFO 430 Information Retrieval Lecture 15 Usability 3.
1 Ideas for Integrating Browsing and Search in the CDL Marti Hearst SIMS, UC Berkeley
Faceted Metadata for Site Navigation and Search Marti Hearst 12/17/2009.
Social Tagging and Search Marti Hearst UC Berkeley.
Nearly-Automated Metadata Hierarchy Creation Emilia Stoica and Marti Hearst SIMS University of California, Berkeley.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
1 Flexible Search and Navigation using Faceted Metadata Prof. Marti Hearst Dr. Rashmi Sinha, Ame Elliott, Jennifer English, Kirsten Swearingen, Ping Yee.
Measuring Information Architecture Marti Hearst UC Berkeley.
Castanet: Using WordNet to Build Facet Hierarchies Emilia Stoica and Marti Hearst School of Information, Berkeley.
Measuring Information Architecture Marti Hearst UC Berkeley.
HCI Part 2 and Testing Session 9 INFM 718N Web-Enabled Databases.
The Information School of the University of Washington Information System Design Info-440 Autumn 2002 Session #10 BOO! BOO!
Semi-Automated Creation of Facet Hierarchies Marti Hearst School of Information, UC Berkeley Joint work with Dr. Emilia Stoica.
A metadata-based approach Marti Hearst Associate Professor BT Visit August 18, 2005.
Yahoo Visit Day Joint Reseach Opportunities Marti Hearst UC Berkeley School of Information.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
SLIDE 1IS 202 – FALL 2003 Lecture 26: Final Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00.
Incorporating Metadata into Search User Interfaces Marti Hearst UC Berkeley.
Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
Faceted Metadata for Information Architecture and Search Marti Hearst, SIMS at UC Berkeley Preston Smalley & Corey Chandler, eBay User Experience & Design.
2D or 3D ? Presented by Xu Liu, Ming Luo. Is 3D always better than 2D? NO!
SIMS 213: User Interface Design & Development Marti Hearst Thurs, Jan 22, 2004.
UIs for Faceted Navigation Recent Advances and Remaining Open Problems HCIR’08 Marti Hearst, UC Berkeley (including some slides from Corey Chandler of.
Measuring Information Architecture Marti Hearst UC Berkeley.
Incorporating Metadata into Search UIs Marti Hearst UC Berkeley.
Transforming Tags to (Faceted) Tagsonomies Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS
1 Flexible Search and Navigation using Faceted Metadata Prof. Marti Hearst University of California, Berkeley Search Engines Meeting, April 2002 Research.
Considering a Faceted Search-based Model Marti Hearst UCB SIMS NAS CSTB DNS Meeting on Internet Navigation and the Domain Name.
Mining the Web for Design Guidelines Marti Hearst, Melody Ivory, Rashmi Sinha UC Berkeley.
SIMS 213: User Interface Design & Development Marti Hearst Thurs, March 14, 2002.
1 User Interface Design CIS 375 Bruce R. Maxim UM-Dearborn.
Information retrieval thur jan data…. framework for today’s lecture…
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
CSI-553 Internet Information Presented by: Ignacio Castro June 28, 2006 Internet Usability.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Visual User Interfaces David Rashty. “Grasping the whole is a gigantic theme. Arguably, intellectual history’s most important. Ant-vision is humanity’s.
JASS 2005 Next-Generation User-Centered Information Management Information visualization Alexander S. Babaev Faculty of Applied Mathematics.
Information retrieval wed sept data…. -start at 6.45.
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Faceted Search Zhao Jing Outline  What is faceted search?  Why use faceted search?  Topics of interests  Faceted Search in Dataspace.
How can Search Interfaces Enhance the Value of Semantic Annotations (and Vice Versa?) Keynote Talk ESAIR’13: Sixth International Workshop on Exploiting.
Faceted Navigation An Alternative to Search and Browse Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Websites with good heuristics Irene Wachirawutthichai.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Module 10a: Display and Arrangement IMT530: Organization of Information Resources Winter, 2008 Michael Crandall.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Supporting the design of interactive systems a perspective on supporting people’s work Hans de Graaff 27 april 2000.
NLP Support for Faceted Navigation in Scholarly Collections
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
CSE 635 Multimedia Information Retrieval
Incorporating Metadata into Search User Interfaces
Presentation transcript:

Faceted Metadata in Search Interfaces Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Focus: Search and Navigation of Large Collections Image Collections E-Government Sites Example: the University of California Library Catalog Shopping Sites Digital Libraries

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS What do we want done differently? Organization of results Hints of where to go next Flexible ways to move around … How to structure the information?

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS The Problem with Hierarchy

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS The Problem With Hierarchy

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS The Problem with Hierarchy

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS The Problem With Hierarchy Where is Berkeley? College and University > Colleges and Universities >United States > U > University of California > Campuses > Berkeley U.S. States > California > Cities >Berkeley > Education > College and University > Public > UC Berkeley

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Outline Motivation: support for browsing big collections –Focus on usability for a wide range of lay users Approach: flexible application of hierarchical faceted metadata –Advantages of the approach –Results of usability studies Opportunities for AI: –Creating faceted category hierarchies –Assigning items to categories –Combine categories to identify tasks –A way to focus for personalization research

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Why Care? These folks do: NYTimes archive eBay California Digital Library US Census

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS How to Structure Information for Search and Browsing? Hierarchy is too rigid KL-One is too complex Hierarchical faceted metadata: –A useful middle ground

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS What are facets? Sets of categories, each of which describe a different aspect of the objects in the collection. Each of these can be hierarchical. (Not necessarily mutually exclusive nor exhaustive, but often that is a goal.) Time/DateTopicRoleGeoRegion 

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Facet example: Recipes Course Main Course Cooking Method Stir-fry Cuisine Thai Ingredient Red Bell Pepper Curry Chicken

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Example of Faceted Metadata: Categories for Biomedical Journal Articles 1. Anatomy [A] 2. Organisms [B] 3. Diseases [C] 4. Chemicals and Drugs [D] 1. Lung 2. Mouse 3. Cancer 4. Tamoxifen

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Goal: assign labels from facets

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Motivation Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off. Nature Animal Mammal Horse Occupations Cowboy Clothing Hats Cowboy Hat Media Engraving Wood Eng. Location North America America

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Motivation Description: 19th c. paint horse; saddle and hackamore; spurs; bandana on rider; old time cowboy hat; underchin thong; flying off. By using facets, what we are not capturing? The hat flew off; The bandana stayed on. The thong is part of the hat. The bandana is on the cowboy (not the horse). The saddle is on the horse (not the cowboy).

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Hierarchical Faceted Metadata A simplification of knowledge representation Does not represent relationships directly BUT can be understood well by many people when browsing rich collections of information.

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS How to Put In an Interface? Some Challenges: Users don’t like new search interfaces. How to show lots of information without overwhelming or confusing?

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS A Solution (The Flamenco Project) Use proper HCI methods. Organize search results according to the faceted metadata so navigation looks similar throughout –Easy to see what to go next, were you’ve been –Avoids empty result sets –Integrates seamlessly with keyword search

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS The Flamenco Project Incorporating Faceted Hierarchical Metadata into Interfaces for Large Collections Key Goals: –Support integrated browsing and keyword search Provide an experience of “browsing the shelves” –Add power and flexibility without introducing confusion or a feeling of “clutter” –Allow users to take the path most natural to them Method: –User-centered design, including needs assessment and many iterations of design and testing

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Art History Images Collection

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Questions we are trying to answer How many facets are allowable? Should facets be mixed and matched? How much is too much? Should hierarchies be progressively revealed, tabbed, some combination? How should free-text search be integrated?

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Information previews Use the metadata to show where to go next –More flexible than canned hyperlinks –Less complex than full search Help users see and return to previous steps Reduces mental work –Recognition over recall –Suggests alternatives More clicks are ok iff (J. Spool) The “scent” of the target does not weaken If users feel they are going towards, rather than away, from their target.

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS What is Tricky About This? It is easy to do it poorly It is hard to be not overwhelming –Most users prefer simplicity unless complexity really makes a difference –Small details matter It is hard to “make it flow”

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS eBay Products

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Search Usability Design Goals 1.Strive for Consistency 2.Provide Shortcuts 3.Offer Informative Feedback 4.Design for Closure 5.Provide Simple Error Handling 6.Permit Easy Reversal of Actions 7.Support User Control 8.Reduce Short-term Memory Load From Shneiderman, Byrd, & Croft, Clarifying Search, DLIB Magazine, Jan

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Usability Studies Usability studies done on 3 collections: –Recipes: 13,000 items –Architecture Images: 40,000 items –Fine Arts Images: 35,000 items Conclusions: –Users like and are successful with the dynamic faceted hierarchical metadata, especially for browsing tasks –Very positive results, in contrast with studies on earlier iterations.

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Post-Test Comparison FacetedBaseline Overall Assessment More useful for your tasks Easiest to use Most flexible More likely to result in dead ends Helped you learn more Overall preference Find images of roses Find all works from a given period Find pictures by 2 artists in same media Which Interface Preferable For:

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Advantages of the Approach Honors many of the most important usability design goals –User control –Provides context for results –Reduces short term memory load –Allows easy reversal of actions –Provides consistent view Allows different people to add content without breaking things Can make use of standard technology

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Advantages of the Approach Systematically integrates search results: –reflect the structure of the info architecture –retain the context of previous interactions Gives users control and flexibility –Over order of metadata use –Over when to navigate vs. when to search Allows integration with advanced methods –Collaborative filtering, predicting users’ preferences

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Disadvantages Does not model relations explicitly Does it scale to millions of items? –Adaptively determine which facets to show for different combinations of items Requires faceted metadata!

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Opportunities for AI Creating hierarchical faceted categories –Assigning items to those categories –Adaptively adding new facets as data changes A new approach to personalization: –User-tailored facet combinations Create task-based search interfaces –Equate a task with a sequence of facet types

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Creating Classifications from Data Most approaches are associational –AKA clustering, LSA, LDA, etc. –This leads to poor results when applied to text To derive facets, need a different angle –We have a simple approach based on WordNet

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Clustering (The Hope)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Clustering (The Hope)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Clustering (The Reality)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Clustering (The Reality)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Example: Recipes (3500 docs)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Blei, Ng, & Jordan ’03 (Latent Dirichlet Allocation)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Blei, Ng, & Jordan ’03 (Latent Dirichlet Allocation)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Sanderson & Croft ’99 Term Subsumption

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Sanderson & Croft ’99 Term Subsumption

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Example: AP Newswire P-2 ABSTRACT The Bechtel Group Inc. offered in 1985 to sell oil to Israel at a discount of at least $650 million for 10 years if it promised not to bomb a proposed Iraqi pipeline, a Foreign Ministry official said Wednesday. But then-Prime Minister Shimon Peres said the offer from Bruce Rappaport, a partner in the San Francisco-based construction and engineering company, was ``unimportant,'' the senior official told The Associated Press. Peres, now foreign minister, never discussed the offer with other government ministers, said the official, who spoke on condition of anonymity. The comments marked the first time Israel has acknowledged any offer was made for assurances not to bomb the planned $1 billion pipeline, which was to have run near Israel's border …

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Blei, Ng, & Jordan ’03 (Latent Dirichlet Allocation)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Stoica & Hearst ’04 WordNet-based

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Associational techniques Pros: –Sometimes terms grouped to get a general concept Airline, airplane, pilots, flight Cons: –Highly unpredictable –Not comprehensive Dollar and yen but no deutchmarks Eastern but no other directions –Not uniform in subject matter Mixing currencies with countries with timing Mixing compass directions with airlines

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Lexical Hierarchy-based Pros –Faceted and hierarchical –Consistent is-a hierarchies –Comprehensiveness more likely Cons –Doesn’t provide overall themes Airlines, pilots, airplanes –Sometimes uses wrong word sense –Sometimes the right term/hierarchy is not present Doesn’t have “dish type” nor “cuisine” for recipes Specialized domains won’t work

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Our Approach Leverage the structure of WordNet Documents WordNet Get hypernym paths Select terms Build tree Compress tree

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Our Approach Leverage the structure of WordNet

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS 1. Select Terms red blue Select well distributed terms from collection Documents WordNet Get hypernym paths Select terms Build tree Comp. tree

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS 2. Get Hypernym Path red blue chromatic color abstraction property visual property color red, redness abstraction property visual property color blue, blueness chromatic color Documents WordNet Get hypernym paths Select terms Build tree Comp. tree

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS 3. Build Tree red blue chromatic color abstraction property visual property color red, redness abstraction property visual property color blue, blueness chromatic color red blue abstraction property visual property color red, redness chromatic color blue, blueness Documents WordNet Get hypernym paths Select terms Build tree Comp. tree

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS 4. Compress Tree Documents WordNet Get hypernym paths Select terms Build tree Comp. tree red, redness color red chromatic color blue, blueness blue green, greenness green red color chromatic color blue

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS 4. Compress Tree (cont.) red color chromatic color blue green color redbluegreen Documents WordNet Get hypernym paths Select terms Build tree Comp. tree

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Disambiguation Ambiguity in: –Word senses –Paths up the hypernym tree Sense 1 for word “tuna” organism, being => plant, flora => vascular plant => succulent => cactus => tuna Sense 2 for word “tuna” organism, being => fish => food fish => tuna => bony fish => spiny-finned fish => percoid fish => tuna 2 paths for same word2 paths for same sense

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS How to Select the Right Senses and Paths? First: build core tree –(1) Create paths for words with only one sense –(2) Use Domains Wordnet has 212 Domains –medicine, mathematics, biology, chemistry, linguistics, soccer, etc. Automatically scan the collection to see which domains apply The user selects which of the suggested domains to use or may add own Paths for terms that match the selected domains are added to the core tree Then: add remaining terms to the core tree.

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Using Domains dip glosses: Sense 1: A depression in an otherwise level surface Sense 2: The angle that a magnet needle makes with horizon Sense 3: Tasty mixture into which bite-size foods are dipped dip hypernyms Sense 1 Sense 2 Sense 3 solid shape, form food => concave shape => space => ingredient, fixings => depression => angle => flavorer Given domain “food”, choose sense 3

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Opportunities for AI New opportunity: Tagging, folksonomies –(flickr de.lici.ous) –People are created facets in a decentralized manner –They are assigning multiple facets to items –This is done on a massive scale –This leads naturally to meaningful associations

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS This Doesn’t Solve Everything Harder to determine what’s related to more complex terms Still not good for finding a recipe using potatoes

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Linking Metadata Into Tasks Old Yahoo restaurant guide combined: –Region –Topic (restaurants) –Related Information Other attributes (cuisines) Other topics related in place and time (movies)

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Green: restaurants & attributes Red: related in place & time Yellow: geographic region

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Other Possible Combinations Region + A&E City + Restaurant + Movies City + Weather City + Education: Schools Restaurants + Schools …

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Creating Tasks from HFM Recipes Example: –Click Ingredient > Avocado –Click Dish > Salad –Implies task of “I want to make a Dish type d with an Ingredient i that I have lying around” –Maybe users will prefer to select tasks like these over navigating through the metadata.

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Summary Flexible application of hierarchical faceted metadata is a proven approach for navigating large information collections. –Midway in complexity between simple hierarchies and deep knowledge representation. Perhaps HFM is a good stepping stone to deeper semantic relations –Currently in use on e-commerce sites; spreading to other domains

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS AI Opportunities Creating hierarchical faceted categories –Assigning items to those categories –Adaptively adding new facets as data changes A new approach to personalization: –User-tailored facet combinations Create task-based search interfaces –Equate a task with a sequence of facet types Make use of folksonomies data!

AAAI’05 Invited talk: Faceted Metadata in Search InterfacesMarti Hearst: UC Berkeley SIMS Acknowledgements Flamenco team –Brycen Chun –Ame Elliott –Jennifer English –Kevin Li –Rashmi Sinha –Emilia Stoica –Kirsten Swearingen –Ping Yee Thanks also to NSF (IIS )

Thank you! Marti Hearst UC Berkeley School of Information This Research Supported by NSF IIS