1 Ranked-Listed or Categorized Results in IR Zheng Zhu, Ingemar J. Cox, Mark Levene Birkbeck College, University of London UCL.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

Info to Enterprise Migration Implementation Case Study: SBC Corporation Presented to the Crystal Decisions Regional Users Group for the Bay Area on October.
Vorlesung Datawarehousing Table of Contents Prof. Rudolf Bayer, Ph.D. Institut für Informatik, TUM SS 2002.
Slide 1 Insert your own content. Slide 2 Insert your own content.
1 Copyright © 2004 M. E. Kabay. All rights reserved. Database Design (2) IS 240 – Database Management Lecture #11 – Prof. M. E. Kabay, PhD,
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
1 Chapter 40 - Physiology and Pathophysiology of Diuretic Action Copyright © 2013 Elsevier Inc. All rights reserved.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Improving Human-Semantic Web Interaction: The Rhizomer Experience Roberto García and Rosa Gil GRIHO - Human Computer Interaction Research Group Universitat.
eClassifier: Tool for Taxonomies
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Coordinate Plane Practice The following presentation provides practice in two skillsThe following presentation provides practice in two skills –Graphing.
0 - 0.
ALGEBRAIC EXPRESSIONS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULTIPLYING MONOMIALS TIMES POLYNOMIALS (DISTRIBUTIVE PROPERTY)
ADDING INTEGERS 1. POS. + POS. = POS. 2. NEG. + NEG. = NEG. 3. POS. + NEG. OR NEG. + POS. SUBTRACT TAKE SIGN OF BIGGER ABSOLUTE VALUE.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 4.1 Chapter 4 : Searching the Web The mechanics.
WWW Search and Navigation Mark Levene SCIS, Birkbeck College University of London
COMP3410 DB32: Technologies for Knowledge Management Lecture 4: Inverted Files and Signature Files for IR By Eric Atwell, School of Computing, University.
ZMQS ZMQS
Photo Composition Study Guide Label each photo with the category that applies to that image.
Richmond House, Liverpool (1) 26 th January 2004.
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
Relevance Feedback & Query Expansion
ACM/JETT Workshop - August 4-5, :Design of Classes using CRC cards.
Use the buttons on the top to navigate through the presentation 1 Next Menu.
Context-aware Generation of User Interface Containers for Mobile devices Francisco J. Martínez Ruiz 1,2, Jean Vanderdonckt 1 and Jaime Muñoz Arteaga 3.
Project Supervisor: Dr. Sanath Jayasena Project Coordinator: Mr. Shantha Fernando Athukorala A.U.B Dissanayake C.P. Kumara M.G.C.P. Priyadarshana G.V.J.
O X Click on Number next to person for a question.
© S Haughton more than 3?
© Arjen P. de Vries Arjen P. de Vries Fascinating Relationships between Media and Text.
© Charles van Marrewijk, An Introduction to Geographical Economics Brakman, Garretsen, and Van Marrewijk.
1 Directed Depth First Search Adjacency Lists A: F G B: A H C: A D D: C F E: C D G F: E: G: : H: B: I: H: F A B C G D E H I.
1 Evaluations in information retrieval. 2 Evaluations in information retrieval: summary The following gives an overview of approaches that are applied.
Twenty Questions Subject: Twenty Questions
Take from Ten First Subtraction Strategy -9 Click on a number below to go directly to that type of subtraction problems
Linking Verb? Action Verb or. Question 1 Define the term: action verb.
Energy & Green Urbanism Markku Lappalainen Aalto University.
David Walker Ottawa TMG Users Group 15 March 2014.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Software processes 2.
Chapter 10 Software Testing
Past Tense Probe. Past Tense Probe Past Tense Probe – Practice 1.
This, that, these, those Number your paper from 1-10.
Document Clustering Carl Staelin. Lecture 7Information Retrieval and Digital LibrariesPage 2 Motivation It is hard to rapidly understand a big bucket.
1 First EMRAS II Technical Meeting IAEA Headquarters, Vienna, 19–23 January 2009.
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
11 = This is the fact family. You say: 8+3=11 and 3+8=11
Week 1.
We will resume in: 25 Minutes.
1 Unit 1 Kinematics Chapter 1 Day
O X Click on Number next to person for a question.
1 PART 1 ILLUSTRATION OF DOCUMENTS  Brief introduction to the documents contained in the envelope  Detailed clarification of the documents content.
Learning to Recommend Questions Based on User Ratings Ke Sun, Yunbo Cao, Xinying Song, Young-In Song, Xiaolong Wang and Chin-Yew Lin. In Proceeding of.
Application of Ensemble Models in Web Ranking
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Interaction LBSC 734 Module 4 Doug Oard. Agenda Where interaction fits Query formulation Selection part 1: Snippets  Selection part 2: Result sets Examination.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Presentation transcript:

1 Ranked-Listed or Categorized Results in IR Zheng Zhu, Ingemar J. Cox, Mark Levene Birkbeck College, University of London UCL

2 Content Motivation Methodology Results Conclusions

3 The motivation Improve navigational experience for both normal users and users of handheld devices. Intuitively, we would expect grouping documents to reduce search time.

4 Introduction We quantify the benefits of grouping documents based on classification. We study how the benefits of grouping degrade with classification errors. We take into account errors that arise from both the user and the classifier.

5 The methodology Three types of simulated user model: 1.The user knows the class. 2.The user doesnt know the class. 3.The user thinks he knows the class. Two classification scenarios: 1.Correct classification 2.misclassification

6 The methodology To measure the benefits, we define: –class rank. –document rank. For ranked-list results, scroll rank is used

7 The Methodology For categorized results, based on different user models and operation scenarios, we define: –In-Class Rank(ICR), –Scrolled-Classification Rank(SCR), –Out-Class/Scroll-Class Rank(OSCR) –Out-Class/Revert Rank(ORR).

8 The methodology query doc1 doc2 doc3 doc4 doc5 doc6 doc7 doc1 doc2 doc3 doc4 doc5 doc6 doc7 Class1: Class2: SR=6 ICR=1+3=4 SCR=1+3=4 ORR=2+4+6 =12 OSCR=2+4+ 4=10

9 The methodology Simulated user/target Correctly classified misclassified Knows classICROSCR or ORR Does not know class SCR or SR Thinks knows class OSCR or ORR

10 The methodology Known-Item Search (Target Testing), followed by comparison of the ranks. Given a document, we generate a query so that the target document appears within a designated range of scroll rank.

11 The implementation Open Directory Project provides an oracle for classification so that we can control both user and machine error. Search Engine is based on Lucene, which is an open source tool.

12 The ideal case with an Oracle

13 KNN Classifier

14 More realistic scenario

15 Conclusions Classification-based display can improve users interaction with SE However, this depends on the user strategy: –The hybrid strategy has the best performance. –Using a hybrid strategy, performance degrades gracefully with errors

16 Thanks!

17

18

19 Reference Kummamuru, K., Lotlikar, R., Roy, S., Singal, K., Krishnapuram, R.: A hierarchi-cal monothetic document clustering algorithm for summarization and browsing search results. In: Proceedings of the 13th International Conference on World Wide Web, pp. 658–665 (2004) Chen, H., Dumais, S.: Bring order to the web: Automatically categorizing search results. In: CHI 2000: Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 145–152. ACM Press, New York (2000)