Tuning Before Feedback: Combining Ranking Discovery and Blind Feedback for Robust Retrieval* Weiguo Fan, Ming Luo, Li Wang, Wensi Xi, and Edward A. Fox.

Presentation transcript:

Tuning Before Feedback: Combining Ranking Discovery and Blind Feedback for Robust Retrieval*
Weiguo Fan, Ming Luo, Li Wang, Wensi Xi, and Edward A. Fox
Digital Library Research Laboratory, Virginia Tech
*This research is supported by the National Science Foundation under Grant Numbers IIS , DUE and DUE

Outline
- Introduction
- Research Questions
- Approach: Ranking Tuning + Blind Feedback
- Experiment Results
- Conclusion

Introduction
- Ranking functions play an important role in IR performance.
- Blind feedback (pseudo-relevance feedback) has been found very useful for ad hoc retrieval.
- Why not combine ranking function optimization with blind feedback to improve robustness?

Research Questions
- Does blind feedback work even better on fine-tuned ranking functions than on traditional ranking functions such as Okapi BM25?
- Does the type of query (very short vs. very long) have any impact on the combination approach?
- Can the discovered ranking function, in combination with blind feedback, extrapolate well to new, unseen queries?

Our Approach
- Use ARRANGER, a Genetic Programming-based discovery engine, to perform the ranking function tuning [Fan 2003tkde, Fan 2004ip&m, Fan 2004jasist].
- Combine ranking tuning and feedback.
- Test on different types of queries.

RF Discovery Problem
- Input: training data (feedback)
- Process: ranking function discovery
- Output: ranking function f

Ranking Function Optimization
- "Ranking function tuning is an art!" (Paul Kantor)
- Why not adaptively discover the RF by Genetic Programming?
  - Huge search space
  - Discrete objective function
  - Modeling advantage
- What is GP?
  - A problem-solving system designed on the principles of evolution and heredity
  - Widely used for structure discovery, functional form discovery, and other data mining and optimization tasks
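The "discrete objective function" point can be illustrated with average precision, a common IR measure. The slide does not say which fitness measure ARRANGER uses, so the choice of average precision here is an assumption, and the helper names `average_precision` and `ap_for_scores` are hypothetical. The sketch shows that the measure depends only on the ordering of documents: nudging a score changes nothing until two documents swap ranks, which is why gradient-style tuning is awkward and a search method such as GP is attractive.

```python
def average_precision(ranked_relevance):
    """Average precision over a ranked list of booleans (True = relevant).

    Assumes every relevant document appears somewhere in the list.
    """
    hits, total = 0, 0.0
    for rank, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def ap_for_scores(scores, relevance):
    """Rank documents by descending score, then compute average precision."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return average_precision([relevance[i] for i in order])


relevance = [True, False, True]
ap_a = ap_for_scores([3.0, 2.0, 1.0], relevance)  # ranking: relevant, non-relevant, relevant
ap_b = ap_for_scores([3.1, 2.0, 1.0], relevance)  # score nudged, same ranking
ap_c = ap_for_scores([3.0, 1.0, 2.0], relevance)  # ranking: relevant, relevant, non-relevant
```

Here `ap_a` and `ap_b` are identical because the ordering is unchanged, while `ap_c` jumps to a perfect score once the second relevant document moves up.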

Genetic Algorithms/Programming
- Representation:
  - Vectors of bit strings or real numbers for GA
  - Complex data structures (trees, arrays) for GP
- Genetic transformation:
  - Reproduction
  - Crossover
  - Mutation
- IR applications: [Gordon’88, ’91], [Chen’98a, ’98b], [Pathak’00], etc.

Essential GP Components
Component: Meaning
- Terminals: Leaf nodes in the tree structure (e.g., x, y).
- Functions: Non-leaf nodes used to combine the leaf nodes. Commonly numerical operations: +, -, *, /, log, sqrt.
- Fitness function: The objective function GP aims to optimize.
- Reproduction: A genetic operator that copies the individuals with the best fitness values directly into the population of the next generation, without going through the crossover operation.
- Crossover: A genetic operator aiming to improve the diversity as well as the genetic fitness of the population. See the example on the next slide.
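To make the terminal/function distinction concrete, here is a minimal sketch of a ranking function stored as an expression tree. The nested-tuple encoding, the `evaluate` helper, the "protected" division, and the sample statistics are all illustrative assumptions, not ARRANGER's actual representation.

```python
import operator


def protected_div(a, b):
    """Division that returns 0.0 on a zero denominator (a common GP convention)."""
    return a / b if b != 0 else 0.0


# Function set: the non-leaf nodes of the tree.
FUNCTIONS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": protected_div,
}


def evaluate(tree, stats):
    """Evaluate a tree that is a terminal name, a constant, or (op, left, right)."""
    if isinstance(tree, tuple):
        op, left, right = tree
        return FUNCTIONS[op](evaluate(left, stats), evaluate(right, stats))
    if isinstance(tree, str):
        return stats[tree]  # terminal: look up the statistic by name
    return tree  # numeric constant


# The tree for tf * (N / df); terminals are the leaves "tf", "N", "df".
tree = ("*", "tf", ("/", "N", "df"))
stats = {"tf": 3.0, "df": 10.0, "N": 1000.0}  # hypothetical term statistics
score = evaluate(tree, stats)
```

With these made-up statistics the tree evaluates to 3 * (1000 / 10) = 300.0.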

Example of Crossover in GP
Generation N (parents):
- Parent1: tf*(tf+df)
- Parent2: N/df+df
Generation N+1 (children, after crossover):
- Child1: tf*(N/df)
- Child2: (tf*df)+df
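Subtree crossover can be sketched in a few lines. The parents below are the slide's tf*(tf+df) and N/df+df, encoded as nested tuples; the encoding, the helper names, and the choice of crossover points are assumptions made for illustration. With the points chosen here, the first child is tf*(N/df) and the second is (tf+df)+df; other crossover points give other children.

```python
def get_subtree(tree, path):
    """Follow a path of child indices (1 = left, 2 = right) down the tree."""
    for i in path:
        tree = tree[i]
    return tree


def replace_subtree(tree, path, new):
    """Return a copy of the tree with the node at `path` replaced by `new`."""
    if not path:
        return new
    i = path[0]
    return tree[:i] + (replace_subtree(tree[i], path[1:], new),) + tree[i + 1:]


def crossover(parent1, path1, parent2, path2):
    """Swap the subtree at path1 in parent1 with the subtree at path2 in parent2."""
    sub1 = get_subtree(parent1, path1)
    sub2 = get_subtree(parent2, path2)
    return (replace_subtree(parent1, path1, sub2),
            replace_subtree(parent2, path2, sub1))


parent1 = ("*", "tf", ("+", "tf", "df"))   # tf*(tf+df)
parent2 = ("+", ("/", "N", "df"), "df")    # N/df+df

# Swap parent1's right subtree (tf+df) with parent2's left subtree (N/df).
child1, child2 = crossover(parent1, (2,), parent2, (1,))
```

Because the trees are immutable tuples, the parents are left untouched and can be reused for reproduction in the same generation.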

The ARRANGER Engine
1. Split the training data into training and validation sets.
2. Generate an initial population of random "ranking functions".
3. Evaluate the fitness of each "ranking function" in the population and record the 10 best ones.
4. If the stopping criteria are not met, generate the next generation of the population by genetic transformation and go to Step 3.
5. Validate the recorded best "ranking functions" and select the best one as the RF.
Flowchart: Start, Initialize Population, Evaluate Fitness, Stop? (if no: Apply Crossover and evaluate again; if yes: Validate and Output), End.
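The five steps above can be sketched as a small, self-contained loop. Everything specific here is an assumption for illustration: the tuple tree encoding, the pairwise-ordering fitness, the depth cap, the toy data, and the population settings. It uses only reproduction and crossover (no mutation) and is not ARRANGER's actual implementation.

```python
import random

TERMINALS = ["tf", "df", "N"]
OPS = {
    "+": lambda a, b: a + b,
    "-": lambda a, b: a - b,
    "*": lambda a, b: a * b,
    "/": lambda a, b: a / b if b != 0 else 0.0,  # protected division
}
MAX_DEPTH = 6  # assumed cap so trees cannot grow without bound


def random_tree(rng, depth=2):
    """Grow a random expression tree of terminals and binary operators."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice(TERMINALS)
    return (rng.choice(list(OPS)), random_tree(rng, depth - 1), random_tree(rng, depth - 1))


def evaluate(tree, stats):
    if isinstance(tree, str):
        return stats[tree]
    op, left, right = tree
    return OPS[op](evaluate(left, stats), evaluate(right, stats))


def depth(tree):
    return 0 if isinstance(tree, str) else 1 + max(depth(tree[1]), depth(tree[2]))


def paths(tree, prefix=()):
    """Yield the path (tuple of child indices) to every node in the tree."""
    yield prefix
    if isinstance(tree, tuple):
        yield from paths(tree[1], prefix + (1,))
        yield from paths(tree[2], prefix + (2,))


def get(tree, path):
    for i in path:
        tree = tree[i]
    return tree


def put(tree, path, new):
    if not path:
        return new
    i = path[0]
    return tree[:i] + (put(tree[i], path[1:], new),) + tree[i + 1:]


def crossover(a, b, rng):
    """Swap one randomly chosen subtree between two parents."""
    pa, pb = rng.choice(list(paths(a))), rng.choice(list(paths(b)))
    ca, cb = put(a, pa, get(b, pb)), put(b, pb, get(a, pa))
    # Fall back to the parent when a child exceeds the depth cap.
    return (ca if depth(ca) <= MAX_DEPTH else a, cb if depth(cb) <= MAX_DEPTH else b)


def pairwise_fitness(tree, docs):
    """Fraction of (relevant, non-relevant) pairs the tree orders correctly."""
    rel = [evaluate(tree, s) for s, r in docs if r]
    non = [evaluate(tree, s) for s, r in docs if not r]
    pairs = len(rel) * len(non)
    return sum(x > y for x in rel for y in non) / pairs if pairs else 0.0


def evolve(train, valid, pop_size=20, generations=5, keep=10, seed=0):
    rng = random.Random(seed)
    population = [random_tree(rng) for _ in range(pop_size)]        # step 2
    best = []
    for _ in range(generations):
        scored = sorted(((pairwise_fitness(t, train), t) for t in population),
                        key=lambda p: p[0], reverse=True)           # step 3
        best = sorted(best + scored, key=lambda p: p[0], reverse=True)[:keep]
        parents = [t for _, t in scored[: pop_size // 2]]
        population = parents[:2]                                    # reproduction
        while len(population) < pop_size:                           # step 4
            population.extend(crossover(*rng.sample(parents, 2), rng))
        population = population[:pop_size]
    # Step 5: pick the recorded candidate that does best on validation data.
    return max((t for _, t in best), key=lambda t: pairwise_fitness(t, valid))


# Step 1: toy data, split into training and validation portions.
data_rng = random.Random(1)
docs = []
for _ in range(40):
    s = {"tf": float(data_rng.randint(1, 10)),
         "df": float(data_rng.randint(1, 100)), "N": 1000.0}
    docs.append((s, s["tf"] / s["df"] > 0.1))  # made-up relevance rule
train, valid = docs[:30], docs[30:]
best_tree = evolve(train, valid)
```

Recording candidates across generations and choosing among them on held-out data, rather than taking the single best training individual, is what steps 3 and 5 describe; it guards against a function that only fits the training queries.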


Blind Feedback
- Automatically adds more terms to a user's query to enhance search engine performance, by assuming the top-ranked documents are relevant.
- Some examples:
  - Rocchio (performs best in our experiment)
  - Dec-Hi
  - Kullback-Leibler Divergence (KLD)
  - Chi-Square
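As an illustration of the Rocchio variant, the sketch below expands a query by treating the top-k documents of a first-pass ranking as relevant. The function name `rocchio_expand`, the term-weight dictionaries, and the parameter values (`alpha`, `beta`, `k`, `n_terms`) are assumptions; the slides do not give the settings used in the experiments. Since blind feedback has no judged non-relevant documents, the negative Rocchio term is omitted.

```python
from collections import Counter


def rocchio_expand(query, ranked_docs, k=2, n_terms=2, alpha=1.0, beta=0.75):
    """Expand a query with Rocchio, treating the top-k ranked docs as relevant.

    query: dict mapping term -> weight.
    ranked_docs: list of dicts (term -> weight), already sorted by the
    first-pass ranking function.
    """
    top = ranked_docs[:k]
    # Centroid of the pseudo-relevant documents.
    centroid = Counter()
    for doc in top:
        for term, weight in doc.items():
            centroid[term] += weight / len(top)

    new_query = Counter({t: alpha * w for t, w in query.items()})
    # Add only the n_terms strongest terms that are not already in the query.
    candidates = [(w, t) for t, w in centroid.items() if t not in query]
    for weight, term in sorted(candidates, reverse=True)[:n_terms]:
        new_query[term] += beta * weight
    # Original query terms are reinforced by the centroid as well.
    for term in query:
        new_query[term] += beta * centroid.get(term, 0.0)
    return dict(new_query)


query = {"retrieval": 1.0}
ranked_docs = [
    {"retrieval": 2.0, "ranking": 4.0, "feedback": 2.0},
    {"retrieval": 2.0, "ranking": 2.0},
    {"cooking": 5.0},  # ranked third, so ignored when k=2
]
expanded = rocchio_expand(query, ranked_docs, k=2, n_terms=1)
```

In the two-stage setting described in these slides, `ranked_docs` would come from the tuned ranking function, and the expanded query would then be run again with that same function.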

An Integrated Model

Experiment Setting
- Data: 2003 Robust Track data (from TREC 6, 7, 8)
- Training queries: 150 old queries from TREC 6, 7, 8
- Test queries: 50 very hard queries + 50 new queries

The Results on 150 Training Queries
Run No.                        Desc        Short
Okapi without BF (Baseline)
Okapi with BF                  (+10.4%)    (+8.7%)
RF 1 without BF                (+15.6%)    (+9.1%)
RF 1 with BF                   (+28.8%)    (+21.3%)

Results on Test Queries (1)

Results on Test Queries (2)

Conclusions
- Blind feedback works well on GP-trained queries.
- The ranking function combined with blind feedback works with new queries.
- The two-stage model responds differently to Desc queries (slightly better) and Long queries.

Thank You! Q&A?