Application of Ensemble Models in Web Ranking

Slides:



Advertisements
Similar presentations
You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advertisements

Advanced Piloting Cruise Plot.
Kapitel 21 Astronomie Autor: Bennett et al. Galaxienentwicklung Kapitel 21 Galaxienentwicklung © Pearson Studium 2010 Folie: 1.
Chapter 1 The Study of Body Function Image PowerPoint
1 Alexander Gelbukh Moscow, Russia. 2 Mexico 3 Computing Research Center (CIC), Mexico.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 5 Author: Julia Richards and R. Scott Hawley.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
UNITED NATIONS Shipment Details Report – January 2006.
A Novel Visualization Model for Web Search Results An Application of the Solar System Metaphor Tien N. Nguyen and Jin Zhang Electrical and Computer Engineering.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
1/25 Generic and Automatic Address Configuration for Data Center Networks 1 Kai Chen, 2 Chuanxiong Guo, 2 Haitao Wu, 3 Jing Yuan, 4 Zhenqian Feng, 1 Yan.
Electronic Resources in the EUI Library
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Determine Eligibility Chapter 4. Determine Eligibility 4-2 Objectives Search for Customer on database Enter application signed date and eligibility determination.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
ZMQS ZMQS
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
ABC Technology Project
Lecture 6: Boolean to Vector
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
© S Haughton more than 3?
VOORBLAD.
Text Categorization.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
“Start-to-End” Simulations Imaging of Single Molecules at the European XFEL Igor Zagorodnov S2E Meeting DESY 10. February 2014.
1 Evaluations in information retrieval. 2 Evaluations in information retrieval: summary The following gives an overview of approaches that are applied.
Copyright © 2013, 2009, 2006 Pearson Education, Inc.
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
Squares and Square Root WALK. Solve each problem REVIEW:
Traditional IR models Jian-Yun Nie.
Lets play bingo!!. Calculate: MEAN Calculate: MEDIAN
Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1 Section 5.4 Polynomials in Several Variables Copyright © 2013, 2009, 2006 Pearson Education, Inc.
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
1 First EMRAS II Technical Meeting IAEA Headquarters, Vienna, 19–23 January 2009.
Macromedia Dreamweaver MX 2004 – Design Professional Dreamweaver GETTING STARTED WITH.
Addition 1’s to 20.
25 seconds left…...
1 Atlantic Annual Viewing Trends Adults 35-54, Total TV, By Daypart Average Minute Audience (000) Average Weekly Reach (%) Average Weekly Hours Viewed.
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
CSE3201/4500 Information Retrieval Systems
A SMALL TRUTH TO MAKE LIFE 100%
1 Unit 1 Kinematics Chapter 1 Day
PSSA Preparation.
Essential Cell Biology
1 PART 1 ILLUSTRATION OF DOCUMENTS  Brief introduction to the documents contained in the envelope  Detailed clarification of the documents content.
How Cells Obtain Energy from Food
CpSc 3220 Designing a Database
Does one size really fit all? Evaluating classifiers in Bag-of-Visual-Words classification Christian Hentschel, Harald Sack Hasso Plattner Institute.
Autumn Web Information retrieval (Web IR) Handout #0: Introduction Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Autumn Web Information retrieval (Web IR) Handout #11:FICA: A Fast Intelligent Crawling Algorithm Ali Mohammad Zareh Bidoki ECE Department, Yazd.
Autumn Web Information retrieval (Web IR) Handout #14: Ranking Based on Click Through data Ali Mohammad Zareh Bidoki ECE Department, Yazd University.
Web Information retrieval (Web IR)
Web Information retrieval (Web IR)
Presentation transcript:

Application of Ensemble Models in Web Ranking Homa B. Hashemi Nasser Yazdani Azadeh Shakery Mahdi Pakdaman Naeini School of Electrical and Computer Engineering University of Tehran

Information Explosion

Web Challenges Huge size of information 25 billion pages Proliferation and dynamic nature Creation of New pages New links are created at rate 25% per week Heterogeneous contents HTML/Text/Audio/… Application of Ensemble Models in Web Ranking

Search Engine as A Tool http://seo-related.com/ Application of Ensemble Models in Web Ranking

Inside Search Engine Crawling Indexing Ranking

Inside Search Engine Crawling Indexing Ranking

Ranking Approaches Content-based (query dependent) TF, IDF BM25 Classical IR … Connectivity based (web) PageRank HITS Application of Ensemble Models in Web Ranking

Our General Framework … Query Retrieval Model Ensemble Model List 1 2 List N … Ensemble Model Final List Application of Ensemble Models in Web Ranking

Simple Ensemble Models Sum rule Add (normalized) values of different methods Product rule Multiply (normalized) values of different methods Borda rule Combination of ranking Application of Ensemble Models in Web Ranking

Complicated Ensemble Models OWA (Ordered Weighted Averaging) Click-Through Data SVM Use the distance from discriminating hyper plane as the measure for relevancy of a page to a specific query Application of Ensemble Models in Web Ranking

OWA operator the weights of each vector Application of Ensemble Models in Web Ranking

Simulated Click-Through Data How can we use the user behavior? 80% of user clicks are related to query Click-through data Application of Ensemble Models in Web Ranking

Simulated Click-Through Data (example) L(b) D1 D4 D7 D9 D2 d8

Simulated Click-Through Data (example) Interleaved results L(a,b) D1 D4 D3 D7 D2 D9 D5 D8 D6 L(b) D1 D4 D7 D9 D2 d8

Simulated Click-Through Data (example) Interleaved results L(a,b) D1  First D4 D3 D7 D2  Second D9 D5  Third D8 D6 L(b) D1 D4 D7 D9 D2 d8

Simulated Click-Through Data (example) Interleaved results L(a,b) D1  First D4 D3 D7 D2  Second D9 D5  Third D8 D6 L(b) D1 D4 D7 D9 D2 d8

Experimental Datasets LETOR benchmark (English) Microsoft Research Asia, 2007 DotIR benchmark (Persian) Iran Telecommunication Research Center (ITRC),2009 Application of Ensemble Models in Web Ranking

LETOR Benchmark – p@k Application of Ensemble Models in Web Ranking

LETOR Benchmark – MAP Application of Ensemble Models in Web Ranking

DotIR Benchmark – p@k

DotIR Benchmark – MAP Application of Ensemble Models in Web Ranking

Summary Motivation: Important role of Ranking algorithms Low precision of content and connectivity algorithms Solution: Use different Ensemble models to combine Ranking algorithms based on Learning Results: LETOR benchmark has been used for evaluation More research needed to be done on newly built DotIR collection Application of Ensemble Models in Web Ranking

LABS Application of Ensemble Models in Web Ranking

Reference Ali Mohammad Zareh Bidoki, Pedram Ghodsnia, Nasser Yazdani, “A3CRank: An Adaptive Ranking method based on Connectivity, Content and Click-through data”, Information Processing and Management, 2010. Ali Mohammad Zareh Bidoki, “Combination of Documents Features Based on Simulated Click-through Data”, ECIR 2009. Application of Ensemble Models in Web Ranking

Thank You Any Questions?