Output URL Bidding Panagiotis Papadimitriou, Hector Garcia-Molina, (Stanford University) Ali Dasdan, Santanu Kolay (Ebay Inc)

Slides:



Advertisements
Similar presentations
Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.
Advertisements

Data Mining and Text Analytics Advertising Laura Quinn.
Output URL Bidding Panagiotis Papadimitriou, Hector Garcia-Molina, (Stanford University) Ali Dasdan, Santanu Kolay (Ebay Inc) Related papers: VLDB 2011,
BiG-Align: Fast Bipartite Graph Alignment
Ziv Bar-YossefMaxim Gurevich Google and Technion Technion TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA.
Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.
Simrank++: Query Rewriting through link analysis of the click graph Ioannis Antonellis Hector Garcia-Molina
Subscription Subsumption Evaluation for Content-Based Publish/Subscribe Systems Hojjat Jafarpour, Bijit Hore, Sharad Mehrotra, and Nalini Venkatasubramanian.
1 Polynomial Church-Turing thesis A decision problem can be solved in polynomial time by using a reasonable sequential model of computation if and only.
Detecting Near Duplicates for Web Crawling Authors : Gurmeet Singh Mank Arvind Jain Anish Das Sarma Presented by Chintan Udeshi 6/28/ Udeshi-CS572.
CSCI 5708: Query Processing I Pusheng Zhang University of Minnesota Feb 3, 2004.
Learning to Advertise. Introduction Advertising on the Internet = $$$ –Especially search advertising and web page advertising Problem: –Selecting ads.
Retrieval Evaluation: Precision and Recall. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity.
J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.
Query Biased Snippet Generation in XML Search Yi Chen Yu Huang, Ziyang Liu, Yi Chen Arizona State University.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
1 External Sorting for Query Processing Yanlei Diao UMass Amherst Feb 27, 2007 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
Deep-Web Crawling “Enlightening the dark side of the web”
A REVIEW OF FEATURE SELECTION METHODS WITH APPLICATIONS Alan Jović, Karla Brkić, Nikola Bogunović {alan.jovic, karla.brkic,
SA2014.SIGGRAPH.ORG SPONSORED BY. SA2014.SIGGRAPH.ORG SPONSORED BY Approximate Pyramidal Shape Decomposition Ruizhen Hu Honghua Li Hao Zhang Daniel Cohen-Or.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Topology Design for Service Overlay Networks with Bandwidth Guarantees Sibelius Vieira* Jorg Liebeherr** *Department of Computer Science Catholic University.
Supervised Design Space Exploration by Compositional Approximation of Pareto Sets Hung-Yi Liu 1, Ilias Diakonikolas 2, Michele Petracca 1, and Luca P.
1 Algebraic Structure in Almost-Symmetries Igor Markov, Univ. of Michigan Presented by Ian Gent, St. Andrews.
 An important problem in sponsored search advertising is keyword generation, which bridges the gap between the keywords bidded by advertisers and queried.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Ex-MATE: Data-Intensive Computing with Large Reduction Objects and Its Application to Graph Mining Wei Jiang and Gagan Agrawal.
1 Artificial Evolution: From Clusters to GRID Erol Şahin Cevat Şener Dept. of Computer Engineering Middle East Technical University Ankara.
Clustering-based Collaborative filtering for web page recommendation CSCE 561 project Proposal Mohammad Amir Sharif
Bayesian Sets Zoubin Ghahramani and Kathertine A. Heller NIPS 2005 Presented by Qi An Mar. 17 th, 2006.
« Pruning Policies for Two-Tiered Inverted Index with Correctness Guarantee » Proceedings of the 30th annual international ACM SIGIR, Amsterdam 2007) A.
Bug Localization with Machine Learning Techniques Wujie Zheng
Interpreting Advertiser Intent in Sponsored Search BHANU C VATTIKONDA, SANTHOSH KODIPAKA, HONGYAN ZHOU, VACHA DAVE, SAIKAT GUHA, ALEX C SNOEREN 1.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Introduction to Algorithms By Mr. Venkatadri. M. Two Phases of Programming A typical programming task can be divided into two phases: Problem solving.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
Dynamic Covering for Recommendation Systems Ioannis Antonellis Anish Das Sarma Shaddin Dughmi.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Generating RCPSP instances with Known Optimal Solutions José Coelho Generator and generated instances in:
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Search Engine Architecture
Autumn Web Information retrieval (Web IR) Handout #1:Web characteristics Ali Mohammad Zareh Bidoki ECE Department, Yazd University
Which is better (A or B)? Jim Jansen College of Information Sciences and Technology The Pennsylvania State University
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
LOGO 1 Corroborate and Learn Facts from the Web Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Shubin Zhao, Jonathan Betz (KDD '07 )
1.5 Solving Inequalities. Write each inequality using interval notation, and illustrate each inequality using the real number line.
A BRIEF INTRODUCTION TO CACHE LOCALITY YIN WEI DONG 14 SS.
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
Advisor: Koh Jia-Ling Nonhlanhla Shongwe EFFICIENT QUERY EXPANSION FOR ADVERTISEMENT SEARCH WANG.H, LIANG.Y, FU.L, XUE.G, YU.Y SIGIR’09.
A New Algorithm for Inferring User Search Goals with Feedback Sessions.
1 Implicant Expansion Methods Used in The BOOM Minimizer Petr Fišer, Jan Hlavička Czech Technical University, Karlovo nám. 13, Prague 2
Crowd Fraud Detection in Internet Advertising Tian Tian 1 Jun Zhu 1 Fen Xia 2 Xin Zhuang 2 Tong Zhang 2 Tsinghua University 1 Baidu Inc. 2 1.
TFA: A Tunable Finite Automaton for Regular Expression Matching Author: Yang Xu, Junchen Jiang, Rihua Wei, Yang Song and H. Jonathan Chao Publisher: ACM/IEEE.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Custom Computing Machines for the Set Covering Problem Paper Written By: Christian Plessl and Marco Platzner Swiss Federal Institute of Technology, 2002.
Where (Online) to Display Your Ad Jim Jansen College of Information Sciences and Technology The Pennsylvania State University
Logical Agents Chapter 7. Outline Knowledge-based agents Propositional (Boolean) logic Equivalence, validity, satisfiability Inference rules and theorem.
GENERATING RELEVANT AND DIVERSE QUERY PHRASE SUGGESTIONS USING TOPICAL N-GRAMS ELENA HIRST.
Optimal Relay Placement for Indoor Sensor Networks Cuiyao Xue †, Yanmin Zhu †, Lei Ni †, Minglu Li †, Bo Li ‡ † Shanghai Jiao Tong University ‡ HK University.
TU/e Algorithms (2IL15) – Lecture 13 1 Wrap-up lecture.
Ariel Fuxman, Panayiotis Tsaparas, Kannan Achan, Rakesh Agrawal (2008) - Akanksha Saxena 1.
Search Engine Architecture
Chapter 12: Query Processing
Objective of This Course
Graph Indexing for Shortest-Path Finding over Dynamic Sub-Graphs
Overview of Query Evaluation
Project Title: (Your project title here)
Computational Advertising and
Presentation transcript:

Output URL Bidding Panagiotis Papadimitriou, Hector Garcia-Molina, (Stanford University) Ali Dasdan, Santanu Kolay (Ebay Inc)

Search Engine Results Page (SERP) Organic Results Sponsored Ads Query Sponsored Search Ads 2

Keyword bidding Advertiser Search Engines the social network lord of the rings the matrix lotr III... # keywords = ~ 10K KEYWORDS 3

Example SERPs en.wikipedia.org/wiki/The_Social_Network en.wikipedia.org/wiki/The_Matrix en.wikipedia.org/wiki/The_Lord_of_the_rings the social network the matrix the lord of the rings lotr iii 4

Output URL bidding Advertiser Search Engines imdb.com AND wikipedia.org # URLs = 2 URLs 5

Topics Implementation Evaluation 6

Implementation challenge 7 SERP

Alternative implementation solutions 1. Serialization 8 O: Organic Search Component S: Sponsored Search Component Latency Simplicity SERP 2. Parallelization SERP O: Organic Search Comp. (Or + Op) Or’: Small replica of Or S: Sponsored Search Component V: Ad validation More resources No latency

Topics Implementation Evaluation 9

Bid language model Output Expression – e.g., a := (u1  u2)  u3  (h1  h2) – u: URL e.g., en.wikipedia.org/wiki/The_Social_Network – h: host e.g., en.wikipedia.org 10

How to evaluate/study output bidding? Use existing keyword campaigns to generate realistic output expressions to study 11 The social network lord of the rings the matrix lotr III … Output Expression Generator imdb.com AND wikipedia.org

INPUT: set of keywords R (from a keyword campaign) OUTPUT: expression a that “covers” R, i.e.,  q  R, matches(a, results of q) Generator input & output Candidate expressions Output Expression a 1 := u 1  u 2  u 3 a 2 := u 1  u 4 a 3 := u 5 12

1.Compactness Contain few URLs 2.Spill minimization Do not match “irrelevant” queries Which expression to select? Candidate expressions Output Expression Size |a| spill(a,R) a 1 := u 1  u 2  u 3 3{} a 2 := u 1  u 4 2{q 5 } a 3 := u 5 1{q 4,q 5, q 6 } 13

Output expression generation problem statement Query Set Output Cover min. γ|a| + (1-γ)|spill(a, R)| subj. to matches(a, q),  q  R γ : regularization parameter NP-hard to solve Reduction from Set Cover, Red- Blue Set Cover (see paper) Developed Greedy Algorithm (see paper) 14

Is spill always “bad”? Example: – q 1 : lord of the rings – q 2 : the matrix – q 3 : the social network – q 4 : … – q 5 : lotr – q 6 : … 15 Output Expression Size |a| spill(a,R) a 1 := u 1  u 2  u 3 3{} a 2 := u 1  u 4 2{q 5 } a 3 := u 5 1{q 4,q 5, q 6 } →(lord of the rings)

Spill may be good! Cluster queries Q using the bipartite graph (see paper) Divide spill(a, R) into: – positive: relevant – negative: irrelevant 16 Output Expression Size |a| spill(a,R) +- a 1 := u 1  u 2  u 3 3{} a 2 := u 1  u 4 2{q 5 }{} a 3 := u 5 1{q 5 }{q 4, q 6 }

Experimental evaluation goals Recall output expression looks like – a := (u1  u2)  u3  (h1  h2) Evaluation questions – URLs, hosts or mixed? – Specific or generic?(# conjuncts) – Long or compact? (# disjuncts) Comparison criteria – Compactness vs spill tradeoff – Positive vs negative spill 17

Experimental setup Dataset (from Yahoo query logs) – 2,251 ads – 13M queries, 63M URLs (7M hosts) 18 The social network lord of the rings the matrix lotr III … Output Expression Generator imdb.com AND wikipedia.org γ - URLs - hosts - mixed # conjuncts

Compactness vs spill tradeoff URLs, hosts or mixed? Mixed expression curves dominate others Specific vs generic? 2 conjuncts suffice Long or compact? Next slide… 19

Positive vs negative spill 20 For |a|>60, more than 50% of spill is positive URLs, hosts or mixed? Mixed expression curves dominate others Long or compact? |a|=70-80 suffice (1/3 the size of equivalent keyword set)

More experiments in paper Combining keyword and output bidding – E.g., a movie advertiser uses: 1. imdb.com  wikipedia.org cover 80% of queries 2. keyword1, keyword2, …cover 20% of queries Combined expressions – are as compact as output expressions – yield less (negative) spill than output expressions 21

Conclusions Output URL bidding can be implemented efficiently Advantages over keyword bidding – Bid compactness – More relevant queries (positive spill) Combining keyword and output bidding seems to be the most promising direction

Thank you! Contact: