Jun Li, Peng Zhang, Yanan Cao, Ping Liu, Li Guo Chinese Academy of Sciences State Grid Energy Institute, China Efficient Behavior Targeting Using SVM Ensemble.

Presentation transcript:

Efficient Behavior Targeting Using SVM Ensemble Indexing
Jun Li, Peng Zhang, Yanan Cao, Ping Liu, Li Guo
Chinese Academy of Sciences State Grid Energy Institute, China

Behavior Targeting (BT) uses users' historical behavior data to select the most relevant ads for display. [Figure: example from Yahoo! Research, showing user behavior data, targeted users, and behavior targeting ads.]

Regression for BT: Poisson regression model (Ye Chen, eBay, 2009). x: ad clicks and views, page views, search queries and clicks. y: click-through rate (CTR). Source: Ye Chen et al., Large-scale behavior targeting (KDD'09 best paper award). [Figure: for each ad category, view data and click data each follow a Poisson distribution, with one Poisson regression on views and one on clicks.]
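The regression setup on this slide can be illustrated with a small sketch. It is not the model from Chen et al.: the log-linear link, the plain gradient-ascent fit, and the function names `fit_poisson`, `predict_rate`, and `predict_ctr` are all assumptions made for illustration. The only idea taken from the slide is that views and clicks each get their own Poisson regression and that CTR is the target.

```python
import math

def fit_poisson(X, y, lr=0.05, epochs=500):
    """Fit a log-linear Poisson regression by gradient ascent (illustrative only)."""
    n_features = len(X[0])
    w = [0.0] * n_features
    for _ in range(epochs):
        grad = [0.0] * n_features
        for xi, yi in zip(X, y):
            # Predicted Poisson mean for this example: exp(w . x)
            lam = math.exp(sum(wj * xj for wj, xj in zip(w, xi)))
            for j, xj in enumerate(xi):
                grad[j] += (yi - lam) * xj
        w = [wj + lr * gj / len(X) for wj, gj in zip(w, grad)]
    return w

def predict_rate(w, x):
    """Expected count under the fitted model."""
    return math.exp(sum(wj * xj for wj, xj in zip(w, x)))

def predict_ctr(w_click, w_view, x, eps=1e-9):
    """Assumed CTR estimate: expected clicks divided by expected views."""
    return predict_rate(w_click, x) / (predict_rate(w_view, x) + eps)

# Hypothetical toy data: a single constant (bias) feature per user.
X = [[1.0], [1.0], [1.0], [1.0]]
w_view = fit_poisson(X, [10, 10, 10, 10])
w_click = fit_poisson(X, [1, 1, 1, 1])
ctr = predict_ctr(w_click, w_view, [1.0])
```

With real behavior features the learning rate would need tuning, which is in line with the parameter-tuning difficulty the next slide raises.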

Limitations: (1) Parameter tuning is very difficult. (2) The Poisson assumption does not always hold for real-world behavior data. (3) Clicks are typically several orders of magnitude fewer than views. (4) User interests are not fixed, but rather transient.

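Classification for BT: SVM for classification. Example 1: 3 users on Nikon ( ad a [Figure: view data and click data per ad category. Users with view and click data are labeled (+); users with view but no click data are labeled (-). The labeled data feeds an SVM for classification.] Addresses challenges 1, 2, 3.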
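The labeling rule on this slide (view and click is positive, view without click is negative) can be sketched as follows. The event tuple layout, the function name `label_users`, and the sample category names are hypothetical; the slide does not specify a data format.

```python
def label_users(events, ad_category):
    """Label users for one ad category from (user_id, category, action) events.

    action is assumed to be either "view" or "click".
    Returns {user_id: +1 or -1}; users with no event in the category are omitted.
    """
    viewed, clicked = set(), set()
    for user_id, category, action in events:
        if category != ad_category:
            continue
        if action == "view":
            viewed.add(user_id)
        elif action == "click":
            clicked.add(user_id)
    # +1: the user clicked an ad in this category; -1: viewed but never clicked.
    return {u: (1 if u in clicked else -1) for u in viewed | clicked}

# Hypothetical events for illustration.
events = [
    ("u1", "camera", "view"),
    ("u1", "camera", "click"),
    ("u2", "camera", "view"),
    ("u3", "shoes", "view"),
]
labels = label_users(events, "camera")
```

The resulting labels, paired with each user's behavior features, would form the training set for a per-category SVM.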

Classification for BT: ensemble SVM on data streams. Merits: no complicated parameters; no statistical assumptions; a dynamic model on data streams (addresses challenge 4).
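A minimal sketch of an ensemble over a data stream, assuming a fixed-size sliding window of classifiers and unweighted majority voting. The slide does not say how members are weighted or retired, so both choices are assumptions, as are the names `StreamEnsemble` and `linear_classifier`.

```python
from collections import deque

def linear_classifier(weights, bias=0.0):
    """Return a decision function over sparse feature dicts (hypothetical helper)."""
    def decide(x):
        return sum(weights.get(f, 0.0) * v for f, v in x.items()) + bias
    return decide

class StreamEnsemble:
    """Keeps the most recent max_size classifiers and predicts by majority vote."""

    def __init__(self, max_size):
        # deque(maxlen=...) discards the oldest member when a new one is appended.
        self.members = deque(maxlen=max_size)

    def add(self, classifier):
        self.members.append(classifier)

    def predict(self, x):
        votes = sum(1 if clf(x) >= 0 else -1 for clf in self.members)
        return 1 if votes >= 0 else -1  # ties (and an empty ensemble) go to +1

# Hypothetical usage: older classifiers trained on stale interests get replaced.
neg = linear_classifier({"a": -1.0})
pos = linear_classifier({"a": 1.0})
ens = StreamEnsemble(max_size=2)
ens.add(neg)
ens.add(neg)
before = ens.predict({"a": 1.0})
ens.add(pos)
ens.add(pos)
after = ens.predict({"a": 1.0})
```

Replacing old members as new data chunks arrive is one way to follow transient user interests without refitting a single global model.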

Limitations: the time cost of ensemble prediction is heavy for online computing. Time cost: A (advertisers) * W (ensemble size) * N (support vectors) * T (features). Example 2: we collect 2 million behavior events in 1 minute (W = 10), and computing the predictions takes 53 minutes.
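The arithmetic behind this slide can be worked through directly. The figures for Example 2 come from the slide; the values passed to `naive_cost` are made-up placeholders, since the slide gives no values for A, N, or T.

```python
def naive_cost(advertisers, ensemble_size, support_vectors, features):
    """Multiplications per incoming event under the A * W * N * T cost model."""
    return advertisers * ensemble_size * support_vectors * features

# Placeholder sizes, for illustration only.
cost_example = naive_cost(advertisers=2, ensemble_size=10,
                          support_vectors=100, features=50)

# Example 2 from the slide: 2 million events arrive in 1 minute,
# but predicting them takes 53 minutes.
events = 2_000_000
arrival_seconds = 60
prediction_seconds = 53 * 60

arrival_rate = events / arrival_seconds      # about 33,333 events per second
service_rate = events / prediction_seconds   # about 629 events per second
slowdown = prediction_seconds / arrival_seconds  # prediction lags arrival 53x
```

Because the service rate is far below the arrival rate, the backlog grows without bound, which is the motivation for indexing the ensemble.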

Solutions: construct an index structure for the ensemble SVM. Why does the index work? It trades space for time; features are shared among multiple support vectors; and support vectors have a sparse structure. Mapping to text indexing: a support vector maps to a document, its features map to text terms, and the ensemble SVM maps to a document set. Source: P. Zhang et al., knowledge index for online data streams (KDD 2011 & ICDM 2011).

The index structure: the SVM-index structure. Example 3: based on Example 1, consider an SVM with 3 support vectors. [Figure: ensemble information, support vectors, and an inverted hashing table; time complexity O(T).]

The index structure: operations.
– Search: predict the label of each incoming user data x. Step 1: search support vectors in the left inverted indexes. Step 2: calculate x's class label.
– Insert: integrate new classifiers into the ensemble.
– Delete: drop outdated classifiers from the ensemble.
Memory: see our source code.
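The three operations can be sketched with a plain inverted index from features to support vectors. This is a simplified illustration, not the authors' SVM-index: it assumes linear-kernel SVMs, sparse support vectors stored as dicts, and unweighted majority voting, and the class name `EnsembleIndex` and the sample feature names are hypothetical.

```python
from collections import defaultdict

class EnsembleIndex:
    """Inverted index over the support vectors of a linear-SVM ensemble (sketch)."""

    def __init__(self):
        self.postings = defaultdict(dict)  # feature -> {(clf_id, sv_id): value}
        self.coef = {}                     # (clf_id, sv_id) -> alpha_i * y_i
        self.bias = {}                     # clf_id -> bias term b

    def insert(self, clf_id, support_vectors, coefs, bias):
        """Integrate a new classifier into the ensemble."""
        self.bias[clf_id] = bias
        for sv_id, (sv, c) in enumerate(zip(support_vectors, coefs)):
            self.coef[(clf_id, sv_id)] = c
            for feature, value in sv.items():
                self.postings[feature][(clf_id, sv_id)] = value

    def delete(self, clf_id):
        """Drop an outdated classifier from the ensemble."""
        self.bias.pop(clf_id, None)
        for key in [k for k in self.coef if k[0] == clf_id]:
            del self.coef[key]
        for feature in list(self.postings):
            plist = self.postings[feature]
            for key in [k for k in plist if k[0] == clf_id]:
                del plist[key]
            if not plist:
                del self.postings[feature]

    def search(self, x):
        """Predict the label of x, touching only support vectors that share a feature."""
        dots = defaultdict(float)
        # Step 1: look up support vectors through the inverted lists.
        for feature, xv in x.items():
            for key, sv_value in self.postings.get(feature, {}).items():
                dots[key] += xv * sv_value
        # Step 2: combine dot products into per-classifier scores, then vote.
        scores = dict(self.bias)
        for (clf_id, sv_id), dot in dots.items():
            scores[clf_id] += self.coef[(clf_id, sv_id)] * dot
        votes = sum(1 if s >= 0 else -1 for s in scores.values())
        return 1 if votes >= 0 else -1

idx = EnsembleIndex()
idx.insert("c1", [{"nikon": 1.0, "lens": 1.0}, {"shoes": 1.0}], [1.0, -1.0], 0.0)
```

Each feature shared by several support vectors is looked up once per query, and support vectors with no feature in common with x are never visited, which is where the saving over the naive loop comes from.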

Experiments. Data sets: search engine data. Comparisons: – Poisson – E-SVM – E-Index (our method)

Comparisons: observations. E-Index has sub-linear prediction time. E-SVM consumes more memory.

Comparisons: ensemble models are more accurate than the Poisson regression model.

Comparisons: the index method can significantly improve efficiency, especially when the ensemble size is large.

Related work. Behavior targeting: regression models vs. classification models. Stream indexing: Boolean expression indexing in publish/subscribe systems. Ensemble models: concept drifting.

Conclusions. Contributions: (1) Identify and address the prediction efficiency problem of ensemble models for behavior targeting. (2) Convert the ensemble SVM model to a document set, and propose a new type of inverted text index structure to achieve sub-linear prediction time. Future work: index more complicated SVM models with non-linear kernels.

For source code, visit our website: streamming.org/homepages/lijun.html
Questions?