Large-Scale Real-Time Product Recommendation at Criteo

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Google News Personalization Scalable Online Collaborative Filtering
Large-scale Recommendations in a Dynamic Marketplace Jay Katukuri Rajyashree Mukherjee Tolga Konik Chu-Cheng Hsieh LSRS
Business Intelligence Systems
LIBRA: Lightweight Data Skew Mitigation in MapReduce
R and HDInsight in Microsoft Azure
Copyright © 2014 Criteo millions de prédictions par seconde Les défis de Criteo Nicolas Le Roux Scientific Program Manager - R&D.
Implicit Queries for Vitor R. Carvalho (Joint work with Joshua Goodman, at Microsoft Research)
Hadoop in the Wild CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook.
Girish Nathan Misha Bilenko Microsoft Azure Machine Learning How to Work with Large Datasets to Build Predictive Models.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Chapter 8 Logistic Regression 1. Introduction Logistic regression extends the ideas of linear regression to the situation where the dependent variable,
Scaling Distributed Machine Learning with the BASED ON THE PAPER AND PRESENTATION: SCALING DISTRIBUTED MACHINE LEARNING WITH THE PARAMETER SERVER – GOOGLE,
UC Berkeley Monitoring Hadoop through Tracing Andy Konwinski and Matei Zaharia.
Sponsors. Smart Apps with Azure ML CHRIS MCHENRY VP OF TECHNOLOGY, INTEGRO
Business Intelligence Systems
Real-Time Big Data Analytics From Deployment to Production 1 David Smith Revolution
Large-Scale Content-Based Image Retrieval Project Presentation CMPT 880: Large Scale Multimedia Systems and Cloud Computing Under supervision of Dr. Mohamed.
Big data analytics with R and Hadoop Chapter 5 Learning Data Analytics with R and Hadoop 데이터마이닝연구실 김지연.
E-Commerce Michael Andrianus – Vincentius
Microsoft ® Site Server Commerce Edition Jay Sauls Microsoft Consulting Services.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Big data analytics Rafal Lukawiecki Strategic Consultant Project Botticelli
GDG DevFest Central Italy Joint work with J. Feldman, S. Lattanzi, V. Mirrokni (Google Research), S. Leonardi (Sapienza U. Rome), H. Lynch (Google)
CS525: Big Data Analytics Machine Learning on Hadoop Fall 2013 Elke A. Rundensteiner 1.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Canadian Advertising in Action, 6th ed. Keith J. Tuckwell ©2003 Pearson Education Canada Inc Elements of the Internet World Wide Web World.
9 February 2000CHEP2000 Paper 3681 CDF Data Handling: Resource Management and Tests E.Buckley-Geer, S.Lammel, F.Ratnikov, T.Watts Hardware and Resources.
Chao Liu Internet Services Research Center Microsoft Research-Redmond.
Analytics from 330 million smartphones Sean Byrnes CTO & Co-founder.
CONFIDENTIAL1 Hidden Decision Trees to Design Predictive Scores – Application to Fraud Detection Vincent Granville, Ph.D. AnalyticBridge October 27, 2009.
Multiworld Testing Machine Learning for Contextual Decision-Making.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Near Real-Time Verification At The Forecast Systems Laboratory: An Operational Perspective Michael P. Kay (CIRES/FSL/NOAA) Jennifer L. Mahoney (FSL/NOAA)
Customer Views Home Page Your AccountView CartHelp Search for in Top-level list of catalogs, which will include “bikes,” “bike parts,” “clothing,” and.
Probabilistic Machine Learning in Computational Advertising Thore Graepel, Thomas Borchert, Ralf Herbrich and Joaquin Quiñonero Candela Online Services.
© 2009 Eyeblaster. All rights reserved What it means to you Channel Connect for Search.
Matthew Winter and Ned Shawa
Google News Personalization Big Data reading group November 12, 2007 Presented by Babu Pillai.
Site Technology TOI Fest Q Celebration From Keyword-based Search to Semantic Search, How Big Data Enables That?
The Value of Information in Real-Time Display Advertising Björn Hoppe 28. April 2015.
11 Why tune relevance Because we want to find the one single best item, among a large group of possible candidates….
Applications on Spark Prof. Harold Liu Beijing Institute of Technology December 2015.
Improving Search for Emerging Applications * Some techniques current being licensed to Bimaple Chen Li UC Irvine.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Risk Solutions & Research © Copyright IBM Corporation 2005 Default Risk Modelling : Decision Tree Versus Logistic Regression Dr.Satchidananda S Sogala,Ph.D.,
E-commerce in Your Inbox Product Recommendations at Scale
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Azure Machine Learning Introduction to Azure ML. Setting Expectations This presentation is for you if…  you hear the buzzword “Machine Learning” and.
Distributed Process Discovery From Large Event Logs Sergio Hernández de Mesa {
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Esri UC 2014 | Technical Workshop | Enhancing Web Map Performance in ArcGIS Online Julia Guard & Melanie Summers.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Hadoop in the Wild CMSC 491 Hadoop-Based Distributed Computing Spring 2016 Adam Shook.
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Utilizing “big Data” analytics for student success
A Simple Approach for Author Profiling in MapReduce
Mark Davey & Rich Brosius
Big Data is a Big Deal!.
Recommender Systems & Collaborative Filtering
Beijing Institute of Technology December 2015
Dynamics 365; What’s new and what’s not
Make Predictions Using Azure Machine Learning Studio
Enhancing Web Map Performance in ArcGIS Online
DATA SCIENCE Online Training at GoLogica
What is Commerce According to Dictionary.com
Movie Recommendation System
Azure Machine Learning
E-Distribution Service uses Cloud Infrastructure to Reduce Costs, Gain Flexibility, and Expand Globally “Windows Azure helps makes it simple, fast, and.
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Presentation transcript:

Large-Scale Real-Time Product Recommendation at Criteo Simon Dollé RecSys FR, December 1st, 2015

Catalog data Feed provided by the merchants User behavior data Large scale intent data All visits to merchant websites Page views, basket, sales events Ad display data Displayed and clicked ads

We buy Ad spaces

We buy Ad spaces We sell Clicks

We buy Ad spaces We sell Clicks that convert

We buy Ad spaces We sell Clicks that convert a lot

We buy Ad spaces We sell Clicks that convert a lot We take the risk

10 000 displays

10 000 displays leads to 50 clicks

10 000 displays leads to 50 clicks leads to 1 sale

3 billion ads/day 3 billion products

10ms to pick relevant products

7 data centers 15 000 servers 1200-node hadoop cluster

Catalog data 3B+ products Catalog data Feed provided by the merchants User behavior data Large scale intent data All visits to merchant websites Page views, basket, sales events Ad display data Displayed and clicked ads

Catalog data Browsing history 3B+ products 2B events / day Feed provided by the merchants User behavior data Large scale intent data All visits to merchant websites Page views, basket, sales events Ad display data Displayed and clicked ads

Catalog data Browsing history Ad display data 3B+ products 2B events / day Ad display data 20B events / day Catalog data Feed provided by the merchants User behavior data Large scale intent data All visits to merchant websites Page views, basket, sales events Ad display data Displayed and clicked ads

How do we do it ?

Recommend products for a user What we want: reco(user) = products 1B users x 3B products ! But we need to scale and keep it fresh What we can do : Pre-select products offline Refine scoring online to get final candidates

Bob saw orange shoes

Bob saw orange shoes Some candidate products Historical

Bob saw orange shoes Some candidate products Historical Most viewed

Bob saw orange shoes Some candidate products Historical Most viewed

Bob saw orange shoes Some candidate products Historical Most viewed Similar

Bob saw orange shoes Some candidate products Historical Most viewed Similar

Bob saw orange shoes Some candidate products Historical Most viewed Similar Complementary

Recommendation Service 20K qps

HADOOP 20K qps Recommendation Service 50B Browsing history Preselection computation Map-Reduce jobs 50B Browsing history

HADOOP 20K qps Recommendation Service Preselections 12h 500M 50B Preselection computation Map-Reduce jobs 50B Browsing history

Online: sources Similarities Most viewed Most bought

Online: merge of products Similarities Most viewed Most bought

ML model Logistic regression models because : They scale They are fast They can handle lots of features Product-specific User-specific User-product interactions Display-specific Product-specific: price, category User-specific: usersegment, user last category User-product interactions: time since last view, category match Display-specific: desktop vs mobile

HADOOP 20K qps Recommendation Service Preselections 12h 500M 50B Preselection computation Map-Reduce jobs 50B Browsing history

HADOOP 20K qps Recommendation Service Preselections 6h 12h 500M Preselection computation Map-Reduce jobs Prediction models 50B Browsing history

HADOOP 20K qps Recommendation Service Display, Click, Sale logs Preselections 6h 12h 500M HADOOP Preselection computation Map-Reduce jobs Prediction models 50B Browsing history

HADOOP 20K qps Recommendation Service Display, Click, Sale logs Preselections 6h 12h 500M HADOOP Preselection computation Map-Reduce jobs Prediction models 50B Browsing history

Online: scoring Similarities Most viewed Most bought 0,02 0,12 0,06 0,18 0,03 0,05 0,01 0,005 0,011 0,013 0,004 0,007

Online: scoring Similarities Most viewed Most bought 0,18 0,12 0,06 0,05 0,03 0,02 0,013 0,011 0,01 0,007 0,005 0,004

Online: candidates -50% SHOP SHOP SHOP SHOP 0,18 0,12 0,06 0,05 0,03 0,02 0,013 0,011 0,01 0,007 0,005 0,004

What’s next ?

What’s next for us: Upcoming challenges Long(er)-term user profiles

What’s next for us: Upcoming challenges Long(er)-term user profiles More and better product information (images, semantic, NLP)

What’s next for us: Upcoming challenges Long(er)-term user profiles More and better product information (images, semantic, NLP) Instant-update of similarities

What’s next for us: Upcoming challenges Long(er)-term user profiles More and better product information (images, semantic, NLP) Instant-update of similarities Joint product scoring (score full banner and not products independently)

What’s next for you: Fancy a try? On your own: We published datasets for click prediction 4GB display-click data: Kaggle challenge in 2014 http://bit.ly/1vgw2XC 1TB Display-Click data (industry’s largest dataset): http://bit.ly/1PyH4Vq 4 billion of observations 156 billion feature-value available on Microsoft Azure used by edX (UC Berkeley) With us ! http://labs.criteo.com/jobs/

Questions?

s.dolle@criteo.com @simondolle @recsysfr Thank you ! s.dolle@criteo.com @simondolle @recsysfr Credits: Creative Stall, Gilbert Bages