CFA: A Practical Prediction System for Video Quality Optimization

Slides:



Advertisements
Similar presentations
A Quest for an Internet Video Quality-of-Experience Metric
Advertisements

Junchen Jiang (CMU) Vyas Sekar (Stony Brook U)
1 Developing a Predictive Model for Internet Video Quality-of-Experience Athula Balachandran, Vyas Sekar, Aditya Akella, Srinivasan Seshan, Ion Stoica,
Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)
Contextual Advertising by Combining Relevance with Click Feedback D. Chakrabarti D. Agarwal V. Josifovski.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
Network Coding for Large Scale Content Distribution Christos Gkantsidis Georgia Institute of Technology Pablo Rodriguez Microsoft Research IEEE INFOCOM.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
CS Instance Based Learning1 Instance Based Learning.
Data Mining – Intro.
A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
Using Conviva 29 Aug Summary Who are we? What is the problem we needed to solve? How was Spark essential to the solution? What can Spark.
- Conviva Confidential - Understanding and Improving Video Quality Vyas Sekar, Ion Stoica, Hui Zhang.
New Challenges in Cloud Datacenter Monitoring and Management
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Chen Cai, Benjamin Heydecker Presentation for the 4th CREST Open Workshop Operation Research for Software Engineering Methods, London, 2010 Approximate.
SIGCOMM Outline  Introduction  Datasets and Metrics  Analysis Techniques  Engagement  View Level  Viewer Level  Lessons  Conclusion.
Developing a Predictive Model of Quality of Experience for Internet Video Athula Balachandran -CMU.
A Quest for an Internet Video Quality-of-Experience Metric A. Balachandran, V. Sekar, A. Akella, S. Seshan, I. Stoica and H. Zhang In Proceedings of the.
Physical Layer Informed Adaptive Video Streaming Over LTE Xiufeng Xie, Xinyu Zhang Unviersity of Winscosin-Madison Swarun KumarLi Erran Li MIT Bell Labs.
Aditya Akella The Performance Benefits of Multihoming Aditya Akella CMU With Bruce Maggs, Srini Seshan, Anees Shaikh and Ramesh Sitaraman.
DAQ: A New Paradigm for Approximate Query Processing Navneet Potti Jignesh Patel VLDB 2015.
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Elastic Pathing: Your Speed Is Enough to Track You Presented by Ali.
Analysing Clickstream Data: From Anomaly Detection to Visitor Profiling Peter I. Hofgesang Wojtek Kowalczyk ECML/PKDD Discovery.
BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data ACM EuroSys 2013 (Best Paper Award)
USE RECIPE INGREDIENTS TO PREDICT THE CATEGORY OF CUISINE Group 7 – MEI, Yan & HUANG, Chenyu.
Jan 17, 2001CSCI {4,6}900: Ubiquitous Computing1 Announcements Did you hear about the Microsoft site crash yesterday? Compiling in Solaris (gemini) gcc.
Real-Time Trip Information Service for a Large Taxi Fleet
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
TBAS: Enhancing Wi-Fi Authentication by Actively Eliciting Channel State Information Muye Liu, Avishek Mukherjee, Zhenghao Zhang, and Xiuwen Liu Florida.
Introduction to Machine Learning, its potential usage in network area,
SketchVisor: Robust Network Measurement for Software Packet Processing
Junchen Jiang, Rajdeep Das, Ganesh Ananthanarayanan, Philip A
Accelerating Peer-to-Peer Networks for Video Streaming
A CASE FOR A COORDINATED INTERNET VIDEO CONTROL PLANE
Clickprints on the Web: Are there Signatures in Web Browsing Data?
Data Mining – Intro.
Jacob R. Lorch Microsoft Research
Jian Wu (University of Michigan)
Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation Junchen Jiang (CMU) Shijie Sun (Tsinghua Univ.)
Science Behind Cross-device Conversion Tracking
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Video through a Crystal Ball:
Available Bit Rate Streaming
ECE 671 – Lecture 16 Content Distribution Networks
Video Summarization via Determinantal Point Processes (DPP)
Software Defined Networking (SDN)
DDoS Attack Detection under SDN Context
Aditya Ganjam, Bruce Maggs*, and Hui Zhang
Automating Profitable Growth™
Steve Zhang Armando Fox In collaboration with:
Paraskevi Raftopoulou, Euripides G.M. Petrakis
IT351: Mobile & Wireless Computing
Where Intelligence Lives & Intelligence Management
Replica Placement Heuristics of Application-level Multicast
Evaluation of Relational Operations: Other Techniques
Challenges with developing a Commercial P2P System
Topological Signatures For Fast Mobility Analysis
Memory-Based Learning Instance-Based Learning K-Nearest Neighbor
Efficient Aggregation over Objects with Extent
Conviva & Sky A real-world OTT video Quality of Experience case study
Provider Survey Peer-led sessions Initial remarks on local sites
Design and Implementation of OverLay Multicast Tree Protocol
Yu Guan, Chengyuan Zheng, Xinggong Zhang, Zongming Guo, Junchen Jiang
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
Presentation transcript:

CFA: A Practical Prediction System for Video Quality Optimization Junchen Jiang, Vyas Sekar, Henry Milner, Davis Shepherd, Ion Stoica, Hui Zhang

One-Minute Overview Prediction leads to dramatic quality improvement Predicting video quality is very challenging Persistence of critical features  CFA

Internet Video Quality Matters! 40% sessions 13% sessions User Engagement User Engagement Avg Bitrate Buffering ratio Better quality  Longer user Engagement  More revenues! A significant room of improvement!

New Paradigm: Centralized Control Platform Prediction Oracle Answer “What-if” questions. e.g., What if I use 400Kbps, Akamai? Local reactive adaptation is too slow Fundamentally crippled for initial selections A Case for Centralized Control Plane Real-time global network view  Prediction Oracle e.g., potentially 50% less re-buffering [SIGCOMM12,NSDI15] Local adaptation Internet 400Kbps 1Mbps 400Kbps 1Mbps

Key Missing Piece: How to Build a Prediction Oracle? CFA Prediction Oracle Our contribution: Critical Feature Analytics (CFA) Data-Driven Video Quality Prediction System Internet 400Kbps 1Mbps 400Kbps 1Mbps

Outline Motivation  Challenges of Video Quality Prediction System The CFA Approach Evaluation

Why is Building a Quality Prediction System Challenging? Trains a Quality Prediction Model Pred(quality of other sessions) Quality of other sessions Quality prediction for new sessions Challenge 1: Complex factors affect video quality  Need expressive models to capture these factors Challenge 2: Video quality changes quickly  Need to refresh predictions in near real-time (e.g., 30 sec)

Challenge 1: Complex relation between video quality and features Comcast NY Level3 CDN AT&T Akamai PIT Quality depends on combinations of features City ASN CDN Video Device Quality NY Comcast Level3 “Foo” “bar” PIT AT&T Akamai

Such feature combinations differ cross clients & time City ASN CDN Video Device Quality 3:00PM NY Comcast Level3 “foo” “bar” PIT Akamai 7:00PM Combinational effects: Quality depends on combinations of multiple features Spatial diversity: Quality-determining features differ cross clients Model drift: Quality-determining features change over time

Challenge 2: Video Quality Changes Quickly Using fresh quality measurement is critical!

Needs both model expressiveness & fast update Not expressive enough to model complex factors e.g., NaiveBayes, Decision Tree Algorithm for Problems that have “Persistent” critical features Update speed Fast Slow Simple ML CFA Needs tens of min to update model, Not interpretable e.g., SVM Better Complex ML Low High Expressiveness

Outline Motivation Challenges  The CFA Approach Evaluation

Strawman: Matching on all features Curse of dimensionality: The Basic CFA Workflow: Similar feature values short history  similar quality All historical sessions with observed quality Session under prediction Similar sessions to s Finding similar sessions Quality estimate (e.g., median) Quality prediction for s s Strawman: Matching on all features Accurate Reliable Matching on all features ✔ ✖ Curse of dimensionality: Hard to find sessions matching on all features

Insight to Find Similar Sessions: Critical Features Critical features: subset of features ultimately determines video quality City ASN CDN Content Device NY Comcast Level3 “foo” “bar” Quality F( ) Strawman: Matching on all features in the last minute ≈ F( ) NY Comcast Level3 * Use sessions matching “NY, Comcast, Level3, foo, bar” in last minute Enough sessions matching “NY, Comcast, Level3” in last minute Curse of dimensionality: Few sessions matching on all features in one minute s

The CFA Workflow Based on Critical Features Similar sessions to s Finding similar sessions Quality estimate (e.g., median) Quality prediction for s s Matching on Critical Features Finding Critical feature of s Accurate Reliable Matching on all features ✔ ✖ Matching on critical features How to get critical features?

Insight to Learn Critical Features: Critical Features are Persistent Insight: Critical features typically last for 10s minutes or longer. Persistence: 𝜟 𝑪𝒓𝒊𝒕𝒊𝒄𝒂𝒍𝑭𝒆𝒂𝒕𝒖𝒓𝒆𝒔 ≫ 𝜟 𝑸𝒖𝒂𝒍𝒊𝒕𝒚 Strawman: Learn Critical Features over last minute Curse of dimensionality: No data of ground truth Longer history  Enough sessions to construct ground-truth quality s CFA approach: Learn Critical Features over last hour

How to Estimate Quality with Fresh Updates? Similar sessions to s Finding similar sessions Quality estimate (e.g., median) Quality prediction for s s Matching on Critical Features A few seconds A few seconds Finding Critical feature of s Takes 10s minutes

CFA Approach to Fresh Updates Estimate quality (every tens of sec) Learn critical features (every tens of min) Learn critical features (every tens of min) Sequential workflow Enabled by Persistence of Critical Features Time Learn critical features (every tens of min) Learn critical features (every tens of min) Slow path Decoupled workflow Fast path

Putting Everything Together: CFA Implementation The C3 platform [NSDI’15] Backend cluster Learn critical features + estimate quality Update quality prediction per 10s of sec Geo-distributed frontend clusters Pick the (CDN, bitrate) of the best predicted quality and return it in 10s of ms Video clients

Outline Motivation Challenges The CFA Approach  Evaluation

Real-world A/B Testing 12.3% increase 32% reduction Substantial quality improvement by CFA.

CFA leads to better quality CFA vs. Strawman Prediction Algorithms (Decision tree, Naïve Bayes, kNN, Last-mile, ASN-based etc) 23% less error 16% higher bitrate CFA is more accurate CFA leads to better quality

Conclusion Higher video quality  Long user engagement  More revenues! Prediction has huge potential but is also challenging: Quality-determining features are complex, heterogeneous and dynamic. CFA uses domain-specific insights Video quality depends on a subset of persistent critical features. CFA leads to 30% less buffering ratio and 12% high bitrate Key takeaway: Prediction  Performance improvement Persistence of critical features  Accurate prediction