Spatio-Temporal Compressive Sensing Yin Zhang The University of Texas at Austin Joint work with Matthew Roughan.

Slides:



Advertisements
Similar presentations
Nonnegative Matrix Factorization with Sparseness Constraints S. Race MA591R.
Advertisements

Face Alignment by Explicit Shape Regression
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
Siddharth Choudhary.  Refines a visual reconstruction to produce jointly optimal 3D structure and viewing parameters  ‘bundle’ refers to the bundle.
Fast Bayesian Matching Pursuit Presenter: Changchun Zhang ECE / CMR Tennessee Technological University November 12, 2010 Reading Group (Authors: Philip.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
SIGCOMM 2003 Making Intra-Domain Routing Robust to Changing and Uncertain Traffic Demands: Understanding Fundamental Tradeoffs David Applegate Edith Cohen.
Robust Network Compressive Sensing
Sensitivity of PCA for Traffic Anomaly Detection Evaluating the robustness of current best practices Haakon Ringberg 1, Augustin Soule 2, Jennifer Rexford.
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
Robust Network Compressive Sensing Lili Qiu UT Austin NSF Workshop Nov. 12, 2014.
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
Traffic Engineering With Traditional IP Routing Protocols
NetQuest: A Flexible Framework for Internet Measurement Lili Qiu Joint work with Mike Dahlin, Harrick Vin, and Yin Zhang UT Austin.
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Multi-Scale Analysis for Network Traffic Prediction and Anomaly Detection Ling Huang Joint work with Anthony Joseph and Nina Taft January, 2005.
Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University.
Computing Sketches of Matrices Efficiently & (Privacy Preserving) Data Mining Petros Drineas Rensselaer Polytechnic Institute (joint.
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
Experimental Design for Practical Network Diagnosis Yin Zhang University of Texas at Austin Joint work with Han Hee Song and Lili.
Network Anomography Yin Zhang, Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement Conference 2005 Berkeley, CA, USA Presented by Huizhong.
Traffic Matrix Estimation for Traffic Engineering Mehmet Umut Demircin.
Classification and Prediction: Regression Analysis
Chapter 10 Real Inner Products and Least-Square (cont.)
On a New Internet Traffic Matrix (Completion) Problem
SVD(Singular Value Decomposition) and Its Applications
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Principled Regularization for Probabilistic Matrix Factorization Robert Bell, Suhrid Balakrishnan AT&T Labs-Research Duke Workshop on Sensing and Analysis.
COPE: Traffic Engineering in Dynamic Networks Hao Wang, Haiyong Xie, Lili Qiu, Yang Richard Yang, Yin Zhang, Albert Greenberg Yale University UT Austin.
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Non Negative Matrix Factorization
Training and Testing of Recommender Systems on Data Missing Not at Random Harald Steck at KDD, July 2010 Bell Labs, Murray Hill.
On AS-Level Path Inference Jia Wang (AT&T Labs Research) Joint work with Z. Morley Mao (University of Michigan, Ann Arbor) Lili Qiu (University of Texas,
Learning Stable Multivariate Baseline Models for Outbreak Detection Sajid M. Siddiqi, Byron Boots, Geoffrey J. Gordon, Artur W. Dubrawski The Auton Lab.
Low-rank By: Yanglet Date: 2012/12/2. Included Works. Yin Zhang, Lili Qiu ―Spatio-Temporal Compressive Sensing and Internet Traffic Matrices, SIGCOMM.
1 Passive Network Tomography Using Bayesian Inference Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang Microsoft Research Internet Measurement.
Network Anomography Yin Zhang – University of Texas at Austin Zihui Ge and Albert Greenberg – AT&T Labs Matthew Roughan – University of Adelaide IMC 2005.
Progress in identification of damping: Energy-based method with incomplete and noisy data Marco Prandina University of Liverpool.
Orthogonalization via Deflation By Achiya Dax Hydrological Service Jerusalem, Israel
1 University of Texas at Austin Machine Learning Group 图像与视频处理 计算机学院 Motion Detection and Estimation.
Efficient computation of Robust Low-Rank Matrix Approximations in the Presence of Missing Data using the L 1 Norm Anders Eriksson and Anton van den Hengel.
Direct Robust Matrix Factorization Liang Xiong, Xi Chen, Jeff Schneider Presented by xxx School of Computer Science Carnegie Mellon University.
Gap-filling and Fault-detection for the life under your feet dataset.
Sparse Signals Reconstruction Via Adaptive Iterative Greedy Algorithm Ahmed Aziz, Ahmed Salim, Walid Osamy Presenter : 張庭豪 International Journal of Computer.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
Endmember Extraction from Highly Mixed Data Using MVC-NMF Lidan Miao AICIP Group Meeting Apr. 6, 2006 Lidan Miao AICIP Group Meeting Apr. 6, 2006.
Efficient Gaussian Process Regression for large Data Sets ANJISHNU BANERJEE, DAVID DUNSON, SURYA TOKDAR Biometrika, 2008.
Bundle Adjustment A Modern Synthesis Bill Triggs, Philip McLauchlan, Richard Hartley and Andrew Fitzgibbon Presentation by Marios Xanthidis 5 th of No.
Rank Minimization for Subspace Tracking from Incomplete Data
Taming Internet Traffic Some notes on modeling the wild nature of OD flows Augustin Soule Kavé Salamatian Antonio Nucci Nina Taft Univ. Paris VI Sprintlabs.
D. Rincón, M. Roughan, W. Willinger – Towards a Meaningful MRA of Traffic Matrices 1/36 Towards a Meaningful MRA for Traffic Matrices D. Rincón, M. Roughan,
Model Fusion and its Use in Earth Sciences R. Romero, O. Ochoa, A. A. Velasco, and V. Kreinovich Joint Annual Meeting NSF Division of Human Resource Development.
Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy.
Kijung Shin Jinhong Jung Lee Sael U Kang
NetQuest: A Flexible Framework for Large-Scale Network Measurement Lili Qiu University of Texas at Austin Joint work with Han Hee Song.
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
Locations. Soil Temperature Dataset Observations Data is – Correlated in time and space – Evolving over time (seasons) – Gappy (Due to failures) – Faulty.
Facets: Fast Comprehensive Mining of Coevolving High-order Time Series Hanghang TongPing JiYongjie CaiWei FanQing He Joint Work by Presenter:Wei Fan.
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems Prof. Hao Zhu Dept. of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
SCREAM: Sketch Resource Allocation for Software-defined Measurement Masoud Moshref, Minlan Yu, Ramesh Govindan, Amin Vahdat (CoNEXT’15)
1 Network Tomography Using Passive End-to-End Measurements Lili Qiu Joint work with Venkata N. Padmanabhan and Helen J. Wang.
Dimension reduction (2) EDR space Sliced inverse regression Multi-dimensional LDA Partial Least Squares Network Component analysis.
Introduction to several works and Some Ideas Songcan Chen
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
SketchVisor: Robust Network Measurement for Software Packet Processing
Singular Value Decomposition
Dogu Arifler and Brian L. Evans
Backbone Traffic Engineering
Presentation transcript:

Spatio-Temporal Compressive Sensing Yin Zhang The University of Texas at Austin Joint work with Matthew Roughan University of Adelaide Walter Willinger AT&T Labs–Research Lili Qiu Univ. of Texas at Austin ACM SIGCOMM 2009 August 20, 2009

2 Q: How to fill in missing values in a matrix? –Traffic matrix –Delay matrix –Social proximity matrix

3 Internet Traffic Matrices Traffic Matrix (TM) –Gives traffic volumes between origins and destinations Essential for many networking tasks –what-if analysis, traffic engineering, anomaly detection Lots of prior research –Measurement, e.g. [FGLR+01, VE03] –Inference, e.g. [MTSB+02, ZRDG03, ZRLD03, ZRLD05, SLTP+06, ZGWX06] –Anomaly detection, e.g. [LCD04, ZGRG05, RSRD07]

4 Missing Values: Why Bother? Missing values are common in TM measurements –Direct measurement is infeasible/expensive –Measurement and data collection are unreliable –Anomalies/outliers hide non-anomaly-related traffic –Future traffic has not yet appeared The need for missing value interpolation –Many networking tasks are sensitive to missing values –Need non-anomaly-related traffic for diagnosis –Need predicted TMs in what-if analysis, traffic engineering, capacity planning, etc.

5 The Problem router route 1 route 3 route 2 link 2 link 1 link 3 x r,t : traffic volume on route r at time t

6 indirect: only measure at links The Problem router route 1 route 3 route 2 link 2 link 1 link 3 Interpolation: fill in missing values from incomplete and/or indirect measurements future anomaly missing

7 The Problem E.g., link loads only: AX=Y A : routing matrix; Y : link load matrix E.g., direct measurements only: M.*X=M.*D M(r,t)=1  X(r,t) exists; D : direct measurements router route 1 route 3 route 2 link 2 link 1 link 3 A (X)=B Challenge: In real networks, the problem is massively underconstrained!

8 Spatio-Temporal Compressive Sensing Idea 1: Exploit low-rank nature of TMs –Observation: TMs are low-rank [LPCD+04, LCD04]: X nxm  L nxr * R mxr T (r « n,m) Idea 2: Exploit spatio-temporal properties –Observation: TM rows or columns close to each other (in some sense) are often close in value Idea 3: Exploit local structures in TMs –Observation: TMs have both global & local structures

9 Spatio-Temporal Compressive Sensing Idea 1: Exploit low-rank nature of TMs –Technique: Compressive Sensing Idea 2: Exploit spatio-temporal properties –Technique: Sparsity Regularized Matrix Factorization (SRMF) Idea 3: Exploit local structures in TMs –Technique: Combine global and local interpolation

10 Compressive Sensing Basic approach: find X=LR T s.t. A(LR T )=B –(m+n)*r unknowns (instead of m*n ) Challenges –A(LR T )=B may have many solutions  which to pick? –A(LR T )=B may have zero solution, e.g. when X is approximately low-rank, or there is noise Solution: Sparsity Regularized SVD (SRSVD) –minimize | A (LR T ) – B| 2 // fitting error + (|L| 2 +|R| 2 ) // regularization –Similar to SVD but can handle missing values and indirect measurements

11 Sparsity Regularized Matrix Factorization Motivation –The theoretical conditions for compressive sensing to perform well may not hold on real-world TMs Sparsity Regularized Matrix Factorization –minimize | A (LR T ) – B| 2 // fitting error + (|L| 2 +|R| 2 ) // regularization + |S(LR T )| 2 // spatial constraint + |(LR T )T T | 2 // temporal constraint –S and T capture spatio-temporal properties of TMs –Can be solved efficiently via alternating least-squares

12 Spatio-Temporal Constraints Temporal constraint matrix T –Captures temporal smoothness –Simple choices suffice, e.g.: Spatial constraint matrix S –Captures which rows of X are close to each other –Challenge: TM rows are ordered arbitrarily –Our solution: use a initial estimate of X to approximate similarity between rows of X

13 Combining Global and Local Methods Local correlation among individual elements may be stronger than among TM rows/columns –S and T in SRMF are chosen to capture global correlation among entire TM rows or columns SRMF+KNN: combine SRMF with local interpolation –Switch to K-Nearest-Neighbors if a missing element is temporally close to observed elements

14 Generalizing Previous Methods Tomo-SRMF: find a solution that is close to LR T yet satisfies A (X)=B solution subspace A (X)=B Tomo-SRMF solution SRMF solution: LR T Tomo-SRMF generalizes the tomo-gravity method for inferring TM from link loads

15 Applications Inference (a.k.a. tomography) –Can combine both direct and indirect measurements for TM inference Prediction –What-if analysis, traffic engineering, capacity planning all require predicted traffic matrix Anomaly Detection –Project TM onto a low-dimensional, spatially & temporally smooth subspace ( LR T )  normal traffic Spatio-temporal compressive sensing provides a unified approach for many applications

16 Evaluation Methodology Data sets Metrics –Normalized Mean Absolute Error for missing values –Other metrics yield qualitatively similar results. NetworkDateDurationResolutionSize Abilene03/20031 week10 min.121x1008 Commercial ISP10/20063 weeks1 hour400x504 GEANT04/20051 week15 min.529x672

17 Algorithms Compared AlgorithmDescription Baseline Baseline estimate via rank-2 approximation SRSVD Sparsity Regularized SVD SRSVD-base SRSVD with baseline removal NMF Nonnegative Matrix Factorization KNN K-Nearest-Neighbors SRSVD-base+KNN Hybrid of SRSVD-base and KNN SRMF Sparsity Regularized Matrix Factorization SRMF+KNN Hybrid of SRMF and KNN Tomo-SRMF Generalization of tomo-gravity

18 Interpolation: Random Loss Our method is always the best Only ~20% error even with 98% loss Dataset: Abilene

19 Interpolation: Structured Loss Our method is always the best; sometimes dramatically better Only ~20% error even with 98% loss Dataset: Abilene

20 Tomography Performance Dataset: Commercial ISP Can halve the error of Tomo-Gravity by measuring only 2% elements!

21 Other Results Prediction –Taking periodicity into account helps prediction –Our method consistently outperforms other methods Smooth, low-rank approximation improves prediction Anomaly detection –Generalizes many previous methods E.g., PCA, anomography, time domain methods –Yet offers more Can handle missing values, indirect measurements Less sensitive to contamination in normal subspace No need to specify exact # of dimensions for normal subspace –Preliminary results also show better accuracy

22 Conclusion Spatio-temporal compressive sensing –Advances ideas from compressive sensing –Uses the first truly spatio-temporal model of TMs –Exploits both global and local structures of TMs General and flexible –Generalizes previous methods yet can do much more –Provides a unified approach to TM estimation, prediction, anomaly detection, etc. Highly effective –Accurate: works even with 90+% values missing –Robust: copes easily with highly structured loss –Fast: a few seconds on TMs we tested

23 Lots of Future Work Other types of network matrices –Delay matrices, social proximity matrices Better choices of S and T –Tailor to both applications and datasets Extension to higher dimensions –E.g., 3D: source, destination, time Theoretical foundation –When and why our approach works so well?

24 Thank you!

25 Alternating Least Squares Goal: minimize | A (LR T ) – B| 2 + (|L| 2 +|R| 2 ) Step 1: fix L and optimize R –A standard least-squares problem Step 2: fix R and optimize L –A standard least-squares problem Step 3: goto Step 1 unless MaxIter is reached