AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student.

Slides:



Advertisements
Similar presentations
AT&T Labs - Research An Information-Theoretic Approach to Traffic Matrix Estimation Yin Zhang, Matthew Roughan, Carsten Lund – AT&T Research David Donoho.
Advertisements

Impact of Interference on Multi-hop Wireless Network Performance Kamal Jain, Jitu Padhye, Venkat Padmanabhan and Lili Qiu Microsoft Research Redmond.
MS&E 211 Quadratic Programming Ashish Goel. A simple quadratic program Minimize (x 1 ) 2 Subject to: -x 1 + x 2 ≥ 3 -x 1 – x 2 ≥ -2.
Clean Slate Design for the Internet Designing a Predictable Backbone Network with Valiant Load Balancing NSF 100 x 100 Clean.
1 Efficient and Robust Streaming Provisioning in VPNs Z. Morley Mao David Johnson Oliver Spatscheck Kobus van der Merwe Jia Wang.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
Graph Laplacian Regularization for Large-Scale Semidefinite Programming Kilian Weinberger et al. NIPS 2006 presented by Aggeliki Tsoli.
Infocom 2003 An Approach to Alleviate Link Overload as Observed on an IP Backbone Tuesday, April 1 st Infocom 2003 Sundar Iyer 1,2, Supratik Bhattacharrya.
SIGCOMM 2003 Making Intra-Domain Routing Robust to Changing and Uncertain Traffic Demands: Understanding Fundamental Tradeoffs David Applegate Edith Cohen.
Visual Recognition Tutorial
The Cache Location Problem IEEE/ACM Transactions on Networking, Vol. 8, No. 5, October 2000 P. Krishnan, Danny Raz, Member, IEEE, and Yuval Shavitt, Member,
Iterative closest point algorithms
Branch and Bound Similar to backtracking in generating a search tree and looking for one or more solutions Different in that the “objective” is constrained.
Traffic Engineering With Traditional IP Routing Protocols
Chapter 7 Network Flow Models.
1 Engineering Computation Part 4. 2 Enrique Castillo University of Cantabria An algorithm that permits solving many problems in Algebra. Applications.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Traffic Matrix Estimation for Traffic Engineering Mehmet Umut Demircin.
On a New Internet Traffic Matrix (Completion) Problem
1University of Adelaide Network Tomography and Internet Traffic Matrices Matthew Roughan School of Mathematical Sciences University of Adelaide.
Modeling Traffic in St. Louis By Julia Greenberger.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Flow Models and Optimal Routing. How can we evaluate the performance of a routing algorithm –quantify how well they do –use arrival rates at nodes and.
1 Meeyoung Cha (KAIST) Sue Moon (KAIST) Chong-Dae Park (KAIST) Aman Shaikh (AT&T Labs – Research) IEEE INFOCOM 2005 Poster Session Positioning Relay Nodes.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.
On the Construction of Data Aggregation Tree with Minimum Energy Cost in Wireless Sensor Networks: NP-Completeness and Approximation Algorithms National.
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Network Aware Resource Allocation in Distributed Clouds.
Spatio-Temporal Compressive Sensing Yin Zhang The University of Texas at Austin Joint work with Matthew Roughan.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
National Accounts and SAM Estimation Using Cross-Entropy Methods Sherman Robinson.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Network Anomography Yin Zhang – University of Texas at Austin Zihui Ge and Albert Greenberg – AT&T Labs Matthew Roughan – University of Adelaide IMC 2005.
Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.
April 4th, 2002George Wai Wong1 Deriving IP Traffic Demands for an ISP Backbone Network Prepared for EECE565 – Data Communications.
1 Network Measurements and Sampling Nick Duffield, Carsten Lund, and Mikkel Thorup AT&T Labs-Research, Florham Park, NJ.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
CpSc 881: Machine Learning
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Classification Course web page: vision.cis.udel.edu/~cv May 14, 2003  Lecture 34.
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
Machine Learning 5. Parametric Methods.
NetQuest: A Flexible Framework for Large-Scale Network Measurement Lili Qiu University of Texas at Austin Joint work with Han Hee Song.
1 ISMA Backbone Traffic Inference MAKE SYSTEMS THE NETWORK RESOURCE PLANNING COMPANY ISP Backbone Traffic Inference Methods to Support Traffic Engineering.
STATIC ANALYSIS OF UNCERTAIN STRUCTURES USING INTERVAL EIGENVALUE DECOMPOSITION Mehdi Modares Tufts University Robert L. Mullen Case Western Reserve University.
A Brief Maximum Entropy Tutorial Presenter: Davidson Date: 2009/02/04 Original Author: Adam Berger, 1996/07/05
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
Chance Constrained Robust Energy Efficiency in Cognitive Radio Networks with Channel Uncertainty Yongjun Xu and Xiaohui Zhao College of Communication Engineering,
Scoring Rules, Generalized Entropy and Utility Maximization Victor Richmond R. Jose Robert F. Nau Robert L. Winkler The Fuqua School of Business Duke University.
The Message Passing Communication Model David Woodruff IBM Almaden.
Placing Relay Nodes for Intra-Domain Path Diversity Meeyoung Cha Sue Moon Chong-Dae Park Aman Shaikh Proc. of IEEE INFOCOM 2006 Speaker 游鎮鴻.
Learning Theory Reza Shadmehr Distribution of the ML estimates of model parameters Signal dependent noise models.
Forward and Backward Deviation Measures and Robust Optimization Peng Sun (Duke) with Xin Chen (UIUC) and Melvyn Sim (NUS)
Dimension reduction (2) EDR space Sliced inverse regression Multi-dimensional LDA Partial Least Squares Network Component analysis.
Constraint-Based Routing
L12. Network optimization
Nick McKeown High Performance Networking Group Stanford University
Presentation transcript:

AT&T Labs - Research Internet Measurement Conference Of October, 2003 Miami, Florida, USA Date for student travel grant applications: Sept 5th

AT&T Labs - Research An Information-Theoretic Approach to Traffic Matrix Estimation Yin Zhang, Matthew Roughan, Carsten Lund – AT&T Research David Donoho – Stanford Shannon Lab

AT&T Labs - Research Want to know demands from source to destination Problem Have link traffic measurements A B C

AT&T Labs - Research Example App: reliability analysis Under a link failure, routes change want to find an traffic invariant A B C

AT&T Labs - Research Approach Principle * “Don’t try to estimate something if you don’t have any information about it” zMaximum Entropy yEntropy is a measure of uncertainty xMore information = less entropy yTo include measurements, maximize entropy subject to the constraints imposed by the data yImpose the fewest assumptions on the results zInstantiation: Maximize “relative entropy” yMinimum Mutual Information

AT&T Labs - Research Mathematical Formalism Only measure traffic at links router link 1 link 2 link 3 Traffic y 1

AT&T Labs - Research Mathematical Formalism router route 2 route 1 route 3 Problem: Estimate traffic matrix (x’s) from the link measurements (y’s) Traffic y 1 Traffic matrix element x 1

AT&T Labs - Research Mathematical Formalism router route 2 route 1 route 3 Problem: Estimate traffic matrix (x’s) from the link measurements (y’s)

AT&T Labs - Research Mathematical Formalism router route 2 route 1 route 3 Problem: Estimate traffic matrix (x’s) from the link measurements (y’s)

AT&T Labs - Research Mathematical Formalism router route 2 route 1 route 3 For non-trivial network UNDERCONSTRAINED y = Ax Routing matrix

AT&T Labs - Research Regularization zWant a solution that satisfies constraints: y = Ax yMany more unknowns than measurement: O(N 2 ) vs O(N) yUnderconstrained system yMany solutions satisfy the equations yMust somehow choose the “best” solution zSuch (ill-posed linear inverse) problems occur in yMedical imaging: e.g CAT scans ySeismology yAstronomy zStatistical intuition => Regularization yPenalty function J(x) ysolution:

AT&T Labs - Research How does this relate to other methods? zPrevious methods are just particular cases of J(x) zTomogravity (Zhang, Roughan, Greenberg and Duffield) yJ(x) is a weighted quadratic distance from a gravity model zA very natural alternative yStart from a penalty function that satisfies the “maximum entropy” principle yMinimum Mutual Information

AT&T Labs - Research Minimum Mutual Information (MMI) zMutual Information I(S,D) yInformation gained about Source from Destination yI(S,D) = -relative entropy with respect to independent S and D I(S,D) = 0 S and D are independent p(D|S) = p(D) gravity model zNatural application of principle * yAssume independence in the absence of other information yAggregates have similar behavior to network overall zWhen we get additional information (e.g. y = Ax) yMaximize entropy  Minimize I(S,D) (subject to constraints) yJ(x) = I(S,D) equivalent

AT&T Labs - Research MMI in practice zIn general there aren’t enough constraints zConstraints give a subspace of possible solutions y = Ax

AT&T Labs - Research MMI in practice zIndependence gives us a starting point y = Ax independent solution

AT&T Labs - Research MMI in practice zFind a solution which ySatisfies the constraint yIs closest to the independent solution solution Distance measure is the Kullback-Lieber divergence

AT&T Labs - Research Is that it? zNot quite that simple yNeed to do some networking specific things ye.g. conditional independence to model hot-potato routing zCan be solved using standard optimization toolkits yTaking advantage of sparseness of routing matrix A zBack to tomogravity yConditional independence = generalized gravity model yQuadratic distance function is a first order approximation to the Kullback-Leibler divergence yTomogravity is a first-order approximation to MMI

AT&T Labs - Research Results – Single example z±20% bounds for larger flows zAverage error ~11% zFast (< 5 seconds) zScales: yO(100) nodes

AT&T Labs - Research More results tomogravity method simple approximation >80% of demands have <20% error Large errors are in small flows

AT&T Labs - Research Other experiments zSensitivity yVery insensitive to lambda ySimple approximations work well zRobustness yMissing data yErroneous link data yErroneous routing data zDependence on network topology yVia Rocketfuel network topologies zAdditional information yNetflow yLocal traffic matrices

AT&T Labs - Research Dependence on Topology clique star (20 nodes)

AT&T Labs - Research Additional information – Netflow

AT&T Labs - Research Local traffic matrix (George Varghese) for reference previous case 0% 1% 5% 10%

AT&T Labs - Research Conclusion zWe have a good estimation method yRobust, fast, and scales to required size yAccuracy depends on ratio of unknowns to measurements yDerived from principle zApproach gives some insight into other methods yWhy they work – regularization yShould provide better idea of the way forward zAdditional insights about the network and traffic yTraffic and network are connected zImplemented yUsed in AT&T’s NA backbone yAccurate enough in practice

AT&T Labs - Research Additional Slides

AT&T Labs - Research Results zMethodology yUse netflow based partial (~80%) traffic matrix ySimulate SNMP measurements using routing sim, and y = Ax yCompare estimates, and true traffic matrix zAdvantage yRealistic network, routing, and traffic yComparison is direct, we know errors are due to algorithm not errors in the data yCan do controlled experiments (e.g. introduce known errors) zData yOne hour traffic matrices (don’t need fine grained data) y506 data sets, comprising the majority of June 2002 yIncludes all times of day, and days of week

AT&T Labs - Research Robustness (input errors)

AT&T Labs - Research Robustness (missing data)

AT&T Labs - Research Point-to-multipoint We don’t see whole Internet – What if an edge link fails? Point-to-point traffic matrix isn’t invariant

AT&T Labs - Research Point-to-multipoint zIncluded in this approach zImplicit in results above zExplicit results worse yAmbiguity in demands in increased yMore demands use exactly the same sets of routes zuse in applications is better Point-to-point Point-to-multipoint Link failure analysis

AT&T Labs - Research Independent model

AT&T Labs - Research Conditional independence zInternet routing is asymmetric zA provider can control exit points for traffic going to peer networks peer links access links

AT&T Labs - Research Conditional independence peer links access links zInternet routing is asymmetric zA provider can control exit points for traffic going to peer networks zHave much less control of where traffic enters

AT&T Labs - Research Conditional independence

AT&T Labs - Research Minimum Mutual Information (MMI) zMutual Information I(S,D)=0 yInformation gained about S from D yI(S,D) = relative entropy with respect to independence yCan also be given by Kullback-Leibler information divergence zWhy this model yIn the absence of information, let’s assume no information yMinimal assumption about the traffic yLarge aggregates tend to behave like overall network?

AT&T Labs - Research Dependence on Topology Unknowns per RelativeErrors (%) NetworkPoPsLinks measurement GeographicRandom Exodus* Sprint* Abovenet* StarN2(N-1)N/2= CliqueNN(N-1)10.2 AT&T * These are not the actual networks, but only estimates made by Rocketfuel

AT&T Labs - Research yBayesian (e.g. Tebaldi and West) xJ(x) = -log  (x), where  (x) is the prior model yMLE (e.g. Vardi, Cao et al, …) xIn their thinking the prior model generates extra constraints xEqually, can be modeled as a (complicated) penalty function Uses deviations from higher order moments predicted by model

AT&T Labs - Research Acknowledgements zLocal traffic matrix measurements yGeorge Varghese zPDSCO optimization toolkit for Matlab yMichael Saunders zData collection yFred True, Joel Gottlieb zTomogravity yAlbert Greenberg and Nick Duffield