Optimal Sampling Strategies for Multiscale Stochastic Processes. Vinay Ribeiro, Rolf Riedi, Rich Baraniuk (Rice University)

Presentation transcript:

Motivation
- Probing for RTT (ping, TCP) and available bandwidth (pathload, pathChirp)
- Packet trace collection: traffic matrix estimation, overall traffic composition
- Routing/connectivity analysis: sample a few routing tables
- Sensor networks: deploy a limited number of sensors
- Goal: estimate a global (space/time) average from a limited number of local samples
- How should N samples be placed to best estimate the global quantity?
[Figure: probe packets placed on the time interval from 0 to T]

Multiscale Stochastic Processes
- Nodes at higher scales are averages over larger regions
- Powerful structure: models long-range-dependent (LRD) traffic, image data, natural phenomena
- Root: global average; leaves: local samples
- Choose N leaf nodes to give the best linear estimate (in terms of mean squared error) of the root node
- Which leaf pattern is best: bunched, uniform, or exponential?
[Figure: quad-tree with the root at the top, scale j in between, and the leaves at the bottom]

Independent Innovations Trees
- Each node is a linear combination of its parent and an independent random innovation
- Recursive top-to-bottom algorithm
- Concave optimization for the split at each node
- Polynomial-time algorithm: O(N × depth + (# tree nodes))
- Uniformly spaced leaves are optimal if the innovations are i.i.d. within each scale
[Figure: N leaves at a node split into n for one child and N-n for the other]
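The recursive top-to-bottom placement can be sketched as follows. This is a minimal illustration rather than the authors' implementation: it assumes a binary tree of a given depth with leaves numbered left to right, and the names `place_leaves` and `split` are hypothetical. The default split rule sends floor(n/2) leaves to the left child, which corresponds to the i.i.d.-innovations case; a different rule could be passed in for other trees.

```python
def place_leaves(depth, n, split=lambda n: n // 2, offset=0):
    """Return sorted indices of n leaves chosen in a binary tree of the given depth.

    Assumption (not from the slides): binary tree, leaves numbered
    offset .. offset + 2**depth - 1 from left to right.
    split(n) says how many of the n leaves go to the left child.
    """
    if n > 2 ** depth:
        raise ValueError("more leaves requested than the subtree contains")
    if n == 0:
        return []
    if depth == 0:
        return [offset]  # a single leaf, and n == 1 here
    half = 2 ** (depth - 1)  # number of leaves under each child
    left = split(n)
    # Clamp so that neither child is asked for more leaves than it has.
    left = max(n - half, min(left, half))
    right = n - left
    return (place_leaves(depth - 1, left, split, offset)
            + place_leaves(depth - 1, right, split, offset + half))


# Four leaves in a depth-3 tree (8 leaves) end up evenly spaced.
print(place_leaves(3, 4))  # [1, 3, 5, 7]
```

Each call does constant work besides recursing, and subtrees that receive zero leaves return immediately, so the recursion only follows branches that actually hold selected leaves.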

Covariance Trees
- Distance: two leaf nodes have distance j if their lowest common ancestor is at scale j
- Covariance tree: the covariance between leaf nodes with distance j is $c_j$ (a function of distance only), and the covariance between the root and any leaf node is constant
- Positive correlation progression: $c_j > c_{j+1}$
- Negative correlation progression: $c_j < c_{j+1}$

Covariance Tree Result
- Optimality proof: simply construct an independent innovations tree with a similar correlation structure
- Worst-case proof: based on eigenanalysis
Correlation progression   Optimal              Worst case
Positive                  uniform              bunch
Negative                  bunch (conjecture)   uniform
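A small numerical check of the uniform versus bunched comparison, as a sketch under stated assumptions. The tree is assumed binary with 8 leaves numbered left to right, scales are counted upward from the leaves (so two sibling leaves have distance 1), and the covariance values in `positive` and `negative` are made up for illustration. Because the root has the same covariance with every leaf, the linear estimation error shrinks as the sum of the elements of the inverse leaf covariance matrix grows, so the code compares that sum. All function names are hypothetical.

```python
def leaf_distance(a, b):
    """Scale of the lowest common ancestor of leaves a and b (leaves at scale 0)."""
    return (a ^ b).bit_length()


def leaf_covariance(leaves, c):
    """Covariance matrix of the chosen leaves; c[j] is the covariance at distance j."""
    return [[c[leaf_distance(a, b)] for b in leaves] for a in leaves]


def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def inverse_sum(leaves, c):
    """Sum of all elements of the inverse covariance matrix of the chosen leaves."""
    s = leaf_covariance(leaves, c)
    return sum(solve(s, [1.0] * len(leaves)))


# Illustrative covariances c[0..3]; c[0] is the leaf variance.
positive = [1.0, 0.6, 0.4, 0.2]  # closer leaves are more correlated
negative = [1.0, 0.1, 0.2, 0.3]  # closer leaves are less correlated
uniform = [0, 2, 4, 6]
bunched = [0, 1, 2, 3]

for name, c in (("positive", positive), ("negative", negative)):
    print(name, "uniform:", inverse_sum(uniform, c), "bunched:", inverse_sum(bunched, c))
```

With these example values the uniform pattern gives the larger sum (lower error) under the positive progression and the smaller sum (higher error) under the negative one, in line with the table above.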

Numerical Results
- Covariance trees with fractional Gaussian noise correlation structure
- Plots of normalized MSE vs. number of leaves for different leaf patterns
[Figure: two panels, "Positive correlation progression" and "Negative correlation progression"]

Future Directions
- Sampling
  - more general tree structures
  - non-linear estimates
  - non-tree stochastic processes
  - leverage related work in statistics (Bellhouse et al.)
- Internet inference
  - how to determine the correlation between traffic traces, routing tables, etc.
- Sensor networks
  - jointly optimize with other constraints such as power transmission

Water-Filling
- X: arbitrary set of leaf nodes, described by its size and by the number of its leaves on the left and on the right
- Objective: the linear minimum mean squared error of estimating the root using X
- Repeat at the next lower scale with N replaced by $l^*_N$ (left) and $N - l^*_N$ (right)
- Result: if the innovations are identically distributed within each scale, then distribute the leaves uniformly, $l^*_N = \lfloor N/2 \rfloor$
[Figure: water-filling with the curves $f_L(l)$ and $f_R(l)$]
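A water-filling split at a single node can be sketched as below. The slide does not define $f_L$ and $f_R$ in the surviving text, so this sketch assumes they are concave gain functions of the number of leaves given to each side and that the split maximizes $f_L(l) + f_R(N - l)$. The function name `water_fill_split` and the example gain functions are hypothetical. For concave gains, handing out leaves one at a time to the side with the larger marginal gain reaches the maximum.

```python
def water_fill_split(gain_left, gain_right, n):
    """Return how many of n leaves go to the left child.

    Assumption (not from the slides): gain_left and gain_right are concave
    in the number of leaves, and the goal is to maximize
    gain_left(l) + gain_right(n - l). Ties go to the right child.
    """
    left = right = 0
    for _ in range(n):
        step_left = gain_left(left + 1) - gain_left(left)
        step_right = gain_right(right + 1) - gain_right(right)
        if step_left > step_right:
            left += 1
        else:
            right += 1
    return left


def example_gain(l):
    """Made-up concave gain with diminishing returns, for illustration only."""
    return 1.0 - 0.5 ** l


# Identical gains on both sides give the even split floor(n / 2).
print(water_fill_split(example_gain, example_gain, 5))  # 2
```

When both sides share the same gain function, the loop alternates between the children, which reproduces the floor(N/2) rule for identically distributed innovations.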

Covariance Tree Result
- Result: for a positive correlation progression, choosing leaf nodes uniformly in the tree is optimal. However, for a negative correlation progression, this same uniform choice is the worst case!
- Optimality proof: simply construct an independent innovations tree with a similar correlation structure
- Worst-case proof: the uniform choice maximizes the sum of the elements of $S_X$. Eigenanalysis shows that this implies the uniform choice minimizes the sum of the elements of $S_X^{-1}$
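The step from the sum of the elements of $S_X^{-1}$ to the estimation error follows from the standard formula for the linear minimum mean squared error. A short derivation, where the symbols $V_0$ (root), $V_X$ (vector of selected leaves), $\rho$ (the constant root-to-leaf covariance) and $\mathcal{E}(X)$ (the error) are notation chosen here and may differ from the authors':

```latex
\begin{align*}
\mathcal{E}(X)
  &= \operatorname{var}(V_0)
     - \operatorname{cov}(V_0, V_X)\, S_X^{-1}\, \operatorname{cov}(V_X, V_0) \\
  &= \operatorname{var}(V_0) - \rho^{2}\, \mathbf{1}^{T} S_X^{-1} \mathbf{1},
  \qquad \text{since } \operatorname{cov}(V_X, V_0) = \rho\, \mathbf{1}.
\end{align*}
```

Here $\mathbf{1}^{T} S_X^{-1} \mathbf{1}$ is exactly the sum of the elements of $S_X^{-1}$, so a leaf set that minimizes this sum maximizes the error, which is why the uniform choice is the worst case under a negative correlation progression.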