Property Testing on Product Distributions: Optimal Testers for Bounded Derivative Properties Deeparnab Chakrabarty Microsoft Research Bangalore Kashyap.

Slides:



Advertisements
Similar presentations
Optimal Lower Bounds for 2-Query Locally Decodable Linear Codes Kenji Obata.
Advertisements

Bart Jansen 1.  Problem definition  Instance: Connected graph G, positive integer k  Question: Is there a spanning tree for G with at least k leaves?
Time-Space Tradeoffs in Resolution: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell.
Minimum Clique Partition Problem with Constrained Weight for Interval Graphs Jianping Li Department of Mathematics Yunnan University Jointed by M.X. Chen.
Secret Sharing, Matroids, and Non-Shannon Information Inequalities.
Gillat Kol joint work with Ran Raz Locally Testable Codes Analogues to the Unique Games Conjecture Do Not Exist.
Property testing of Tree Regular Languages Frédéric Magniez, LRI, CNRS Michel de Rougemont, LRI, University Paris II.
Distributional Property Estimation Past, Present, and Future Gregory Valiant (Joint work w. Paul Valiant)
Comments We consider in this topic a large class of related problems that deal with proximity of points in the plane. We will: 1.Define some proximity.
Bipartite Matching, Extremal Problems, Matrix Tree Theorem.
Approximation Algorithms Chapter 14: Rounding Applied to Set Cover.
Fast Algorithms For Hierarchical Range Histogram Constructions
A UNIFIED FRAMEWORK FOR TESTING LINEAR-INVARIANT PROPERTIES ARNAB BHATTACHARYYA CSAIL, MIT (Joint work with ELENA GRIGORESCU and ASAF SHAPIRA)
Cse 521: design and analysis of algorithms Time & place T, Th pm in CSE 203 People Prof: James Lee TA: Thach Nguyen Book.
1 The TSP : Approximation and Hardness of Approximation All exact science is dominated by the idea of approximation. -- Bertrand Russell ( )
Time-Space Tradeoffs in Resolution: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell.
Approximation Algorithms Chapter 5: k-center. Overview n Main issue: Parametric pruning –Technique for approximation algorithms n 2-approx. algorithm.
1 Distribution-free testing algorithms for monomials with a sublinear number of queries Elya Dolev & Dana Ron Tel-Aviv University.
Distribution-free testing algorithms for monomials with a sublinear number of queries Elya Dolev & Dana Ron Tel-Aviv University.
New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions Rocco Servedio Joint work with Xi Chen and Li-Yang Tan Columbia University.
02/01/11CMPUT 671 Lecture 11 CMPUT 671 Hard Problems Winter 2002 Joseph Culberson Home Page.
By : L. Pour Mohammad Bagher Author : Vladimir N. Vapnik
The main idea of the article is to prove that there exist a tester of monotonicity with query and time complexity.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 8 May 4, 2005
Sublinear Algorithms for Approximating Graph Parameters Dana Ron Tel-Aviv University.
Robust Network Design with Exponential Scenarios By: Rohit Khandekar Guy Kortsarz Vahab Mirrokni Mohammad Salavatipour.
Graphs and Trees This handout: Trees Minimum Spanning Tree Problem.
CSE 421 Algorithms Richard Anderson Lecture 4. What does it mean for an algorithm to be efficient?
1 Sampling Lower Bounds via Information Theory Ziv Bar-Yossef IBM Almaden.
Group Strategyproofness and No Subsidy via LP-Duality By Kamal Jain and Vijay V. Vazirani.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint work with Mira Gonen Dana Ron Tel-Aviv University.
CS151 Complexity Theory Lecture 6 April 15, 2004.
An introduction to Approximation Algorithms Presented By Iman Sadeghi.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
A General Approach to Online Network Optimization Problems Seffi Naor Computer Science Dept. Technion Haifa, Israel Joint work: Noga Alon, Yossi Azar,
CS774. Markov Random Field : Theory and Application Lecture 08 Kyomin Jung KAIST Sep
1 By: MOSES CHARIKAR, CHANDRA CHEKURI, TOMAS FEDER, AND RAJEEV MOTWANI Presented By: Sarah Hegab.
The Complexity of Optimization Problems. Summary -Complexity of algorithms and problems -Complexity classes: P and NP -Reducibility -Karp reducibility.
Expanders via Random Spanning Trees R 許榮財 R 黃佳婷 R 黃怡嘉.
1 Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Penn State University TexPoint fonts used in EMF. Read the TexPoint manual before you delete this.
Chapter 2 Mathematical preliminaries 2.1 Set, Relation and Functions 2.2 Proof Methods 2.3 Logarithms 2.4 Floor and Ceiling Functions 2.5 Factorial and.
Private Approximation of Search Problems Amos Beimel Paz Carmi Kobbi Nissim Enav Weinreb (Technion)
Approximate schemas Michel de Rougemont, LRI, University Paris II.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
Testing the independence number of hypergraphs
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
The length of vertex pursuit games Anthony Bonato Ryerson University CCC 2013.
Bipartite Matching. Unweighted Bipartite Matching.
狄彥吾 (Yen-Wu Ti) 華夏技術學院資訊工程系 Property Testing on Combinatorial Objects.
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Algorithms for hard problems Parameterized complexity – definitions, sample algorithms Juris Viksna, 2015.
Iterative Rounding in Graph Connectivity Problems Kamal Jain ex- Georgia Techie Microsoft Research Some slides borrowed from Lap Chi Lau.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Theory of Computing Lecture 12 MAS 714 Hartmut Klauck.
Theory of Computational Complexity Probability and Computing Ryosuke Sasanuma Iwama and Ito lab M1.
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
P & NP.
Algorithms for Big Data: Streaming and Sublinear Time Algorithms
On Sample Based Testers
An introduction to Approximation Algorithms Presented By Iman Sadeghi
Haim Kaplan and Uri Zwick
Lecture 18: Uniformity Testing Monotonicity Testing
Chapter 5. Optimal Matchings
Warren Center for Network and Data Sciences
Applications of Derivatives
CIS 700: “algorithms for Big Data”
cse 521: design and analysis of algorithms
Problem Solving 4.
Miniconference on the Mathematics of Computation
Clustering.
Presentation transcript:

Property Testing on Product Distributions: Optimal Testers for Bounded Derivative Properties Deeparnab Chakrabarty Microsoft Research Bangalore Kashyap Dixit (PSU), Madhav Jha (Sandia), C. Seshadhri (Sandia)

Functional Property Testing (x 1 x 2 … x d ) f(x 1, x 2,…, x d ) Blackbox access. quality = #queries.

False Positives via Stat. Indistinguishability IDEAL WORLD: Either, the tester finds a violation and REJECTS. Or tester concludes that function satisfies property, and ACCEPTS. REAL D: ambient distribution over inputs. dist D (f,g)

Formal Definition *: one-sided tester, which never rejects a function satisfying the property.

Monotonicity f Relevant even when domain and range is {0,1}. Monotone concepts in learning theory. Property of being sorted.

Smoothness (Lipschitz Continuity) f Robustness of Programs. Fundamental in Differential Privacy.

Bounded Derivative Properties Optimal Testers for all bounded derivative properties with respect to arbitrary product distributions.

Quasimetric form Bounding Family x y z x y x’ y’

Previous Work Goldreich-Goldwasser-Lehman-Ron 1998, Ergun et al 1998, Dodis et al 1999 Lehman-Ron 2001, Fischer et al. 2002, Fischer 2004, Parnas-Ron-Rubinfeld 2006, Ailon et al 2007, Bhattacharya et al 2009, Briet et al 2010, Blais-Brody-Matulef 2011, Jha-Raskhodnikova 2011, Awasthi et al. 2012, Chakrabarty-Seshadhri 2013a Chakrabarty-Seshadhri 2013a Ailon-Chazelle 2004, Halevy Kushilevitz 2007, Dixit et al Uniform Distribution Ailon-Chazelle 2004 Dixit et al 2013 Product Distribution

Binary Search Trees Give product Distribution D = D 1 X … X D d, ∆ * (D) = ∆ * (D 1 ) + … + ∆ * (D d ) At most the entropy, but could be less by additive d. Rooted Binary Tree with n vertices marked 1 to n. label(left-child) < label(v) < label(right-child)

Statement of Results Upper Bounds. Given any product distribution D and any bounded derivative property P(B), there exists a 100ε -1 ∆ * (D)-query P(B)-tester. Lower Bounds. For any bounded derivative property P(B), and any stable product distribution D, for some constant ε, Ω(∆ * (D))-queries are necessary. Dimension Reduction Theorem. dist i (f) be the distance of the function restricted to a “random” i-line. Then, dist 1 (f) + dist 2 (f) + … + dist d (f) ≥ dist(f)/4

The Line

Algorithm x

Analysis Lemma: Pr[Find Violation] ≥ dist D (f). Certificate of distance: dist D (f) = min μ D (VC) where VC is a “hitting set” of all violations. X be set of points which have violn with some ancestor. Pr[Violation] = μ D (X) X forms a vertex cover. x y z If (x,y) is a violation then either (x,z) or (y,z) is a violation, where z = lca(x,y) Triangle inequality of m Linearity of m

Lower Bound (monotonicity) Setting:[Fischer’04, CS’13] Collection of ‘hard’ functions: g 1,…,g L each ε-far, and q-queries “distinguishes” at most q of these g i ’s from a specified monotone function h, implies Ω(L) lower bnd Hard function from each level k of the median BST. Properties of g k : - (x,y) is a violation iff lca(x) is in level k - dist D (g k ) ≥ μ ≥k (T)/2 For stable distributions, μ ≥k (T) is constant after Ω(∆ * (D)) levels Intervals μ ≥k (T): mass beyond level k

Dimension Reduction

Statement and Application Dimension Reduction Theorem. dist i (f) be the distance of the function restricted to a “random” i-line. Then, dist 1 (f) + dist 2 (f) + … + dist d (f) ≥ dist(f)/4 Algorithm for [n] d : Sample x←D and choose a line passing through it uar. Run algorithm for line on function restricted to this line. Exp[Queries] = 1/d(∆ * (D 1 ) + … + ∆ * (D d )) = ∆ * (D)/d Pr[Find Violation] ≥ 1/d(dist 1 (f) + … + dist d (f)) ≥ dist(f)/4d

First Try Dimension Reduction Theorem. dist i (f) be the distance of the function restricted to a “random” i-line. Then, dist 1 (f) + dist 2 (f) + … + dist d (f) ≥ dist(f)/4 Our Approach: Non-constructive based on Matchings and Alternating Paths. Contrapositive: if most lines can be fixed with small changes, then so can the whole hypergrid. Fixing one dimension may introduce new violations in other dimensions

Matchings and Alternating Paths Violation Graph of f has an edge for every pair of violations. Main Structural Theorem If f has no violations along dimension i, then there exists a maximal matching that doesn’t cross dimension i. Folklore Lemma: If dist U (f) = ε, then any maximal matching in VG has cardinality larger than εn d /2. Maximum weight matching wrt certain weighing scheme

Proof from Structure Thm Given f, let M i be the maximum weight matching which has no j-cross pairs for 1 ≤j ≤ i. So, |M 0 | ≥ dist(f)n d /2, and |M d | = 0 Bounded Drop Lemma. For any k, |M k-1 | - |M k | ≤ 2dist k (f)n d Implies: dist 1 (f) + dist 2 (f) + … + dist d (f) ≥ dist(f)/4 Proof. Let f k be the closest function to f with no viol along dir k. dist(f,f k ) = dist k (f). Let N be the maximum weight matching wrt f k. |M 0 | - |N| ≤ dist(f,f k )n d ≤ dist k (f)n d |N| - |M 1 | ≤ dist(f,f k ) n d ≤ dist k (f)n d (Look at M 0 ∆ N) N has no k-cross pairs.

Take Home Points and Points to Ponder on Optimal Testers for the class of bounded (first) derivative properties under any product distribution. Inherent connection to search trees. Subsumes many results known for monotonicity and Lipschitz Continuity testing. Near Optimal Dimension Reduction. What we didn’t cover today: proof of the structure theorem, uniform to arbitrary product distributions, and proof of the general lower bound. More general distributions? Can we do a general distribution on a 2D grid? What’s the answer? Bounded Second derivative property? Can we test submodularity?