New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions Rocco Servedio Joint work with Xi Chen and Li-Yang Tan Columbia University.

Slides:



Advertisements
Similar presentations
1+eps-Approximate Sparse Recovery Eric Price MIT David Woodruff IBM Almaden.
Advertisements

Quantum Lower Bounds The Polynomial and Adversary Methods Scott Aaronson September 14, 2001 Prelim Exam Talk.
Approximate List- Decoding and Hardness Amplification Valentine Kabanets (SFU) joint work with Russell Impagliazzo and Ragesh Jaiswal (UCSD)
Gillat Kol joint work with Ran Raz Locally Testable Codes Analogues to the Unique Games Conjecture Do Not Exist.
Lecture 24 MAS 714 Hartmut Klauck
Distributional Property Estimation Past, Present, and Future Gregory Valiant (Joint work w. Paul Valiant)
1 Distribution-free testing algorithms for monomials with a sublinear number of queries Elya Dolev & Dana Ron Tel-Aviv University.
Property Testing on Product Distributions: Optimal Testers for Bounded Derivative Properties Deeparnab Chakrabarty Microsoft Research Bangalore Kashyap.
Approximating Average Parameters of Graphs Oded Goldreich, Weizmann Institute Dana Ron, Tel Aviv University.
Dictator tests and Hardness of approximating Max-Cut-Gain Ryan O’Donnell Carnegie Mellon (includes joint work with Subhash Khot of Georgia Tech)
Asaf Shapira (Georgia Tech) Joint work with: Arnab Bhattacharyya (MIT) Elena Grigorescu (Georgia Tech) Prasad Raghavendra (Georgia Tech) 1 Testing Odd-Cycle.
The main idea of the article is to prove that there exist a tester of monotonicity with query and time complexity.
Proclaiming Dictators and Juntas or Testing Boolean Formulae Michal Parnas Dana Ron Alex Samorodnitsky.
Learning, testing, and approximating halfspaces Rocco Servedio Columbia University DIMACS-RUTCOR Jan 2009.
Derandomization: New Results and Applications Emanuele Viola Harvard University March 2006.
On Uniform Amplification of Hardness in NP Luca Trevisan STOC 05 Paper Review Present by Hai Xu.
Testing of Clustering Noga Alon, Seannie Dar Michal Parnas, Dana Ron.
4. Multiple Regression Analysis: Estimation -Most econometric regressions are motivated by a question -ie: Do Canadian Heritage commercials have a positive.
Michael Bender - SUNY Stony Brook Dana Ron - Tel Aviv University Testing Acyclicity of Directed Graphs in Sublinear Time.
On Proximity Oblivious Testing Oded Goldreich - Weizmann Institute of Science Dana Ron – Tel Aviv University.
1 On approximating the number of relevant variables in a function Dana Ron & Gilad Tsur Tel-Aviv University.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint work with Mira Gonen Dana Ron Tel-Aviv University.
1 Algorithmic Aspects in Property Testing of Dense Graphs Oded Goldreich – Weizmann Institute Dana Ron - Tel-Aviv University.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint works with Mira Gonen and Oded Goldreich Dana Ron Tel-Aviv University.
Preference Analysis Joachim Giesen and Eva Schuberth May 24, 2006.
Lower Bounds for Property Testing Luca Trevisan U C Berkeley.
Learning and testing k-modal distributions Rocco A. Servedio Columbia University Joint work (in progress) with Ilias Diakonikolas UC Berkeley Costis Daskalakis.
On Testing Computability by small Width OBDDs Oded Goldreich Weizmann Institute of Science.
Scott Aaronson (MIT) Andris Ambainis (U. of Latvia) Forrelation: A Problem that Optimally Separates Quantum from Classical Computing H H H H H H f |0 
Some 3CNF Properties are Hard to Test Eli Ben-Sasson Harvard & MIT Prahladh Harsha MIT Sofya Raskhodnikova MIT.
How Robust are Linear Sketches to Adaptive Inputs? Moritz Hardt, David P. Woodruff IBM Research Almaden.
Dana Moshkovitz, MIT Joint work with Subhash Khot, NYU.
Correlation testing for affine invariant properties on Shachar Lovett Institute for Advanced Study Joint with Hamed Hatami (McGill)
Approximating the MST Weight in Sublinear Time Bernard Chazelle (Princeton) Ronitt Rubinfeld (NEC) Luca Trevisan (U.C. Berkeley)
Ragesh Jaiswal Indian Institute of Technology Delhi Threshold Direct Product Theorems: a survey.
A Property Testing Double-Feature of Short Talks Oded Goldreich Weizmann Institute of Science Talk at Technion, June 2013.
Quantum Computing MAS 725 Hartmut Klauck NTU TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A.
February 18, 2015CS21 Lecture 181 CS21 Decidability and Tractability Lecture 18 February 18, 2015.
Submodular Functions Learnability, Structure & Optimization Nick Harvey, UBC CS Maria-Florina Balcan, Georgia Tech.
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
1 Sublinear Algorithms Lecture 1 Sofya Raskhodnikova Penn State University TexPoint fonts used in EMF. Read the TexPoint manual before you delete this.
Cs3102: Theory of Computation Class 24: NP-Completeness Spring 2010 University of Virginia David Evans.
Private Approximation of Search Problems Amos Beimel Paz Carmi Kobbi Nissim Enav Weinreb (Technion)
Lower Bounds for Property Testing Luca Trevisan U.C. Berkeley.
Sublinear Algorithms via Precision Sampling Alexandr Andoni (Microsoft Research) joint work with: Robert Krauthgamer (Weizmann Inst.) Krzysztof Onak (CMU)
Amplification and Derandomization Without Slowdown Dana Moshkovitz MIT Joint work with Ofer Grossman (MIT)
Massive Data Sets and Information Theory Ziv Bar-Yossef Department of Electrical Engineering Technion.
Unique Games Approximation Amit Weinstein Complexity Seminar, Fall 2006 Based on: “Near Optimal Algorithms for Unique Games" by M. Charikar, K. Makarychev,
狄彥吾 (Yen-Wu Ti) 華夏技術學院資訊工程系 Property Testing on Combinatorial Objects.
Complexity and Efficient Algorithms Group / Department of Computer Science Testing the Cluster Structure of Graphs Christian Sohler joint work with Artur.
Tight Bound for the Gap Hamming Distance Problem Oded Regev Tel Aviv University TexPoint fonts used in EMF. Read the TexPoint manual before you delete.
Why almost all satisfiable k - CNF formulas are easy? Danny Vilenchik Joint work with A. Coja-Oghlan and M. Krivelevich.
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
The Message Passing Communication Model David Woodruff IBM Almaden.
The NP class. NP-completeness Lecture2. The NP-class The NP class is a class that contains all the problems that can be decided by a Non-Deterministic.
Imperfectly Shared Randomness
The NP class. NP-completeness
On Sample Based Testers
Property Testing (a.k.a. Sublinear Algorithms )
Information Complexity Lower Bounds
New Characterizations in Turnstile Streams with Applications
Dana Ron Tel Aviv University
On Testing Dynamic Environments
From dense to sparse and back again: On testing graph properties (and some properties of Oded)
Warren Center for Network and Data Sciences
CIS 700: “algorithms for Big Data”
Partly Verifiable Signals (c.n.)
Gautam Kamath Simons Institute  University of Waterloo
Learning, testing, and approximating halfspaces
Imperfectly Shared Randomness
Presentation transcript:

New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions Rocco Servedio Joint work with Xi Chen and Li-Yang Tan Columbia University Institute for Advanced Study March 2014

Property Testing All Boolean functions Property P  Query access to unknown f on any input x  With as few queries as possible, decide if Far from P Simplest question about a Boolean function: Does it have some property P? f has Property P vs. f does not have Property P Unreasonable: Easy lower bound of f is far from having Property P

Rules of the game 1.If f has Property P, accept w.p. > 2/3. 2.If f is ε-far from having Property P, reject w.p. > 2/3. 3.Otherwise: doesn’t matter what we do. All Boolean functions Property PFar from P Query access to unknown f on any input x

Super-efficient algorithms Sublinear Time Property Testing Sublinear Space Streaming, Sketching Sublinear Measurement Sparse Recovery, Compressed Sensing 1.Many properties P testable with surprisingly few queries. 2.Rich connections with many other areas:  Learning theory  Hardness of approximation  Communication complexity ... Two recurring messages:

This work: P = monotonicity

Monotone Far from monotone Well-studied problem: [GGR98, GGL+98, DGL+99, FLN+02, HK08, BCGM12, RRS+12, BBM12, BRY13, CS13, …] but still significant gaps in our understanding. This work: P = monotonicity

Background and prior results

For all monotone functions g: A monotone function is one that satisfies: Monotone Far from monotone Quick reminder about monotonicity “Flipping an input bit from 0 to 1 cannot make f go from 1 to 0”

Monotone Far from monotone 0n0n 0n0n 1n1n 1n1n

First test that comes to mind Pick random edge y = x = Reject! f(x) = 0 f(y) = 1 f(x) = 1 f(y) = 1 f(x) = 0 f(y) = 0 f(x) = 1 f(y) = 0 Repeat ? x y Call such an edge a “violating edge”

Theorem [Goldreich et al. 1998] If f is ε-far from monotone, Ω(ε/n) fraction of edges are violations. Therefore tester will reject within O(n/ε) queries. An observation and a theorem Reject! f(x) = 0 f(y) = 1 Call such an edge a “violating edge” Simple observation: If f is monotone, tester never rejects. Question: If f is ε-far from monotone, how likely to catch violating edge? ? x y Tester = Sample random edges, check for violations.

An exponential gap between upper and lower bounds  Goldreich et al. [FOCS 1998]  Introduced problem, gave tester with O(n) query complexity.  Fischer et al. [STOC 2002]  Any non-adaptive tester must make Ω(log n) queries.  Therefore, any adaptive tester must make Ω(log log n) queries.  Chakrabarty-Seshadhri [STOC 2013]  O(n 7/8 )-query non-adaptive tester! [GGR98, DGL+99, HK08, BCGM12, RRS+12, BRY13, CS13, …]

Lower bound: Any non-adaptive tester must make queries. Therefore, any adaptive tester must make queries. Exponential improvements over Ω(log n) and Ω(log log n) lower bounds of Fischer et al. (2002) Upper bound: There is a non-adaptive tester that make queries. Polynomial improvement over O(n 7/8 ) upper bound of Chakrabarty and Seshadhri (2013) Our results

Rest of talk The lower bound Main idea of the argument in a simpler setting Key technical ingredient: Multidimensional Central Limit Theorems Generalizing the lower bound: Testing monotonicity on hypergrids

Yao’s minimax principle Lower bound against randomized algorithms Tricky distribution over inputs to deterministic algorithms implies

Yao’s principle in our setting Monotone Far from Monotone Distribution D no supported on far from monotone functions Indistinguishability. For all T = deterministic tester that makes o(n 1/5 ) queries, Distribution D yes supported on monotone functions

D no : = −1 with prob 0.1, 7/3 with prob 0.9 Main Structural Result: Indistinguishability Any deterministic tester that makes few queries cannot tell D yes from D no Key property:,. D yes : = uniform from {1,3} Our D yes and D no distributions Both supported on Linear Threshold Functions (LTFs) over {−1,1} n : Verify: D yes LTFs are monotone, D no LTFs far from monotone w.h.p.

Claim. For all T = deterministic tester that makes q = o(n 1/5 ) queries, Indistinguishability: starting small q = 1 query

Non-trivial proof of a triviality Claim. Let T = deterministic tester that makes 1 query, on string z. Then: (*)(*) vs. Tester sees:

Central Limit Theorems. Sum of many independent “reasonable” random variables converges to Gaussian of same mean and variance. Main analytic tool (Baby version):

Goal: Upper bound Recall key property:

Claim. Let T = deterministic tester that makes 1 query. Then: We just proved: Plenty of room to spare! Would be happy with < 0.1

q queries instead of 1 Multidimensional CLTs. Sum of many independent “reasonable” q-dimensional random variables converge to q-dimensional Gaussian of same mean and variance. Main analytic tool (Grown-up version):

Multidimensional CLTs [Mossel 08, Gopalan-O’Donnell-Wu-Zuckerman 10] Lower bound [Valiant-Valiant 11] Main technical work: Adapting multidimensional CLT for Earth Mover Distance to get

Claim. For all T = deterministic tester that makes q = o(n 1/5 ) queries, Let’s prove the real thing:

Setting things up Arrange the q queries of tester T in a q x n matrix Q i-th query string Recall: Tester’s goal is to distinguish versus

What does the tester see? Goal: Upper bound Tester sees: (coordinate-wise sign)

Goal: Upper bound Random variables supported on 2 q orthants of R q union of orthants “roughly equal weight on any union of orthants”

Fixed from product distribution over R n sum of n independent vectors in R q Ditto:

Goal: Two sums of n independent vectors in R q are “close” where closeness = roughly equal weight on any union of orthants. The final setup Furthermore, since suffices to show each are close to q-dimensional Gaussian with matching mean and covariance matrix.

Valiant-Valiant Multidimensional CLT Sum of many independent “reasonable” q-dimensional random variables is close to q-dimensional Gaussian of same mean and variance. with respect to Earth Mover Distance: Minimum amount of work necessary to “change one PDF into the other one”, where work := mass x distance Technical lemma: Closeness in EMD roughly equal weight on any union of orthants

Valiant-Valiant Multidimensional CLT

Key technical lemma Closeness in EMD Roughly equal weight on any union of orthants

Let’s consider the contrapositive: for all unions of orthants ≥ 0.1 unit of mass must be moved out of Recall: work = mass x distance Slight technical obstacle: What if this 0.1 unit of mass only moves a tiny distance? Solution: Then G has ≥ 0.1 weight on red boundary. Not possible thanks to anti-concentration of G If we want to make S look like G,

In slightly more detail For all r > 0, define B r := radius r boundary around Therefore:, has to be moved out of BrBr

We just proved Recall the Valiant-Valiant CLT: Gaussian anti-concentration Valiant-Valiant CLT

The details Recall the Valiant-Valiant CLT:  =O(1) in our setting = in our setting Optimal choice of makes RHS Pick, and RHS is o(1) as desired.

Indistinguishability. For all T = deterministic tester that makes q = o(n 1/5 ) queries, Recap Lower bound: Any non-adaptive tester must make queries. Therefore, any adaptive tester must make queries.

Testing monotonicity of Boolean functions over general hypergrid domains

Boolean functions over hypergrids Monotone Far from monotone All Boolean functions All Boolean-valued functions over hypergrid Testing monotonicity of Boolean functions over hypergrids

Lower bound: * Any non-adaptive tester for testing monotonicity of.must make queries. * only for even m Proof by reduction to m=2 case (Boolean hypercube)

A useful characterization Theorem. [Fischer et al. 2002] There exists ε. m n vertex-disjoint pairs such that and. is ε-far from monotone “violating pair” (Upward direction is easy)

Given any, define as follows: Reducing to m = 2 numbers in [m] bits in {0,1} Easy: If f is monotone then so is F. Remains to argue: If f is ε-far from monotone then so is F. Exists ε2 n vertex-disjoint pairs in {0,1} n that are violations w.r.t. f Exists ε. m n vertex-disjoint pairs in [m] n that are violations w.r.t. F Useful characterization Each violating pair (m/2) n vertex-disjoint violating pairs suffices to show:

f(x) = 0 f(y) = S(x)S(x) S(y)S(y) {0,1} n [m]n[m]n |S(x)| = |S(y)| = (m/2) n where Easy end game: exhibit order-preserving bijection between S(x) and S(y)

Conclusion  A polynomial lower bound for testing monotonicity of Boolean functions  Main technical ingredient: multidimensional central limit theorems  Proof extends to testing monotonicity over general hypergrid domains (when m is even) Open Problem: Polynomial lower bounds against adaptive testers? Lower bound: Any non-adaptive tester must make queries. Therefore, any adaptive tester must make queries.

What I didn’t talk about  An improved one-sided, non-adaptive tester  Builds on ingredients from [Chakrabarty and Seshadhri 2013]: trade off edge tester versus “path tester”  We give a different “path tester” with an improved analysis Open Problems: can we exploit adaptivity or two-sided error to obtain more efficient testers? Upper bound: There is a non-adaptive tester that make queries.

Thank you!