The Complexity of Matrix Completion Nick Harvey David Karger Sergey Yekhanin.

Slides:



Advertisements
Similar presentations
Linear-Degree Extractors and the Inapproximability of Max Clique and Chromatic Number David Zuckerman University of Texas at Austin.
Advertisements

An Ω(n 1/3 ) Lower Bound for Bilinear Group Based Private Information Retrieval Alexander Razborov Sergey Yekhanin.
Max Cut Problem Daniel Natapov.
Uniqueness of Optimal Mod 3 Circuits for Parity Frederic Green Amitabha Roy Frederic Green Amitabha Roy Clark University Akamai Clark University Akamai.
Lecture 21 NP-complete problems
Complexity class NP Is the class of languages that can be verified by a polynomial-time algorithm. L = { x in {0,1}* | there exists a certificate y with.
Jeff Kinne Indiana State University Part I: Feb 11, 2011 Part II: Feb 25, 2011.
CS Department Fireside Chat All are welcome! Wed Nov 18, 5-6pm, Ols 228e/236d Kim Hazelwood and Wes Weimer Meet and ask them questions in a non-academic.
Non-commutative computation with division Avi Wigderson IAS, Princeton Pavel Hrubes U. Washington.
1/17 Optimal Long Test with One Free Bit Nikhil Bansal (IBM) Subhash Khot (NYU)
Complexity ©D.Moshkovits 1 Hardness of Approximation.
The number of edge-disjoint transitive triples in a tournament.
Complexity 11-1 Complexity Andrei Bulatov NP-Completeness.
Approximation Algoirthms: Semidefinite Programming Lecture 19: Mar 22.
CS151 Complexity Theory Lecture 6 April 15, 2015.
February 23, 2015CS21 Lecture 201 CS21 Decidability and Tractability Lecture 20 February 23, 2015.
CS151 Complexity Theory Lecture 7 April 20, 2004.
Simultaneous Matchings Irit Katriel - BRICS, U of Aarhus, Denmark Joint work with Khaled Elabssioni and Martin Kutz - MPI, Germany Meena Mahajan - IMSC,
Arithmetic Hardness vs. Randomness Valentine Kabanets SFU.
Haplotyping via Perfect Phylogeny Conceptual Framework and Efficient (almost linear-time) Solutions Dan Gusfield U.C. Davis RECOMB 02, April 2002.
Deterministic Network Coding by Matrix Completion Nick Harvey David Karger Kazuo Murota.
1 CSE 417: Algorithms and Computational Complexity Winter 2001 Lecture 23 Instructor: Paul Beame.
2-Layer Crossing Minimisation Johan van Rooij. Overview Problem definitions NP-Hardness proof Heuristics & Performance Practical Computation One layer:
Algebraic Structures and Algorithms for Matching and Matroid Problems Nick Harvey.
1 CSE 417: Algorithms and Computational Complexity Winter 2001 Lecture 24 Instructor: Paul Beame.
An Algebraic Algorithm for Weighted Linear Matroid Intersection
CSE 421 Algorithms Richard Anderson Lecture 27 NP Completeness.
Chapter 11: Limitations of Algorithmic Power
CS151 Complexity Theory Lecture 6 April 15, 2004.
Query Lower Bounds for Matroids via Group Representations Nick Harvey.
Randomness in Computation and Communication Part 1: Randomized algorithms Lap Chi Lau CSE CUHK.
C&O 355 Mathematical Programming Fall 2010 Lecture 17 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
Repairable Fountain Codes Megasthenis Asteris, Alexandros G. Dimakis IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 32, NO. 5, MAY /5/221.
Physical Mapping of DNA Shanna Terry March 2, 2004.
1 A fast algorithm for Maximum Subset Matching Noga Alon & Raphael Yuster.
Recent Developments in Algebraic Proof Complexity Recent Developments in Algebraic Proof Complexity Iddo Tzameret Tsinghua Univ. Based on Pavel Hrubeš.
C&O 750 Randomized Algorithms Winter 2011 Lecture 24 Nicholas Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
Pricing Combinatorial Markets for Tournaments Presented by Rory Kulz.
Great Theoretical Ideas in Computer Science.
Zeev Dvir Weizmann Institute of Science Amir Shpilka Technion Locally decodable codes with 2 queries and polynomial identity testing for depth 3 circuits.
MCS 312: NP Completeness and Approximation algorthms Instructor Neelima Gupta
An algorithmic proof of the Lovasz Local Lemma via resampling oracles Jan Vondrak IBM Almaden TexPoint fonts used in EMF. Read the TexPoint manual before.
Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.
EMIS 8373: Integer Programming NP-Complete Problems updated 21 April 2009.
May 9, New Topic The complexity of counting.
1 Efficient Rule Matching for Large Scale Systems Packet Classification – A Case Study Alok Tongaonkar Stony Brook University TexPoint fonts used in EMF.
CS151 Complexity Theory Lecture 12 May 6, QSAT is PSPACE-complete Theorem: QSAT is PSPACE-complete. Proof: 8 x 1 9 x 2 8 x 3 … Qx n φ(x 1, x 2,
Surya Bhupatiraju MIT PRIMES May 20 th, 2012 On the Complexity of the Marginal Consistency Problem.
Umans Complexity Theory Lectures Lecture 1a: Problems and Languages.
1 Randomization, Derandomization, and Parallelization --- take the MIS problem as a demonstration Speaker: Hong Chih-Duo Advisor: Chao Kuen-Mao National.
1 Chapter 34: NP-Completeness. 2 About this Tutorial What is NP ? How to check if a problem is in NP ? Cook-Levin Theorem Showing one of the most difficult.
NP-COMPLETE PROBLEMS. Admin  Two more assignments…  No office hours on tomorrow.
1/19 Minimizing weighted completion time with precedence constraints Nikhil Bansal (IBM) Subhash Khot (NYU)
NP-Complete problems.
Amplification and Derandomization Without Slowdown Dana Moshkovitz MIT Joint work with Ofer Grossman (MIT)
CS 3343: Analysis of Algorithms Lecture 25: P and NP Some slides courtesy of Carola Wenk.
“One ring to rule them all” Analogy (sort of) Lord of The Rings Computational Complexity “One problem to solve them all” “my preciousss…”
CS6045: Advanced Algorithms NP Completeness. NP-Completeness Some problems are intractable: as they grow large, we are unable to solve them in reasonable.
Happy 80th B’day Dick.
CSC 413/513: Intro to Algorithms
CSE 421 Algorithms Richard Anderson Lecture 27 NP-Completeness Proofs.
C&O 355 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
1 Design and Analysis of Algorithms Yoram Moses Lecture 13 June 17, 2010
1 IAS, Princeton ASCR, Prague. The Problem How to solve it by hand ? Use the polynomial-ring axioms ! associativity, commutativity, distributivity, 0/1-elements.
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
CMU Graduate Complexity Lecture 20
Instructor: Shengyu Zhang
Hiroshi Hirai University of Tokyo
Complexity Theory in Practice
Switching Lemmas and Proof Complexity
Presentation transcript:

The Complexity of Matrix Completion Nick Harvey David Karger Sergey Yekhanin

What is matrix completion? Given matrix containing variables, substitute values for the variables to get full rank 1 x 1 y x=1, y=0 1 x 1 y 1 x=1, y=1 Bad

Why should I care? Combinatorics Many combinatorial problems relate to matrices of variables Tutte ’47, Edmonds ’67, Lovasz ’79 Relation to Algebra Tomizawa-Iri ’74, Murota ’00 Gessel-Viennot ’85 Graph Matching Matroid Intersection Counting paths in DAG Problem God (i.e., the BOOK)

Why should I care? Algorithms Often yields highly efficient algorithms RNC: KUW’86, MVV’87 Sequential O(n 2.38 ) time: MS’04, H’06 O(nr 1.38 ) time: H’06 Random Network Codes: Koetter-Medard ’03, Ho et al. ’03 Graph Matching Matroid Intersection Counting paths in DAG Algorithms Problem

Why should I care? Complexity Depending on parameters, can be NP-complete, in RP, or in P  Key parameters: Field size, # variables, # occurrences of each variable Contains polynomial identity testing as special case (Valiant ’79)  Derandomizing PIT implies strong circuit lower bounds (Kabanets-Impagliazzo ’03)

Field Size Why care about field size?  Relevant to complexity: random works over large fields  Understanding smaller fields may provide insight to derandomization  Important for network coding efficiency (i.e., complexity of routers)

Complexity Regions Field Size # Occurences of an variable 2 357n NP Hard RP P Buss et al. ‘99 Lovasz ‘79 H., Karger, Murota ‘05 P Geelen ‘99 ??????

Complexity Regions # Occurences of an variable 2 357n NP Hard RP NP Hard Field Size P P

Variant: Simultaneous Completion We have set of matrices A := {A 1, …, A d }  Each variable appears at most once per matrix  An variable can appear in several matrices Def: A simultaneous completion for A assigns values to variables while preserving the rank of all matrices RP algorithm still works over large field Application to Network Coding uses Simultaneous Completion

Relationship to Single Matrix Completion Hardness for Simultaneous Completion  Hardness for Single Matrix Completion w/many occurrences of variables 1 A B C Simultaneous Completion 1 A D E 1 B C D Single Matrix Completion

Simultaneous Completion Algorithm Input: d matrices Compute rank of all matrices Pick an variable x for i  {0,…,d} Set x := i If all matrices have unchanged rank Recurse (# variables has decreased) Simple self-reducibility algorithm Operates over field F q, where d := # matrices < q Non-trivial! Murota ’93.

A Sharp Threshold Simple self-reducibility algorithm Operates over field F q, where d := # matrices < q Thm: Simultaneous completion for d matrices over F q is:  in P if q > d[HKM ’05]  NP-hard if q ≤ d[This paper]

A Sharp Threshold Thm: Simultaneous completion for d matrices over F q is:  in P if q > d[HKM ’05]  NP-hard if q ≤ d[This paper] Cor: Single matrix completion with d occurrences of variables over F q is NP-hard if q ≤ d

Approach Reduction from Circuit-SAT A NAND B C C =  ( A  B )  C = 1 - A ∙ B (if A, B, C  {0, 1})  det  0 1 A B C (if A, B, C  {0, 1})

What have we shown so far? Simultaneous completion of an unbounded number of matrices over F 2 is NP-hard Can we use fewer?  Combine small matrices into huge matrix?  Problem: Variables appear too many times  Need to somehow make “copies” of a variable Coming up next:  completing two matrices over F 2 is NP-hard

A Curious Matrix x1x x2x2 111 x3x3 11 xnxn 1 R n :=

A Curious Matrix x1x x2x2 111 x3x3 11 xnxn 1 R n := Thm: det R n =

Linearity of Determinant x1x x2x2 111 x3x3 11 xnxn 1 det x1x x2x2 111 x3x3 11 xnxn x1x x2x2 110 x3x3 10 xnxn 0 det + =

Column Expansion x1x1 111 x2x2 11 x3x3 1 xnxn (-1) n+1 det == x1x x2x2 111 x3x3 11 xnxn 1 det 1111 x1x x2x2 110 x3x3 10 xnxn 0 det +

11111 x1x x2x2 111 x3x3 11 xnxn 1

Schur Complement Identity x1x x2x2 111 x3x3 11 xnxn 1 det = det x1x1 111 x2x2 11 x3x3 1 xnxn ∙∙ -

Applying Outer Product = det 1-x 1 11-x x x n = det x1x1 111 x2x2 11 x3x3 1 xnxn ∙∙ -

Finishing up = det 1-x 1 11-x x x n = QED

Proof: det R n =, which is arithmetization of So either all variables true, or all false. Replicating Variables Corollary: If {x 1, x 2, …, x n } in {0,1} then det R n  0  x i = x j  i,j  x i   x i. ii

Replicating Variables Corollary: If {x 1, x 2, …, x n } in {0,1} then det R n  0  x i = x j  i,j Consequence: over F 2, need only 2 matrices NAND A := RnRn RnRn RnRn B :=

What have we shown so far? Simultaneous completion of:  an unbounded number of matrices over F 2 is NP-hard  two matrices over F 2 is NP-hard Next:  q matrices over F q is NP-hard

Handling Fields F q Previous gadgets only work if each x  {0,1}. How can we ensure this over F q ? Introduce q-2 auxiliary variables: x=x (1), x (2), …, x (q-1) Sufficient to enforce that: x (i)  x (j)  i,j and x (i)  {0,1}  i  2  det  0 1 x (i) x (j) etc.

Handling Fields F q x (i)  x (j)  i,j and x (i)  {0,1}  i  2 x (2) 01 x (1) x (3) x (4) x (q-1) Edge indicates endpoints non-equal

Handling Fields F q x (i)  x (j)  i,j and x (i)  {0,1}  i  2 x (2) 01 x (1) x (3) x (4) x (q-1) Pack these constraints into few matrices Each variable used once per matrix Amounts to edge-coloring From  (K n ), conclude that q matrices suffice

What have we shown so far? Simultaneous completion of:  an unbounded number of matrices over F 2 is NP-hard  two matrices over F 2 is NP-hard  q matrices over F q is NP-hard

Main Results Thm: A simultaneous completion for d matrices over F q is NP-hard if q ≤ d Cor: Completion of single matrix, variables appearing d times is NP-hard if q ≤ d Cor: Completion of skew-symmetric matrix, variables appearing d times is NP-hard if q ≤ d

Open Questions Improved hardess results / algorithms for matrix completion? Lower bounds / hardness for field size in network coding? More combinatorial uses of matrix completion