Lower bounds for small depth arithmetic circuits Chandan Saha Joint work with Neeraj Kayal (MSRI) Nutan Limaye (IITB) Srikanth Srinivasan (IITB)

Slides:

Advertisements

Similar presentations

The Polynomial Method In Quantum and Classical Computing Scott Aaronson (MIT) OPEN PROBLEM.

Advertisements

Quantum Lower Bounds The Polynomial and Adversary Methods Scott Aaronson September 14, 2001 Prelim Exam Talk.

Quantum t-designs: t-wise independence in the quantum world Andris Ambainis, Joseph Emerson IQC, University of Waterloo.

Parikshit Gopalan Georgia Institute of Technology Atlanta, Georgia, USA.

Computing with Polynomials over Composites Parikshit Gopalan Algorithms, Combinatorics & Optimization. Georgia Tech.

PRG for Low Degree Polynomials from AG-Codes Gil Cohen Joint work with Amnon Ta-Shma.

On the degree of symmetric functions on the Boolean cube Joint work with Amir Shpilka.

Are lower bounds hard to prove? Michal Koucký Institute of Mathematics, Prague.

How to Fool People to Work on Circuit Lower Bounds Ran Raz Weizmann Institute & Microsoft Research.

Time-Space Tradeoffs in Resolution: Superpolynomial Lower Bounds for Superlinear Space Chris Beck Princeton University Joint work with Paul Beame & Russell.

Uniqueness of Optimal Mod 3 Circuits for Parity Frederic Green Amitabha Roy Frederic Green Amitabha Roy Clark University Akamai Clark University Akamai.

Gillat Kol joint work with Ran Raz Competing Provers Protocols for Circuit Evaluation.

Simple Affine Extractors using Dimension Expansion. Matt DeVos and Ariel Gabizon.

Linear Systems With Composite Moduli Arkadev Chattopadhyay (University of Toronto) Joint with: Avi Wigderson TexPoint fonts used in EMF. Read the TexPoint.

Chapter 2: Second-Order Differential Equations

Jeff Kinne Indiana State University Part I: Feb 11, 2011 Part II: Feb 25, 2011.

Algorithm : Design & Analysis [4]

1 L is in NP means: There is a language L’ in P and a polynomial p so that L 1 · L 2 means: For some polynomial time computable map r : 8 x: x 2 L 1 iff.

CS151 Complexity Theory Lecture 6 April 15, 2015.

Dagstuhl 2010 University of Puerto Rico Computer Science Department The power of group algebras for constrained multilinear monomial detection Yiannis.

On the Hardness of Graph Isomorphism Jacobo Tor á n SIAM J. Comput. Vol 33, p , Presenter: Qingwu Yang April, 2006.

Arithmetic Hardness vs. Randomness Valentine Kabanets SFU.

Introduction to Algorithms

Lower Bounds for Depth Three Circuits with small bottom fanin Neeraj Kayal Chandan Saha Indian Institute of Science.

A Differential Approach to Inference in Bayesian Networks - Adnan Darwiche Jiangbo Dang and Yimin Huang CSCE582 Bayesian Networks and Decision Graphs.

Tensor-Rank and Lower Bounds for Arithmetic Formulas Ran Raz Weizmann Institute.

THE LAPLACE TRANSFORM LEARNING GOALS Definition The transform maps a function of time into a function of a complex variable Two important singularity functions.

Iddo Tzameret Tel Aviv University The Strength of Multilinear Proofs (Joint work with Ran Raz)

CSCI 4325 / 6339 Theory of Computation Zhixiang Chen.

Multilinear NC 1  Multilinear NC 2 Ran Raz Weizmann Institute.

Recent Developments in Algebraic Proof Complexity Recent Developments in Algebraic Proof Complexity Iddo Tzameret Tsinghua Univ. Based on Pavel Hrubeš.

Do Now: Evaluate each expression for x = -2. Aim: How do we work with polynomials? 1) -x + 12) x ) -(x – 6) Simplify each expression. 4) (x + 5)

Chapter 8 With Question/Answer Animations 1. Chapter Summary Applications of Recurrence Relations Solving Linear Recurrence Relations Homogeneous Recurrence.

Zeev Dvir Weizmann Institute of Science Amir Shpilka Technion Locally decodable codes with 2 queries and polynomial identity testing for depth 3 circuits.

Eric Allender Rutgers University Dual VP Classes Joint work with Anna Gál (U. Texas) and Ian Mertz (Rutgers) MFCS, Milan, August 27, 2015.

Secure Computation (Lecture 5) Arpita Patra. Recap >> Scope of MPC > models of computation > network models > modelling distrust (centralized/decentralized.

Umans Complexity Theory Lectures Lecture 1a: Problems and Languages.

THE LAPLACE TRANSFORM LEARNING GOALS Definition

Polynomials Emanuele Viola Columbia University work partially done at IAS and Harvard University December 2007.

Time Complexity of Algorithms (Asymptotic Notations)

THE FUNDAMENTAL THEOREM OF ALGEBRA. Descartes’ Rule of Signs If f(x) is a polynomial function with real coefficients, then *The number of positive real.

1 How to establish NP-hardness Lemma: If L 1 is NP-hard and L 1 ≤ L 2 then L 2 is NP-hard.

Chapter 7 The Laplace Transform

Norms, XOR lemmas, and lower bounds for GF(2) polynomials and multiparty protocols Emanuele Viola, IAS (Work partially done during postdoc at Harvard)

Forrelation: A Problem that Optimally Separates Quantum from Classical Computing.

Happy 60 th B’day Noga. Elementary problems encoding computational hardness Avi Wigderson IAS, Princeton or Some problems Noga never solved.

Elusive Functions, and Lower Bounds for Arithmetic Circuits Ran Raz Weizmann Institute.

4.2 Area Definition of Sigma Notation = 14.

Lower Bounds using shifted partials Chandan Saha Indian Institute of Science Workshop on Algebraic Complexity Theory 2016 Tel-Aviv University.

Characterizing Propositional Proofs as Non-Commutative Formulas Iddo Tzameret Royal Holloway, University of London Based on Joint work Fu Li (Texas Austin)

1 SAT SAT: Given a Boolean function in CNF representation, is there a way to assign truth values to the variables so that the function evaluates to true?

Recent Developments in Algebraic & Proof Complexity Recent Developments in Algebraic & Proof Complexity Iddo Tzameret Tsinghua Univ. Based on Hrubes and.

1 IAS, Princeton ASCR, Prague. The Problem How to solve it by hand ? Use the polynomial-ring axioms ! associativity, commutativity, distributivity, 0/1-elements.

Tali Kaufman (Bar-Ilan)

Iddo Tzameret Tel Aviv University

Algebraic Proofs over Noncommutative Formulas

INTEGRATION & TECHNIQUES OF INTEGRATION

Information Complexity Lower Bounds

Advanced Engineering Mathematics 6th Edition, Concise Edition

Negation-Limited Formulas

Polynomial Norms Amir Ali Ahmadi (Princeton University) Georgina Hall

Circuit Lower Bounds A combinatorial approach to P vs NP

Matrix PI-algebras and Lower Bounds on Arithmetic Proofs (work in progress) Iddo Tzameret Joint work with Fu Li Tsinghua University.

Algorithm : Design & Analysis [4]

Tight Fourier Tails for AC0 Circuits

Polynomial Optimization over the Unit Sphere

The Curve Merger (Dvir & Widgerson, 2008)

CSE838 Lecture notes copy right: Moon Jung Chung

Linear Algebra in Weak Formal Theories of Arithmetic

Do Now: Aim: How do we work with polynomials?

Presentation transcript:

Lower bounds for small depth arithmetic circuits Chandan Saha Joint work with Neeraj Kayal (MSRI) Nutan Limaye (IITB) Srikanth Srinivasan (IITB)

Arithmetic Circuit: A model of computation + xxxx ++++ xxxx …. ….. x1x1 x2x2 x n-1 xnxn f(x 1, x 2, …, x n )--> multivariate polynomial in x 1, …, x n x g h gh + g h g+h Product gate Sum gate There are `field constants’ on the wires

Arithmetic Circuit: A model of computation + xxxx ++++ xxxx …. ….. x1x1 x2x2 x n-1 xnxn f(x 1, x 2, …, x n ) Depth = 4

Arithmetic Circuit: A model of computation + xxxx ++++ xxxx …. ….. x1x1 x2x2 x n-1 xnxn f(x 1, x 2, …, x n ) Size = no. of gates and wires

The lower bound question Is there an explicit family of n-variate, poly(n) degree polynomials {f n } that requires… …super-polynomial in n circuit size ?

The lower bound question Is there an explicit family of n-variate, poly(n) degree polynomials {f n } that requires… …super-polynomial in n circuit size ? Note : A random polynomial has super-poly(n) circuit size

The Permanent – an explicit family Perm n = ∑ ∏ x i σ(i) σ є S n i є [n]

The Permanent – an explicit family Degree of Perm n is low. i.e. bounded by poly(n) Perm n = ∑ ∏ x i σ(i) σ є S n i є [n]

The Permanent – an explicit family Degree of Perm n is low. Coefficient of any given monomial can be found efficiently. …given a monomial, there’s a poly-time algorithm to determine the coefficient of the monomial. Perm n = ∑ ∏ x i σ(i) σ є S n i є [n]

The Permanent – an explicit family Degree of Perm n is low. Coefficient of any given monomial can be found efficiently. These two properties characterize explicitness Perm n = ∑ ∏ x i σ(i) σ є S n i є [n]

The Permanent – an explicit family Degree of Perm n is low. Coefficient of any given monomial can be found efficiently. Define class VNP Perm n = ∑ ∏ x i σ(i) σ є S n i є [n]

The Permanent – an explicit family Degree of Perm n is low. Coefficient of any given monomial can be found efficiently. Define class VNP Perm n = ∑ ∏ x i σ(i) σ є S n i є [n] Class VP: Contains families of low degree polynomials {f n } that can be computed by poly(n)-size circuits.

The Permanent – an explicit family Degree of Perm n is low. Coefficient of any given monomial can be found efficiently. Perm n = ∑ ∏ x i σ(i) σ є S n i є [n] VP vs VNP: Does Perm n family require super-poly(n) size circuits?

A strategy for proving arithmetic circuit lower bound Step 1: Depth reduction Step 2: Lower bound for small depth circuits

A strategy for proving arithmetic circuit lower bound Step 1: Depth reduction Step 2: Lower bound for small depth circuits

Notations and Terminologies Notations: n = no. of variables in f n d = degree bound on f n = n O(1) Homogeneous polynomial: A polynomial is homogeneous if all its monomials have the same degree (say, d). Homogeneous circuits: A circuit is homogeneous if every gate outputs/computes a homogeneous polynomial. Multilinear polynomial: In every monomial, degree of every variable is at most 1.

Reduction to depth ≈ log d Valiant, Skyum, Berkowitz, Rackoff (1983). Homogeneous, degree d, f n computed by poly(n) circuit f n computed by homogeneous poly(n) circuit of depth O(log d) arbitrary depth ≈ log d poly(n)

Reduction to depth 4 Agrawal, Vinay (2008); Koiran (2010); Tavenas (2013). Homogeneous, degree d, f n computed by poly(n) circuit f n computed by homogeneous depth 4 circuit of size n O(√d) ≈ log d 4 n O(√d) poly(n)

Reduction to depth 4 Agrawal, Vinay (2008); Koiran (2010); Tavenas (2013). Homogeneous, degree d, f n computed by poly(n) circuit f n computed by homogeneous depth 4 circuit of size n O(√d) ≈ log d 4 n O(√d) poly(n) … f n can have n O(d) monomials !

A depth 4 circuit + xxxx ++++ xxxx …. ….. x1x1 x2x2 x n-1 xnxn ∑ ∏ ∑ ∏

A depth 4 circuit + xxxx ++++ xxxx …. ….. x1x1 x2x2 x n-1 xnxn ∑ ∏ Q ij ij sum of monomials Q ij

Reduction to depth 3 Gupta, Kamath, Kayal, Saptharishi (2013); Tavenas (2013). Homogeneous, degree d, f n computed by poly(n) circuit f n computed by depth 3 circuit of size n O(√d) 3 n O(√d) 4

Reduction to depth 3 Gupta, Kamath, Kayal, Saptharishi (2013); Tavenas (2013). Homogeneous, degree d, f n computed by poly(n) circuit f n computed by depth 3 circuit of size n O(√d) 3 n O(√d) 4 not homogeneous!

A depth 3 circuit + xxxx ++++ …. x1x1 x2x2 x n-1 xnxn ∑ ∏ l ij ij linear polynomial l ij bottom fanin

Implication of the depth reductions Let {f n } be an explicit family of polynomials. if f n takes n ω(√d) size homogeneous if f n takes n ω(√d) size VP ≠ VNP or 4 3

A strategy for proving arithmetic circuit lower bound Step 1: Depth reduction Step 2: Lower bound for small depth circuits

Lower bound for homogeneous depth 4 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… …any homogeneous depth-4 circuit computing f n has size n Ω(√d) size = n Ω(√d) 4 fnfn

Lower bound for homogeneous depth 4 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… …any homogeneous depth-4 circuit computing f n has size n Ω(√d) size = n Ω(√d) 4 fnfn f n = i ∑ ∏ Q ij … has size n Ω(√d) j sum of monomials

Lower bound for homogeneous depth 4 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… …any homogeneous depth-4 circuit computing f n has size n Ω(√d) size = n Ω(√d) 4 fnfn …joint work with Kayal, Limaye, Srinivasan

Lower bound for homogeneous depth 4 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… …any homogeneous depth-4 circuit computing f n has size n Ω(√d) size = n Ω(√d) 4 fnfn …the technique appears to be using homogeneity crucially

Lower bound for depth 3 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… any depth-3 circuit (bottom fanin ≤ √d) computing f n has size n Ω(√d) size = n Ω(√d) 3 fnfn

Lower bound for depth 3 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… any depth-3 circuit (bottom fanin ≤ √d) computing f n has size n Ω(√d) size = n Ω(√d) 3 fnfn needn’t be homogeneous

Lower bound for depth 3 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… any depth-3 circuit (bottom fanin ≤ √d) computing f n has size n Ω(√d) size = n Ω(√d) 3 fnfn Note: Even for bottom fanin ≤ √d, depth-3 circuits n ω(√d) VP ≠ VNP

Lower bound for depth 3 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… any depth-3 circuit (bottom fanin ≤ t) computing f n has size n Ω(d/t) size = n Ω(d/t) 3 fnfn …joint work with Kayal

Lower bound for depth 3 Theorem: There is a family of homogeneous polynomials {f n } in VNP (with deg f n = d) such that… any depth-3 circuit (bottom fanin ≤ t) computing f n has size n Ω(d/t) size = n Ω(d/t) 3 fnfn … answers a question by Shpilka & Wigderson (1999)

Proof ideas

Homogeneous depth-4 lower bound

Complexity measure A measure is a function μ: F[x 1, …, x n ] -> R. We wish to find a measure μ such that 1.If C is a circuit (say, a depth 4 circuit) then μ(C) ≤ s. “small quantity”, where s = size(C) 2.For an “explicit” polynomial f n, μ(f n ) ≥ “large quantity” Implication: If C = f n then s ≥ “large quantity” “small quantity”  Upper bound  Lower bound

Some complexity measures Measure Model Partial derivatives (Nisan & Wigderson) homogeneous depth-3 circuits Evaluation dimension (Raz) multilinear formulas Hessian (Mignon & Ressayre) determinantal complexity permanent Jacobian (Agrawal et. al.) occur-k, depth-4 circuits Incomplete list ?

Some complexity measures Measure Model Partial derivatives (Nisan & Wigderson) homogeneous depth-3 circuits Evaluation dimension (Raz) multilinear formulas Hessian (Mignon & Ressayre) determinantal complexity permanent Jacobian (Agrawal et. al.) occur-k, depth-4 circuits Shifted partials (Kayal; Gupta et. al.) homog. depth-4 with low bottom fanin Projected shifted partials homogeneous depth-4 circuits; depth-3 circuits (with low bottom fanin)

Space of Partial Derivatives Notations: ∂ =k f : Set of all k th order derivatives of f(x 1, …, x n ) : The vector space spanned by F-linear combinations of polynomials in S Definition: PD k (f) = dim( ) Sub-additive property: PD k (f 1 + f 2 ) ≤ PD k (f 1 ) + PD k (f 2 )

Space of Shifted Partials Notation: x =ℓ = Set of all monomials of degree ℓ Definition: SP k,ℓ (f) := dim ( ) Sub-additivity: SP k,ℓ (f 1 + f 2 ) ≤ SP k,ℓ (f 1 ) + SP k,ℓ (f 2 )

Space of Shifted Partials Notation: x =ℓ = Set of all monomials of degree ℓ Definition: SP k,ℓ (f) := dim ( ) Sub-additivity: SP k,ℓ (f 1 + f 2 ) ≤ SP k,ℓ (f 1 ) + SP k,ℓ (f 2 ) Why do we expect SP(C) to be small ?

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Q ij = Sum of monomials

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Observation: ∂ =k Q i1 …Q im has “many roots” if k << m << n … any common root of Q i1 …Q im is also a common root of ∂ =k Q i1 …Q im

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Observation: Dimension of the variety of ∂ =k Q i1 …Q im is large if k << m << n

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Observation: Dimension of the variety of ∂ =k Q i1 …Q im is large if k << m << n [Hilbert’s] Theorem (informal): If dimension of the variety of {g} is large then dim ( ) is small.

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Observation: Dimension of the variety of ∂ =k Q i1 …Q im is large if k << m << n [Hilbert’s] Theorem (informal): If dimension of the variety of {g} is large then dim ( ) is small. … so we expect SP k,ℓ (Q i1 …Q im ) to be a `small quantity’

Shifted partials – the intuition C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Observation: Dimension of the variety of ∂ =k Q i1 …Q im is large if k << m << n [Hilbert’s] Theorem (informal): If dimension of the variety of {g} is large then dim ( ) is small. … by subadditivity, SP k,ℓ (C) ≤ s. `small quantity’

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Q ij = Sum of monomials of degree ≤ t (w.l.o.g m ≤ 2d/t )

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … degree ≤ k.t

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … at most ( ) terms m k

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … u. ∂ =k Q i1 …Q im = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … X degree = ℓ degree ≤ ℓ + k.t

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … u. ∂ =k Q i1 …Q im = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … X n + ℓ + kt n m k SP k,ℓ (Q i1 …Q im ) ≤ ( ). ( )

Depth-4 with low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … u. ∂ =k Q i1 …Q im = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … X n + ℓ + kt n m k SP k,ℓ (C) ≤ s. ( ). ( )  Upper bound

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm (homog. depth 4) Q ij = Sum of monomials (NO degree restriction)

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Idea: Reduce to the case of low bottom degree using Random restriction Multilinear projection

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Random restriction: Set every variable to zero independently at random with a certain probability. …denoted naturally by a map σ

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Random restriction: Set every variable to zero independently at random with a certain probability. …denoted naturally by a map σ σ(C) = σ(Q 11 ) σ(Q 12 )…σ(Q 1m ) + … + σ(Q s1 ) σ(Q s2 )…σ(Q sm ) Obs: If a monomial u has many variables (high support) then σ(u) = 0 w.h.p

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Random restriction: Set every variable to zero independently at random with a certain probability. …denoted naturally by a map σ σ(C) = σ(Q 11 ) σ(Q 12 )…σ(Q 1m ) + … + σ(Q s1 ) σ(Q s2 )…σ(Q sm ) w.l.o.g σ(Q ij ) = sum of ‘low support’ monomials

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Random restriction: Set every variable to zero independently at random with a certain probability. Homogeneous depth 4  homogenous depth 4 with low bottom support … w.l.o.g assume that C has low bottom support

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Projection map: π (g) = sum of the multilinear monomials in g

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Projection map: π (g) = sum of the multilinear monomials in g Observation: π (sum of ‘low support’ monomials) = sum of ‘low degree’ monomials

Reduction to low bottom degree C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Projection map: π (g) = sum of the multilinear monomials in g Observation: π (Q ij ) = sum of ‘low degree’ monomials

Projected Shifted Partials PSP k,ℓ (f) := dim (π (x =ℓ. ∂ =k f) ) (obeys subadditivity)

Projected Shifted Partials PSP k,ℓ (f) := dim (π (x =ℓ. ∂ =k f) ) (obeys subadditivity) multilinear shifts only!

Projected Shifted Partials PSP k,ℓ (f) := dim (π (x =ℓ. ∂ =k f) ) (obeys subadditivity) multilinear derivatives!

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm support of every monomial bounded by t

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Every variable in every monomial has degree 2 or less

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Every monomial has a variable with degree 3 or more

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Q i1 Q i2 …Q im = Q’ i1 Q’ i2 …Q’ im + Every monomial has a variable with degree 3 or more

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Q i1 Q i2 …Q im = Q’ i1 Q’ i2 …Q’ im + PSP k,ℓ (Q i1 Q i2 …Q im ) ≤ PSP k,ℓ (Q’ i1 Q’ i2 …Q’ im ) + PSP k,ℓ ( )

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Q i1 Q i2 …Q im = Q’ i1 Q’ i2 …Q’ im + PSP k,ℓ (Q i1 Q i2 …Q im ) ≤ PSP k,ℓ (Q’ i1 Q’ i2 …Q’ im ) + PSP k,ℓ ( ) 0

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Q i1 Q i2 …Q im = Q’ i1 Q’ i2 …Q’ im + PSP k,ℓ (Q i1 Q i2 …Q im ) ≤ PSP k,ℓ (Q’ i1 Q’ i2 …Q’ im ) + PSP k,ℓ ( ) 0 degree ≤ 2t

Depth-4 with low bottom support C = Q 11 Q 12 …Q 1m + … + Q s1 Q s2 …Q sm Q ij = Q’ ij + Q i1 Q i2 …Q im = Q’ i1 Q’ i2 …Q’ im + PSP k,ℓ (Q i1 Q i2 …Q im ) ≤ PSP k,ℓ (Q’ i1 Q’ i2 …Q’ im ) Abusing notation: Call Q’ ij as Q ij

Depth-4 with low bottom support ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … degree ≤ 2kt

Depth-4 with low bottom support ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … u. ∂ =k Q i1 …Q im = u. Q i k+1 … Q im + u. Q i1 Q i k+2 … Q im + X degree = ℓ degree ≤ 2kt

Depth-4 with low bottom support ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … π(u.∂ =k Q i1 …Q im ) = π( Q i k+1 … Q im ) + π( Q i1 Q i k+2 … Q im ) + X multilinear, degree ≤ ℓ + 2k.t

Depth-4 with low bottom support ∂ =k Q i1 …Q im = Q i1 Q i2 …Q ik …Q im + Q i1 Q i2 …Q ik Q i k+1 …Q im + … X = Q i k+1 … Q im + Q i1 Q i k+2 … Q im + … π(u.∂ =k Q i1 …Q im ) = π( Q i k+1 … Q im ) + π( Q i1 Q i k+2 … Q im ) + X  Upper bound ℓ + 2kt n m k SP k,ℓ (C) ≤ s. ( ). ( )

How large can PSP(f) be? Trivially, PSP k,ℓ (f) ≤ min { ( ).( ), ( ) } n k n ℓ n ℓ + d - k

How large can PSP(f) be? Trivially, PSP k,ℓ (f) ≤ min { ( ).( ), ( ) } n k n ℓ n ℓ + d - k Size of the set { x =ℓ. ∂ =k f } ≤ ( ).( ) Number of monomials in any polynomial in π (x =ℓ. ∂ =k f) ≤ ( ) n k n ℓ n ℓ + d - k Let f be a multilinear polynomial

How large can PSP(f) be? Trivially, PSP k,ℓ (f) ≤ min { ( ).( ), ( ) } Best lower bound for s s ≥ n k n ℓ n ℓ + d - k min {( ).( ), ( )} ( ).( ) m k n ℓ + 2kt n k n ℓ n ℓ + d - k = n Ω(d/t) After setting k and ℓ appropriately

How large can PSP(f) be? Trivially, PSP k,ℓ (f) ≤ min { ( ).( ), ( ) } Best lower bound for s s ≥ There’s an explicit f such that PSP k,ℓ (f) is close to the trivial upper bound.  (lower bound) n k n ℓ n ℓ + d - k min {( ).( ), ( )} ( ).( ) m k n ℓ + 2kt n k n ℓ n ℓ + d - k = n Ω(d/t)

Depth-3 lower bound

Trading depth for homogeneity Idea: Depth-3 with low bottom fanin Homogeneous depth-4 with low bottom support Size = s Bottom fanin = t 3 fnfn 4 (homogeneous) fnfn Size = s. 2 O(√d) Bottom support = t

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) C = α 1.(1 + l 11 )(1 + l 12 )…(1 + l 1m ) + …. + α s.(1 + l s1 )(1 + l s2 )…(1 + l sm ) linear forms field constants

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) C = (1 + l 11 )(1 + l 12 )…(1 + l 1m ) + …. + (1 + l s1 )(1 + l s2 )…(1 + l sm ) Notation: [g] d = d-th homogeneous part of g Easy observation: If C = f, which is homogeneous deg d polynomial, then [C] d = f.

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) C = (1 + l 11 )(1 + l 12 )…(1 + l 1m ) + …. + (1 + l s1 )(1 + l s2 )…(1 + l sm ) [C] d = [(1 + l 11 )(1 + l 12 )…(1 + l 1m )] d +….+ [(1 + l s1 )(1 + l s2 )…(1 + l sm )] d idea: transform these to homogeneous depth-4

Newton’s identities E d (y 1, y 2, …, y m ) := ∑ ∏ y j P r (y 1, y 2, …, y m ) := ∑ y j r S in 2 [m] |S| = d j in S (elementary symmetric polynomial of degree d) j in [m] (power symmetric polynomial of degree r)

Newton’s identities E d (y 1, y 2, …, y m ) := ∑ ∏ y j P r (y 1, y 2, …, y m ) := ∑ y j r S in 2 [m] |S| = d j in S j in [m] Lemma: E d (y) = ∑ β a ∏ P r (y) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar e.g. 2y 1 y 2 = (y 1 + y 2 ) 2 – y 1 2 – y 2 2 = P 1 2 – P 2 field constant

Newton’s identities E d (y 1, y 2, …, y m ) := ∑ ∏ y j P r (y 1, y 2, …, y m ) := ∑ y j r S in 2 [m] |S| = d j in S j in [m] Lemma: E d (y) = ∑ β a ∏ P r (y) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar Hardy-Ramanujan estimate: The number of a = (a 1, …, a d ) such that ∑ r.a r = d is 2 O(√d)

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) [(1 + l i1 )(1 + l i2 )…(1 + l im )] d = E d ( l i1, …, l im ) = ∑ β a ∏ P r ( l i1, …, l im ) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar 2 O(√d) summands

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) [(1 + l i1 )(1 + l i2 )…(1 + l im )] d = E d ( l i1, …, l im ) = ∑ β a ∏ P r ( l i1, …, l im ) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar 2 O(√d) summands Suppose every l ij has at most t variables, then…

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) [(1 + l i1 )(1 + l i2 )…(1 + l im )] d = E d ( l i1, …, l im ) = ∑ β a ∏ P r ( l i1, …, l im ) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar = ∑ β a ∏ Q i,a,r a = (a 1, …, a d ) ∑ r. a r = d r in [d] every monomial has support ≤ t

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) [(1 + l i1 )(1 + l i2 )…(1 + l im )] d = E d ( l i1, …, l im ) = ∑ β a ∏ P r ( l i1, …, l im ) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar = ∑ β a ∏ Q i,a,r a = (a 1, …, a d ) ∑ r. a r = d r in [d] [C] d = ∑ ∑ β a ∏ Q i,a,r a = (a 1, …, a d ) ∑ r. a r = d r in [d] i in [s]

Depth-3 to Depth-4 Implicit in Shpilka & Wigderson ; Hrubes & Yehudayoff (2011) [(1 + l i1 )(1 + l i2 )…(1 + l im )] d = E d ( l i1, …, l im ) = ∑ β a ∏ P r ( l i1, …, l im ) a = (a 1, …, a d ) ∑ r. a r = d r in [d] arar = ∑ β a ∏ Q i,a,r a = (a 1, …, a d ) ∑ r. a r = d r in [d] [C] d = ∑ ∑ β a ∏ Q i,a,r a = (a 1, …, a d ) ∑ r. a r = d r in [d] i in [s] Homogeneous depth-4 with low bottom support and size s.2 Ω(√d)

An explicit family with high PSP k,ℓ

An explicit family of polynomials Nisan-Wigderson family of polynomials: NW r := ∑ ∏ x i, h(i) d2d2 h(z) in F [z], deg(h) ≤ r i in [d] identifying the elements of F with {1,2, …, d 2 } d2d2

An explicit family of polynomials Nisan-Wigderson family of polynomials: NW r := ∑ ∏ x i, h(i) d2d2 h(z) in F [z], deg(h) ≤ r i in [d] `Disjointness’ property: Two monomials can share at most r ≈ d/3 variables. = + + … d r r d 2(r+1) monomials

Projected Shifted Partials of NW r The set π (x =ℓ. ∂ =k NW r ) has ( ).( ) elements. Every polynomial in π (x =ℓ. ∂ =k NW r ) is multilinear & homogeneous of degree (ℓ + d – k). n k n ℓ

Projected Shifted Partials of NW r The set π (x =ℓ. ∂ =k NW r ) has ( ).( ) elements. Every polynomial in π (x =ℓ. ∂ =k NW r ) is multilinear & homogeneous of degree (ℓ + d – k). PSP k,ℓ (NW r ) = rank (M) n k n ℓ M := ( ).( ) rows π (x =ℓ. ∂ =k NW r ) (0/1)-matrix of coefficients n ℓ + d - k ( ) columns n k n ℓ

Projected Shifted Partials of NW r Because of the `disjointness property’ of NW r, the columns of M are almost orthogonal. Hence, B := M T M is diagonally dominant. Observe, rank (M) ≥ rank (B).

Projected Shifted Partials of NW r Because of the `disjointness property’ of NW r, the columns of M are almost orthogonal. Hence, B := M T M is diagonally dominant. Observe, rank (M) ≥ rank (B). Alon’s rank bound (for diagonally dominant matrix): If B is a real symmetric matrix then rank (B) ≥ Tr (B) 2 Tr (B 2 )

Projected Shifted Partials of NW r [Main lemma]: Using Alon’s bound and settings r, k and ℓ appropriately, PSP k,ℓ (NW r ) ≥ η. min {( ).( ), ( )} n k n ℓ n ℓ + d - k small factor

An explicit family in VP [Kumar-Saraf (2014)] : Showed the same lower bound using the Iterated Matrix multiplication polynomial, which is in VP

An explicit family in VP [Kumar-Saraf (2014)] : Showed the same lower bound using the Iterated Matrix multiplication polynomial, which is in VP VNP Circuits (VP) ABPs Formulas Depth-4 exponential separation

An explicit family in VP [Kumar-Saraf (2014)] : Showed the same lower bound using the Iterated Matrix multiplication polynomial, which is in VP VNP Circuits (VP) ABPs Formulas Open: separation ? …known in the multilinear setting [Dvir, Malod, Perifel, Yehudayoff (2012)]

An explicit family in VP [Kumar-Saraf (2014)] : Showed the same lower bound using the Iterated Matrix multiplication polynomial, which is in VP VNP Circuits (VP) ABPs Formulas Open: separation ? …improve n Ω(√d) to n ω(√d)

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits (i.e. without the low bottom fanin restriction).

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits. 2.Prove a n Ω(√d) lower bound for homogeneous depth-5 circuits. [open problem in Nisan & Wigderson (1996)] (2)  (1)

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits. 2.Prove a n Ω(√d) lower bound for homogeneous depth-5 circuits. 3.Prove a n Ω(d) lower bound for multilinear depth-3 circuits. (current best is 2 Ω(d) ) …interestingly, one can get this using PSP measure

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits. 2.Prove a n Ω(√d) lower bound for homogeneous depth-5 circuits. 3.Prove a n Ω(d) lower bound for multilinear depth-3 circuits. 4.A separation between homogeneous formulas and homogeneous depth-4 formulas.

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits. 2.Prove a n Ω(√d) lower bound for homogeneous depth-5 circuits. 3.Prove a n Ω(d) lower bound for multilinear depth-3 circuits. 4.A separation between homogeneous formulas and homogeneous depth-4 formulas. 5.A separation between homogeneous formulas and multilinear homogeneous formulas. …exhibiting the power of non-multilinearity

Some other open questions 1.Prove a n Ω(√d) lower bound for general depth-3 circuits. 2.Prove a n Ω(√d) lower bound for homogeneous depth-5 circuits. 3.Prove a n Ω(d) lower bound for multilinear depth-3 circuits. 4.A separation between homogeneous formulas and homogeneous depth-4 formulas. 5.A separation between homogeneous formulas and multilinear homogeneous formulas. Thanks!