Why almost all k-colorable graphs are easy A. Coja-Oghlan, M. Krivelevich, D. Vilenchik.

Slides:



Advertisements
Similar presentations
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Advertisements

Inapproximability of Hypergraph Vertex-Cover. A k-uniform hypergraph H= : V – a set of vertices E - a collection of k-element subsets of V Example: k=3.
On the Density of a Graph and its Blowup Raphael Yuster Joint work with Asaf Shapira.
Approximation, Chance and Networks Lecture Notes BISS 2005, Bertinoro March Alessandro Panconesi University La Sapienza of Rome.
Gibbs sampler - simple properties It’s not hard to show that this MC chain is aperiodic. Often is reversible distribution. If in addition the chain is.
Heuristics for the Hidden Clique Problem Robert Krauthgamer (IBM Almaden) Joint work with Uri Feige (Weizmann)
Lecture 22: April 18 Probabilistic Method. Why Randomness? Probabilistic method: Proving the existence of an object satisfying certain properties without.
Generating Hard Satisfiability Problems1 Bart Selman, David Mitchell, Hector J. Levesque Presented by Xiaoxin Yin.
The Theory of NP-Completeness
Approximating the Domatic Number Feige, Halldorsson, Kortsarz, Srinivasan ACM Symp. on Theory of Computing, pages , 2000.
Fast FAST By Noga Alon, Daniel Lokshtanov And Saket Saurabh Presentation by Gil Einziger.
1 List Coloring and Euclidean Ramsey Theory TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A Noga Alon, Tel Aviv.
Message Passing on Planted Models: What do we know? Why do we care? Elchanan Mossel Joint works with: 1. Uri Feige and Danny Vilenchik 2. Amin Coja-Oghlan.
Phase Transitions of PP-Complete Satisfiability Problems D. Bailey, V. Dalmau, Ph.G. Kolaitis Computer Science Department UC Santa Cruz.
Testing of ‘massively parametrized problems’ - Ilan Newman Haifa University Based on joint work with: Sourav Chakraborty, Eldar Fischer, Shirley Halevi,
Beating Brute Force Search for Formula SAT and QBF SAT Rahul Santhanam University of Edinburgh.
Coloring the edges of a random graph without a monochromatic giant component Reto Spöhel (joint with Angelika Steger and Henning Thomas) TexPoint fonts.
NP-Complete Problems Reading Material: Chapter 10 Sections 1, 2, 3, and 4 only.
Testing of Clustering Noga Alon, Seannie Dar Michal Parnas, Dana Ron.
NP-Complete Problems Problems in Computer Science are classified into
Testing Metric Properties Michal Parnas and Dana Ron.
It’s all about the support: a new perspective on the satisfiability problem Danny Vilenchik.
Message Passing for the Coloring Problem: Gallager Meets Alon and Kahale Sonny Ben-Shimon and Dan Vilenchik Tel Aviv University AofA June, 2007 TexPoint.
Analysis of Algorithms CS 477/677
Technion 1 Generating minimum transitivity constraints in P-time for deciding Equality Logic Ofer Strichman and Mirron Rozanov Technion, Haifa, Israel.
Why almost all k-colorable graphs are easy ? A. Coja-Oghlan, M. Krivelevich, D. Vilenchik.
1 On the Benefits of Adaptivity in Property Testing of Dense Graphs Joint work with Mira Gonen Dana Ron Tel-Aviv University.
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
Job Scheduling Lecture 19: March 19. Job Scheduling: Unrelated Multiple Machines There are n jobs, each job has: a processing time p(i,j) (the time to.
Coloring Algorithms and Networks. Coloring2 Graph coloring Vertex coloring: –Function f: V  C, such that for all {v,w}  E: f(v)  f(w) Chromatic number.
1 Separator Theorems for Planar Graphs Presented by Shira Zucker.
Finding a maximum independent set in a sparse random graph Uriel Feige and Eran Ofek.
1 Joint work with Shmuel Safra. 2 Motivation 3 Motivation.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
Fixed Parameter Complexity Algorithms and Networks.
1 The Theory of NP-Completeness 2012/11/6 P: the class of problems which can be solved by a deterministic polynomial algorithm. NP : the class of decision.
The Effect of Induced Subgraphs on Quasi-randomness Asaf Shapira & Raphael Yuster.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Approximating the Minimum Degree Spanning Tree to within One from the Optimal Degree R 陳建霖 R 宋彥朋 B 楊鈞羽 R 郭慶徵 R
Online Vertex-Coloring Games in Random Graphs Reto Spöhel (joint work with Martin Marciniszyn; appeared at SODA ’07)
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
Satisfiable k -CNF formulas above the threshold Danny Vilenchik.
1 The Theory of NP-Completeness 2 Cook ’ s Theorem (1971) Prof. Cook Toronto U. Receiving Turing Award (1982) Discussing difficult problems: worst case.
Testing the independence number of hypergraphs
Graph Colouring L09: Oct 10. This Lecture Graph coloring is another important problem in graph theory. It also has many applications, including the famous.
1/19 Minimizing weighted completion time with precedence constraints Nikhil Bansal (IBM) Subhash Khot (NYU)
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
CS 3343: Analysis of Algorithms Lecture 25: P and NP Some slides courtesy of Carola Wenk.
Artur Czumaj DIMAP DIMAP (Centre for Discrete Maths and it Applications) Computer Science & Department of Computer Science University of Warwick Testing.
NP Completeness Piyush Kumar. Today Reductions Proving Lower Bounds revisited Decision and Optimization Problems SAT and 3-SAT P Vs NP Dealing with NP-Complete.
CSC 413/513: Intro to Algorithms
Why almost all satisfiable k - CNF formulas are easy? Danny Vilenchik Joint work with A. Coja-Oghlan and M. Krivelevich.
1 Distributed Vertex Coloring. 2 Vertex Coloring: each vertex is assigned a color.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
1 The Theory of NP-Completeness 2 Review: Finding lower bound by problem transformation Problem X reduces to problem Y (X  Y ) iff X can be solved by.
The NP class. NP-completeness Lecture2. The NP-class The NP class is a class that contains all the problems that can be decided by a Non-Deterministic.
CHAPTER SIX T HE P ROBABILISTIC M ETHOD M1 Zhang Cong 2011/Nov/28.
Theory of Computational Complexity Probability and Computing Chapter Hikaru Inada Iwama and Ito lab M1.
ICS 353: Design and Analysis of Algorithms NP-Complete Problems King Fahd University of Petroleum & Minerals Information & Computer Science Department.
The NP class. NP-completeness
NP-Completeness (2) NP-Completeness Graphs 7/23/ :02 PM x x x x
From dense to sparse and back again: On testing graph properties (and some properties of Oded)
Path Coupling And Approximate Counting
ICS 353: Design and Analysis of Algorithms
On the effect of randomness on planted 3-coloring models
M. Krivelevich, D. Vilenchik SODA 2006
The Theory of NP-Completeness
Locality In Distributed Graph Algorithms
Integer and fractional packing of graph families
Contagious sets in random graphs
Presentation transcript:

Why almost all k-colorable graphs are easy A. Coja-Oghlan, M. Krivelevich, D. Vilenchik

Talk Outline Random graphs: phase transitions and clustering How do typical k -colorable graphs look? Message passing and clustering (SAT) Proof techniques – two approaches

The k -Coloring Problem Given a graph G =( V, E ): Find f : V ! [ k ] s.t. 8 ( u, v ) 2 E ( G ): f ( u )  f ( v ) Find f with minimal possible k Such k is called the chromatic number of G,  ( G ) E.g.  ( G )=

The k -Coloring Problem Finding a proper k -coloring is NP Hard No polynomial time algorithm approximates  ( G ) within factor better than n 1 -  (unless NP µ ZPP) [FK98] How to proceed? random models and average case analysis G n, p - every possible edge is included w.p. p = p ( n )  ( G n, p )= np / 2ln ( np ) for np 2 [ c 0, n / log 7 n ] [Bol88,Luc91]

Phase transitions and clustering Consider the variant G n, m of G n, p : Choose uniformly at random m = m ( n ) edges When, G n, m and G n, p are “close” There exists a constant d = d ( k ) such that 2m / n > d : almost all graphs in G n, m are not k -colorable 2m / n < d : almost all graphs are k -colorable [Fri99] Such phenomena is called a phase transition

Phase transitions and clustering G n, m with 2m / n just below the threshold is “hard” experimentally Possible explanation (partially non-rigorous) comes from statistical physics [MPWZ02] The “geometrical” structure of the space of proper k -colorings - the clustering phenomena Need to define notion of distance

Phase transitions and clustering Two k -colorings are the same if they differ only by a permutation of the color classes Two k -colorings ,  are at distance t if they disagree on the color of at least t vertices in every permutation of the color classes. There exists one permutation obtaining equality Similar to Hamming distance

Phase transitions and clustering G n, m with 2m / n just below the threshold: All colorings within a cluster are “close” A linear number of vertices are “frozen” Every two clusters are “far” from each other Exponentially many clusters based on analysis that uses partially- rigorous tools Proved rigorously for k -SAT, k ¸ 8 [AR06,MMZ05] For k -SAT: not believed to be true for small k, say k = 3 [MMW05]

Phase transitions and clustering Why does this structure make life hard? Heuristics get “distracted” by this structure Every cluster “pulls” in its direction Heuristics try to find a compromise between clusters This is impossible due to the structure Survey Propagation does well in practice [BMWZ05]

Random k-colorable graphs G n, m with 2m / n above the threshold – not suitable to study k -colorable graphs Instead, consider G n, m | { k -colorability } The uniform distribution over k -colorable graphs with exactly m edges Another possibility, the planted model G n, m, k Partition the vertex set into k color classes of size n / k Include m random edges that respect the coloring V1V1 V3V3 V2V2

Our Results Characterization of G n, m | { k –colorability } 2m / n = C k, C k a sufficiently large constant Using rigorous analysis we show that typically: Single cluster of proper k -colorings Size of the cluster is exponential in n (1-exp{-  ( C k )})n vertices are “frozen”

Our Results There exists a deterministic polynomial time algorithm that k -colors almost all k -colorable graphs with m > C k n edges. C k a sufficiently large constant. Rigorously complement results for sparse case: When clustering is simple – the problem is easy When clustering is “complicated” – the problem is harder (?) Almost all k -colorable graphs are easy !

Our Results Show that G n, m, k and G n, m | { k –colorability } share many structural properties (“close”) Justifying the somewhat unnatural usage of planted- solution models Alon-Kahale’s coloring algorithm [AK97] works for G n, m | { k –colorability } as well G n, m, k also has the same clustering structure

Our Results Our results also apply to the k -SAT setting Similar threshold and clustering phenomena are known/believed for k -SAT The planted and uniform SAT distributions are “close” Flaxman’s algorithm for planted 3CNF formulas works for the uniform setting Improving the exponential time algorithm for uniform satisfiable 3CNFs (only one known so far) Answering open research questions in [BBG02]

What was known so far? UniformPlanted Dist. Density Clustering phenomena Survey Propagation < d k Clustering Planted and Uniform “close” Alon and Kahale’s coloring algorithm [AK97] Alon and Kahale’s coloring algorithm [AK97] CkCk Alon-Kahale’s coloring algorithm [AK97] Planted and Uniform coincide [AK97] C k log n

What was known for SAT? UniformPlanted Dist. Density Clustering phenomena Survey Propagation [BMZ05] > d k Exponential time algorithm [Chen03] Planted and Uniform “close” Flaxman, k-opt Clustering Flaxman’s algorithm [Fla03] Version of k-opt [FV04] CkCk Majority vote works whp [BBG02] Planted and Uniform coincide Majority vote C k log n

Algorithmic Perspective Show that Alon and Kahale’s algorithm [AK97] works in the uniform case What is Alon and Kahale’s algorithm? Approximate a proper 3-coloring (spectral techniques) Refine the coloring – recoloring step Uncolor “suspicious” vertices G [ U ] – graph induced by uncolored vertices Exhaustively color G [ U ] according to G [ V \ U ] Outcome differs from planted on n /1000 vertices Outcome agrees on the core Core remains colored Every colored vertex agrees with planted Logarithmic size connected components

Algorithmic Perspective - SAT Show that Flaxman’s algorithm [Fla03] works in the uniform case What is Flaxman’s algorithm? Approximate a satisfying assignment (majority vote) Unassign “suspicious” variables G [ U ] – graph induced by unassigned variables Exhaustively satisfy G [ U ] according to G [ V \ U ]

SAT and Message Passing Warning Propagation: Given a 3CNF F – define Factor Graph G ( F ) Bipartite graph: V 1 = variables, V 2 = clauses ( x, C ) 2 E ( G ) iff x appears in C Two types of messages: C =( x Ç y Ç z ) C  x = 1 if y  C < 0 and z  C < 0; 0 otherwise x  C = (  x 2 C ’, C ’  C C ’  x ) – (  ¬ x 2 C ’’  C ’’  x )

SAT and Message Passing WP( F ) Initialize all messages C  x to 1/0 w.p. 0.5 Repeat until no message changes: Randomly order the edges of G ( F ) Evaluate all messages C  x Assign every x according to (  x 2 C ’ C ’  x ) – (  ¬ x 2 C ’’ C ’’  x ) Theorem [FMV06]: If F sampled according to Planted 3SAT p = d / n 2, d sufficiently large constant, then whp: WP converges after O( logn ) iterations Assigned variables agree with some satisfying assignment All but exp{-  ( d )} n variables are assigned Clauses of unassigned variables are “easy” to satisfy

SAT and Message Passing Our work implies – [FMV06] applies for the uniform SAT setting as well Reinforces the following thesis: When clustering is complicated ) formulas are hard ) sophisticated algorithms needed: Survey Propagation When clustering is simple ) formulas are easy ) naïve algorithms work: Warning Propagation

Clustering: Proof Techniques Recall, G n, m | { k -colorability } The uniform distribution over k -colorable graphs with exactly m edges Why more difficult than the planted distribution? Edges are not independent For starters, consider the planted distribution G n, p, k ( k = 3 ) V1V1 V3V3 V2V2

Proof Techniques – The Core Every vertex is expected to have d / 3 neighbors in every other color class ( d = np ) Claim 1: whp there is no subgraph H of G s.t. | V ( H )| d | H |/ 10 Claim 2: whp there are no two proper 3-colorings at distance greater than n / 100 d ¸ d 0, d 0 a sufficiently large constant

Proof Techniques – The Core Claim 3: Suppose that every vertex has the expected degree, and Claims 1 and 2 hold. Then the graph G is uniquely 3-colorable. Proof:  - the planted coloring. If not unique, 9 , dist( ,  )< n / 100 (Claim 1). U - set of disagreeing vertices.  v )  ( v ) ) v has d / 3 neighbors in U. |U| d | U |/ 6 – Contradicting Claim 2. V1V1 V3V3 V2V2

Proof Techniques – The Core This is whp the case when np > C k logn When np = O ( 1 ) – whp not the case Definition of Core H : v 2 H if v has at least np / 4 neighbors in G [ H ] in every other color class v has at most np / 10 neighbors outside of H. Claim 4: 9 Core H s.t. whp | H | ¸ (1-exp{-  ( np )}) n H is uniquely 3-colorable

Proof Techniques – The Core Corollary: (1-exp{-  ( np )}) n vertices are frozen in every proper 3-coloring Only one cluster of exponential size V1V1 V3V3 V2V2 V1V1 V2V2 V3V3

Moving to the Uniform Case A – a “bad” graph property (e.g. the graph has no big core)  – the expected number of proper k -colorings of random graph in the planted distribution Claim 5: Pr uniform [ A ] ·  ¢ Pr planted [ A ] Intuition: typically there are at most  ways to generate G in the planted model. Now use a union bound.

Moving to the Uniform Case A – “the graph has no big core” Claim 6: Pr planted [ A ] · e -exp{- C 1 } n Claim 7:  · e exp{- C 2 } n, C 2 > C 1 Corollary: Pr uniform [ A ] = o ( 1 ) There exists no proper 3-coloring w.r.t which there exists a big core

Algorithmic Perspective - Analysis Typically, uniform graphs have a big core Two more facts needed for the analysis: Claim 1 in the uniform case Logarithmic size components in G [ V \ H ] Both properties hold w.p. 1-1 / poly ( n ) in the planted model - cannot use “union bound” Solution: analyze directly the uniform distribution Difficulty: edges are strongly dependent Solution: careful, non-trivial, counting argument

Proof Techniques – a sampler Proof Technique - using the union bound: Fix a set U of t vertices, t · n /100 Fix a “bad” graph on the vertices in U Upper bound the probability of such graph Sum over all possible choices for U Claim 1: whp there is no subgraph H of G s.t. | V ( H )| C | H |/ 10 G n, m | { k –colorability }, m=Cn, C some large constant.

Proof Techniques – a sampler There are ways to choose the set U There are ways to fix the “bad” graph Pr[ the bad graph is a subgraph of G ] = If G was in G n, p then = p dt / 10 Recall G n, m | { k –colorability } In the uniform distribution, what is p ?

Proof Techniques – a sampler Let p be s.t. We shall Prove: Pr uniform [ fixed subgraph with r edges ] · ( 6p ) r Assume we did, then the probability of the claim not holding is at most The expected number of edges had the graph been sampled according to the planted distribution p=3m/n2p=3m/n2

Proof Techniques – a sampler Our goal: prove that Pr uniform [ fixed subgraph with r edges ] · ( 6p ) r Fix “bad” subgrpah B V 1 = G i contains B as subgraph V 2 =all k -colorable graphs with m edges G1G2...G1G2... H1H2...H1H2...

Proof Techniques – a sampler  1  – average degree of left side  2 – average degree of right side Double counting: | V 1 |  1 = | V 2 |  2 | V 1 | · | V 2 |(  2    1 ) We show a bipartite graph s.t. (  2    1 )=( 6p ) r ( G i, H j ) 2 E if G i can be obtained from H j using the following procedure: G1G2...G1G2... H1H2...H1H2...

Proof Techniques – a sampler Remove edges of B from G i, and replace them arbitrarily s.t. at least one k -coloring is respected Denote Observe that: Therefore,

Further Research Loose Rigorously analyze Survey Propagation on near- threshold formulas/graphs First step – analyze Survey Propagation on Planted instances Prove the near-threshold clustering phenomena Rigorously analyze message passing algorithms Analyze instances with an arbitrary constant (above the threshold) density