Convergent Message-Passing Algorithms for Inference over General Graphs with Convex Free Energies Tamir Hazan, Amnon Shashua School of Computer Science.

Slides:



Advertisements
Similar presentations
Mean-Field Theory and Its Applications In Computer Vision1 1.
Advertisements

Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London Tutorial at GDR (Optimisation Discrète, Graph Cuts.
Bayesian Belief Propagation
Discrete Optimization Lecture 4 – Part 3 M. Pawan Kumar Slides available online
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
Join-graph based cost-shifting Alexander Ihler, Natalia Flerova, Rina Dechter and Lars Otten University of California Irvine Introduction Mini-Bucket Elimination.
Section 3: Appendix BP as an Optimization Algorithm 1.
ICCV 2007 tutorial Part III Message-passing algorithms for energy minimization Vladimir Kolmogorov University College London.
Loopy Belief Propagation a summary. What is inference? Given: –Observabled variables Y –Hidden variables X –Some model of P(X,Y) We want to make some.
Introduction to Belief Propagation and its Generalizations. Max Welling Donald Bren School of Information and Computer and Science University of California.
Probabilistic Graph and Hypergraph Matching
Belief Propagation by Jakob Metzler. Outline Motivation Pearl’s BP Algorithm Turbo Codes Generalized Belief Propagation Free Energies.
Markov Networks.
Belief Propagation on Markov Random Fields Aggeliki Tsoli.
CS774. Markov Random Field : Theory and Application Lecture 04 Kyomin Jung KAIST Sep
Distributed Message Passing for Large Scale Graphical Models Alexander Schwing Tamir Hazan Marc Pollefeys Raquel Urtasun CVPR2011.
1 Graphical Models in Data Assimilation Problems Alexander Ihler UC Irvine Collaborators: Sergey Kirshner Andrew Robertson Padhraic Smyth.
Graph Algorithms: Minimum Spanning Tree We are given a weighted, undirected graph G = (V, E), with weight function w:
Convergent and Correct Message Passing Algorithms Nicholas Ruozzi and Sekhar Tatikonda Yale University TexPoint fonts used in EMF. Read the TexPoint manual.
Recent Development on Elimination Ordering Group 1.
Global Approximate Inference Eran Segal Weizmann Institute.
Genome evolution: a sequence-centric approach Lecture 6: Belief propagation.
A Trainable Graph Combination Scheme for Belief Propagation Kai Ju Liu New York University.
Genome Evolution. Amos Tanay 2009 Genome evolution: Lecture 8: Belief propagation.
Some Surprises in the Theory of Generalized Belief Propagation Jonathan Yedidia Mitsubishi Electric Research Labs (MERL) Collaborators: Bill Freeman (MIT)
CS774. Markov Random Field : Theory and Application Lecture 08 Kyomin Jung KAIST Sep
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
1 Structured Region Graphs: Morphing EP into GBP Max Welling Tom Minka Yee Whye Teh.
Belief Propagation. What is Belief Propagation (BP)? BP is a specific instance of a general class of methods that exist for approximate inference in Bayes.
Probabilistic Graphical Models
Physics Fluctuomatics / Applied Stochastic Process (Tohoku University) 1 Physical Fluctuomatics Applied Stochastic Process 9th Belief propagation Kazuyuki.
Fast Parallel and Adaptive Updates for Dual-Decomposition Solvers Ozgur Sumer, U. Chicago Umut Acar, MPI-SWS Alexander Ihler, UC Irvine Ramgopal Mettu,
Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London.
Survey Propagation. Outline Survey Propagation: an algorithm for satisfiability 1 – Warning Propagation – Belief Propagation – Survey Propagation Survey.
Tokyo Institute of Technology, Japan Yu Nishiyama and Sumio Watanabe Theoretical Analysis of Accuracy of Gaussian Belief Propagation.
Readings: K&F: 11.3, 11.5 Yedidia et al. paper from the class website
Probabilistic Inference Lecture 5 M. Pawan Kumar Slides available online
Physics Fluctuomatics (Tohoku University) 1 Physical Fluctuomatics 7th~10th Belief propagation Kazuyuki Tanaka Graduate School of Information Sciences,
Update any set S of nodes simultaneously with step-size We show fixed point update is monotone for · 1/|S| Covering Trees and Lower-bounds on Quadratic.
An Introduction to Variational Methods for Graphical Models
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
1 Mean Field and Variational Methods finishing off Graphical Models – Carlos Guestrin Carnegie Mellon University November 5 th, 2008 Readings: K&F:
Belief Propagation and its Generalizations Shane Oldenburger.
DISTIN: Distributed Inference and Optimization in WSNs A Message-Passing Perspective SCOM Team
Inference for Learning Belief Propagation. So far... Exact methods for submodular energies Approximations for non-submodular energies Move-making ( N_Variables.
Join-graph based cost-shifting Alexander Ihler, Natalia Flerova, Rina Dechter and Lars Otten University of California Irvine Introduction Mini-Bucket Elimination.
Eric Xing © Eric CMU, Machine Learning Algorithms and Theory of Approximate Inference Eric Xing Lecture 15, August 15, 2010 Reading:
@ 15/7/2003 Tokyo Institute of Technology 1 Propagating beliefs in spin- glass models Yoshiyuki Kabashima Dept. of Compt. Intel. & Syst.
Mean field approximation for CRF inference
Tightening LP Relaxations for MAP using Message-Passing David Sontag Joint work with Talya Meltzer, Amir Globerson, Tommi Jaakkola, and Yair Weiss.
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
ベーテ自由エネルギーに対するCCCPアルゴリズムの拡張
Free Energy Estimates of All-atom Protein Structures Using Generalized Belief Propagation Kamisetty H., Xing, E.P. and Langmead C.J. Raluca Gordan February.
Introduction of BP & TRW-S
Exact Inference Continued
An Investigation into Concurrent Expectation Propagation
Markov Networks.
Graduate School of Information Sciences Tohoku University, Japan
Generalized Belief Propagation
Exact Inference ..
Bucket Renormalization for Approximate Inference
≠ Particle-based Variational Inference for Continuous Systems
Physical Fluctuomatics 7th~10th Belief propagation
Algorithms and Theory of
Exact Inference Continued
Expectation-Maximization & Belief Propagation
Unifying Variational and GBP Learning Parameters of MNs EM for BNs
Readings: K&F: 11.3, 11.5 Yedidia et al. paper from the class website
Markov Networks.
Generalized Belief Propagation
Presentation transcript:

Convergent Message-Passing Algorithms for Inference over General Graphs with Convex Free Energies Tamir Hazan, Amnon Shashua School of Computer Science and Engineering Hebrew University of Jerusalem

2 Input: Graphical Models - Background Output:

3 Input: Output: Graphical Models - Background

4 Input: Output: Belief Propagation: Graphical Models - Background

5 BP - Variational Methods Bethe free energy: Approximating the marginals

6 BP - Variational Methods Stationary Bethe free energy = BP fixed points (Yedidia, Freeman, Weiss ’01) When the factor graph has no cycles Bethe is convex over the marginalization constraints and BP is exact. Bethe free energy: Approximating the marginals Factor graph has cycles: Bethe is non-convex and BP might not converge.

7 BP - Variational Methods TRW free energy (Wainwright, Jaakkola, Willsky ’02): Weighted number of spanning trees through an edge α TRW free energy is convex over the marginalization constraints. Convergent message passing algorithm for cliques of size 2 (Globerson & Jaakkola, ’07)

8 “convex free energy” is convex (over the marginalization constraints) for all factor graphs if there exists such that and Claim: (Pakzad and Anantharam ‘02, Heskes ’04, Weiss et al. ’07) Convex Free Energies

9 “convex free energy” is convex (over the marginalization constraints) for all factor graphs if there exists such that and Claim: (Pakzad and Anantharam ‘02, Heskes ’04, Weiss et al. ’06) Convex Free Energies

10 Background - Summary Bethe free energy TRW free energyconvex free energy non-convex for general graphs specific convexification of free energy general form of convexification with parameters Contributions: 1) Convergent message passing algorithm for convex free energy 2) Heuristic for choosing good convex free energy. Belief PropagationGloberson Jaakkola ’07 message passing algorithm for |α|=2 ? message passing algorithm for |α|≥2

11 strictly convex & differentiable Strictly convex & proper ( for some b) Convex Belief Propagation

12 strictly convex & differentiable Strictly convex & proper ( for some b) marginals of x i agree Convex Belief Propagation

13 strictly convex & differentiable Strictly convex & proper ( for some b) marginals of x i agree Whenever marginals of x i agree Otherwise

14 Output: For t=1,2,... For i=1,2,...,n Convex Message Passing * We also have a similar parallel message passing algorithm

15 Output: For t=1,2,... For i=1,2,...,n Convex Message Passing * We also have a similar parallel message passing algorithm

16 Primer on Fenchel Duality convex conjugate of h(b) concave conjugate of g(b) PrimalDual

17 Primer on Fenchel Duality More Conveniently:

18 Sequential message-passing Fenchel duality theorem

19 Generalized Fenchel duality theorem Sequential message-passing Fenchel duality theorem

20 Generalized Fenchel duality theorem Sequential message-passing Fenchel duality theorem Block update approach: Optimize Set

21 Generalized Fenchel duality theorem Sequential message-passing Fenchel duality theorem Block update approach: Optimize Set

22 Generalized Fenchel duality theorem Sequential message-passing Fenchel duality theorem Block update approach: Optimize Set

23 Generalized Fenchel duality theorem Sequential message-passing Fenchel duality theorem Algorithm: Repeat until convergence Output: Block update approach: Optimize Set

24 Sequential message-passing Bregman’s successive projection algorithm (Hildreth, Dykstra, Csiszar, Tseng) Output: For t=1,2,... For i=1,2,...,n

25 Convex Belief Propagation marginals of x i agree Output: For t=1,2,... For i=1,2,...,n

26 Convex Belief Propagation Our algorithm: Closed form solution for Belief propagation:

27 Convex Belief Propagation Our algorithm: Closed form solution for Belief propagation:

28 3) For general and (non-convex!) our algorithm 1) when f(),h i () are non-convex if the algorithm converges then it is to a stationary point. 2) Set (Bethe free energy, non-convex!) our message passing algorithm = belief propagation. Convex Belief Propagation Interesting Notes: Open question =? “Fractional BP” (Wiegerinck, Heskes ‘03)

29 Determine Convex Energy Heuristic: Find convex energy while is close to 1 as possible Motivation: - Bethe approximation

30 Experiments – Ising Model grid Every variable has two values variables field interaction attractive mixed

31 Experiments – Efficiency Grid size Seconds

32 Approximating Marginals – Ising model convex free energies Vs. Bethe free energy Marginal Error Interaction strength

33 convex free energies Vs. Bethe free energy Marginal Error Interaction strength Approximating Marginals – Ising model

34 Approximating Marginals – Random Graph 10 vertices. Edge density 30% (almost tree), 50% (far from tree) TRW free energy Vs. L2-convex free energy

35 Novel perspective of message passing as primal-dual optimization for the general class of Convex free energy mapped to Heuristic for setting convex free energy by “convexifying” Bethe free energy Algorithm applies to general clique sizes; BP is a particular case of our algorithm Region Graphs energies (Kikuchi) Future work: Theoretical foundation on our heuristic for setting the convex free energy. Summary