Frontiers in Applications of Machine Learning Chris Bishop Microsoft Research

Slides:



Advertisements
Similar presentations
Part 2: Unsupervised Learning
Advertisements

Mean-Field Theory and Its Applications In Computer Vision1 1.
CS188: Computational Models of Human Behavior
Scaling Up Graphical Model Inference
Gated Graphs and Causal Inference
Linear Time Methods for Propagating Beliefs Min Convolution, Distance Transforms and Box Sums Daniel Huttenlocher Computer Science Department December,
Exact Inference. Inference Basic task for inference: – Compute a posterior distribution for some query variables given some observed evidence – Sum out.
Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.
Exact Inference in Bayes Nets
Introduction to Belief Propagation and its Generalizations. Max Welling Donald Bren School of Information and Computer and Science University of California.
An introduction to Bayesian networks Stochastic Processes Course Hossein Amirkhani Spring 2011.
EE462 MLCV Lecture Introduction of Graphical Models Markov Random Fields Segmentation Tae-Kyun Kim 1.
Graphical models: approximate inference and learning CA6b, lecture 5.
Markov Networks.
Introduction to Sampling based inference and MCMC Ata Kaban School of Computer Science The University of Birmingham.
GS 540 week 6. HMM basics Given a sequence, and state parameters: – Each possible path through the states has a certain probability of emitting the sequence.
1 Graphical Models in Data Assimilation Problems Alexander Ihler UC Irvine Collaborators: Sergey Kirshner Andrew Robertson Padhraic Smyth.
Extending Expectation Propagation for Graphical Models Yuan (Alan) Qi Joint work with Tom Minka.
Belief Propagation, Junction Trees, and Factor Graphs
Understanding Belief Propagation and its Applications Dan Yuan June 2004.
Third Generation Machine Intelligence Christopher M. Bishop Microsoft Research, Cambridge Microsoft Research Summer School 2009.
Why Machine Learning and Games? Machine Learning in Video Games Drivatars™ Reinforcement Learning Machine Learning in Online.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Approximate Inference 2: Monte Carlo Markov Chain
24 November, 2011National Tsin Hua University, Taiwan1 Mathematical Structures of Belief Propagation Algorithms in Probabilistic Information Processing.
Computer vision: models, learning and inference
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Summarized by B.-H. Kim Biointelligence Laboratory, Seoul National.
12/07/2008UAI 2008 Cumulative Distribution Networks and the Derivative-Sum-Product Algorithm Jim C. Huang and Brendan J. Frey Probabilistic and Statistical.
1 Learning CRFs with Hierarchical Features: An Application to Go Scott Sanner Thore Graepel Ralf Herbrich Tom Minka TexPoint fonts used in EMF. Read the.
UIUC CS 498: Section EA Lecture #21 Reasoning in Artificial Intelligence Professor: Eyal Amir Fall Semester 2011 (Some slides from Kevin Murphy (UBC))
Tokyo Institute of Technology, Japan Yu Nishiyama and Sumio Watanabe Theoretical Analysis of Accuracy of Gaussian Belief Propagation.
14 October, 2010LRI Seminar 2010 (Univ. Paris-Sud)1 Statistical performance analysis by loopy belief propagation in probabilistic image processing Kazuyuki.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
David Stern, Thore Graepel, Ralf Herbrich Online Services and Advertising Group MSR Cambridge.
Daphne Koller Message Passing Belief Propagation Algorithm Probabilistic Graphical Models Inference.
CS Statistical Machine learning Lecture 24
Foundations of Machine Learning and Data Rainer Marrone, Ralf Möller.
Probabilistic Machine Learning in Computational Advertising Thore Graepel, Thomas Borchert, Ralf Herbrich and Joaquin Quiñonero Candela Online Services.
1 Mean Field and Variational Methods finishing off Graphical Models – Carlos Guestrin Carnegie Mellon University November 5 th, 2008 Readings: K&F:
Lecture 2: Statistical learning primer for biologists
Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.
Wei Sun and KC Chang George Mason University March 2008 Convergence Study of Message Passing In Arbitrary Continuous Bayesian.
Exact Inference in Bayes Nets. Notation U: set of nodes in a graph X i : random variable associated with node i π i : parents of node i Joint probability:
Christopher M. Bishop, Pattern Recognition and Machine Learning 1.
Pattern Recognition and Machine Learning
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Machine Learning – Lecture 18
Daphne Koller Overview Conditional Probability Queries Probabilistic Graphical Models Inference.
Perceptual and Sensory Augmented Computing Machine Learning, Summer’09 Machine Learning – Lecture 13 Exact Inference & Belief Propagation Bastian.
1 Relational Factor Graphs Lin Liao Joint work with Dieter Fox.
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Applications of Machine Learning to the Game of Go David Stern Applied Games Group Microsoft Research Cambridge (working with Thore Graepel, Ralf Herbrich,
30 November, 2005 CIMCA2005, Vienna 1 Statistical Learning Procedure in Loopy Belief Propagation for Probabilistic Image Processing Kazuyuki Tanaka Graduate.
10 October, 2007 University of Glasgow 1 EM Algorithm with Markov Chain Monte Carlo Method for Bayesian Image Analysis Kazuyuki Tanaka Graduate School.
Introduction to Sampling based inference and MCMC
Matchbox Large Scale Online Bayesian Recommendations
Extending Expectation Propagation for Graphical Models
Sublinear Computational Time Modeling in Statistical Machine Learning Theory for Markov Random Fields Kazuyuki Tanaka GSIS, Tohoku University, Sendai,
Prof. Adriana Kovashka University of Pittsburgh April 4, 2017
Bayesian Ranking using Expectation Propagation and Factor Graphs
Markov Networks.
CSCI 5822 Probabilistic Models of Human and Machine Learning
Expectation-Maximization & Belief Propagation
Lecture 3: Exact Inference in GMs
Properties of BP Algorithm
Markov Networks.
Mean Field and Variational Methods Loopy Belief Propagation
Graduate School of Information Sciences, Tohoku University
Presentation transcript:

Frontiers in Applications of Machine Learning Chris Bishop Microsoft Research

A New Framework for ML Bayesian formulation Probabilistic graphical models Deterministic approximate inference algorithms (based on local message-passing)

Directed Graphs General factorization:

Undirected Graphs Clique Maximal Clique

Factor Graphs

The Sum-Product Algorithm vwx f1(v,w)f1(v,w)f2(w,x)f2(w,x) y f3(x,y)f3(x,y) z f4(x,z)f4(x,z)

Messages: From Factors To Variables wx f2(w,x)f2(w,x) y f3(x,y)f3(x,y) z f4(x,z)f4(x,z)

Messages: From Variables To Factors x f2(w,x)f2(w,x) y f3(x,y)f3(x,y) z f4(x,z)f4(x,z)

Approximate Inference Algorithms True distributionMonte Carlo VB / Loopy BP / EP Local message passing on the graph

Illustration: Bayesian Ranking Ralf Herbrich Tom Minka Thore Graepel

Two Player Match Outcome Model y 12 p1p1 p1p1 p2p2 p2p2 s1s1 s1s1 s2s2 s2s2

“Ordering with Draws” Likelihood Player 1 wins Players 1 and 2 draw Player2 wins s 1 - s 2 Probability density Performance of player 1 Performance of player 2 Player 1 wins Player 2 wins Players 1 and 2 draw d 1 =

Two Team Match Outcome Model Skill of a team is the sum of the skills of its members y 12 t1t1 t1t1 t2t2 t2t2 s2s2 s2s2 s3s3 s3s3 s1s1 s1s1 s4s4 s4s4

Multiple Team Match Outcome Model s1s1 s1s1 s2s2 s2s2 s3s3 s3s3 s4s4 s4s4 t1t1 t1t1 y 12 t2t2 t2t2 t3t3 t3t3 y 23

Efficient Approximate Inference s1s1 s1s1 s2s2 s2s2 s3s3 s3s3 s4s4 s4s4 t1t1 t1t1 y 12 t2t2 t2t2 t3t3 t3t3 y 23 Gaussian Prior Factors Ranking Likelihood Factors

Convergence Level Number of Games char (Elo) SQLWildman (Elo) char (TrueSkill ™ ) SQLWildman (TrueSkill ™ )

Skill Beliefs and Skill Dynamics Dynamics via a Markov chain on skills y 12 p1p1 p1p1 p2p2 p2p2 p1’p1’ p1’p1’ p2’p2’ p2’p2’ s1s1 s1s1 s2s2 s2s2 s1’s1’ s1’s1’ s2’s2’ s2’s2’ ’

Bayesian Ranking: TrueSkill TM Xbox 360 Live: launched September 2005 – every 360 game uses TrueSkill TM to match players – 7.1 million active users, 2.5 million matches per day First “planet-scale” application of Bayesian methods

Further Reading A New Framework for Machine Learning, C. M. Bishop (2008) Invited paper at the 2008 World Congress on Computational Intelligence. Lecture Notes in Computer Science LNCS 5050, 1–24. Springer.