Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Missing variable problems In many vision problems, if some.

Slides:



Advertisements
Similar presentations
Unsupervised Learning
Advertisements

Clustering Beyond K-means
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Maximum Likelihood And Expectation Maximization Lecture Notes for CMPUT 466/551 Nilanjan Ray.
Computer Vision - A Modern Approach Set: Model-based Vision Slides by D.A. Forsyth Recognition by Hypothesize and Test General idea –Hypothesize object.
Segmentation and Fitting Using Probabilistic Methods
Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.
Visual Recognition Tutorial
Lecture 6 Image Segmentation
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 10 Statistical Modelling Martin Russell.
Motion Detection And Analysis Michael Knowles Tuesday 13 th January 2004.
Computer Vision Fitting Marc Pollefeys COMP 256 Some slides and illustrations from D. Forsyth, T. Darrel, A. Zisserman,...
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
First introduced in 1977 Lots of mathematical derivation Problem : given a set of data (data is incomplete or having missing values). Goal : assume the.
Region Segmentation. Find sets of pixels, such that All pixels in region i satisfy some constraint of similarity.
Image processing. Image operations Operations on an image –Linear filtering –Non-linear filtering –Transformations –Noise removal –Segmentation.
Computer Vision - A Modern Approach
Computer Vision - A Modern Approach Set: Fitting Slides by D.A. Forsyth Fitting Choose a parametric object/some objects to represent a set of tokens Most.
AGC DSP AGC DSP Professor A G Constantinides© Estimation Theory We seek to determine from a set of data, a set of parameters such that their values would.
Problem Sets Problem Set 3 –Distributed Tuesday, 3/18. –Due Thursday, 4/3 Problem Set 4 –Distributed Tuesday, 4/1 –Due Tuesday, 4/15. Probably a total.
Segmentation Divide the image into segments. Each segment:
Announcements Project 2 more signup slots questions Picture taking at end of class.
Expectation Maximization Algorithm
Image Segmentation Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book.
Some slides and illustrations from D. Forsyth, …
Fitting. Choose a parametric object/some objects to represent a set of tokens Most interesting case is when criterion is not local –can’t tell whether.
Tracking with Linear Dynamic Models. Introduction Tracking is the problem of generating an inference about the motion of an object given a sequence of.
Clustering with Bregman Divergences Arindam Banerjee, Srujana Merugu, Inderjit S. Dhillon, Joydeep Ghosh Presented by Rohit Gupta CSci 8980: Machine Learning.
Lecture 10: Robust fitting CS4670: Computer Vision Noah Snavely.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Fitting.
Image Segmentation Image segmentation is the operation of partitioning an image into a collection of connected sets of pixels. 1. into regions, which usually.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
沈致远. Test error(generalization error): the expected prediction error over an independent test sample Training error: the average loss over the training.
Feature and object tracking algorithms for video tracking Student: Oren Shevach Instructor: Arie nakhmani.
EM and expected complete log-likelihood Mixture of Experts
Model Inference and Averaging
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Computer Vision - A Modern Approach Set: Tracking Slides by D.A. Forsyth The three main issues in tracking.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
EM and model selection CS8690 Computer Vision University of Missouri at Columbia Missing variable problems In many vision problems, if some variables.
Template matching and object recognition. CS8690 Computer Vision University of Missouri at Columbia Matching by relations Idea: –find bits, then say object.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: ML and Simple Regression Bias of the ML Estimate Variance of the ML Estimate.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
EECS 274 Computer Vision Model Fitting. Fitting Choose a parametric object/some objects to represent a set of points Three main questions: –what object.
1 CSE 552/652 Hidden Markov Models for Speech Recognition Spring, 2006 Oregon Health & Science University OGI School of Science & Engineering John-Paul.
Probability and Statistics in Vision. Probability Objects not all the sameObjects not all the same – Many possible shapes for people, cars, … – Skin has.
Learning Sequence Motifs Using Expectation Maximization (EM) and Gibbs Sampling BMI/CS 776 Mark Craven
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
Model fitting ECE 847: Digital Image Processing Stan Birchfield Clemson University.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP10 Advanced Segmentation Miguel Tavares.
Fitting.
Miguel Tavares Coimbra
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Model fitting ECE 847: Digital Image Processing Stan Birchfield
Fitting.
Classification of unlabeled data:
Motion Detection And Analysis
Learning Sequence Motif Models Using Expectation Maximization (EM)
Latent Variables, Mixture Models and EM
A special case of calibration
Segmentation and Grouping
Hidden Markov Models Part 2: Algorithms
دانشگاه صنعتی امیرکبیر Instructor : Saeed Shiry
Computer Vision - A Modern Approach
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Missing variable problems
Calibration and homographies
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
CS5760: Computer Vision Lecture 9: RANSAC Noah Snavely
Presentation transcript:

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Missing variable problems In many vision problems, if some variables were known the maximum likelihood inference problem would be easy –fitting; if we knew which line each token came from, it would be easy to determine line parameters –segmentation; if we knew the segment each pixel came from, it would be easy to determine the segment parameters –fundamental matrix estimation; if we knew which feature corresponded to which, it would be easy to determine the fundamental matrix –etc. This sort of thing happens in statistics, too

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Missing variable problems Strategy –estimate appropriate values for the missing variables –plug these in, now estimate parameters –re-estimate appropriate values for missing variables, continue eg –guess which line gets which point –now fit the lines –now reallocate points to lines, using our knowledge of the lines –now refit, etc. We’ve seen this line of thought before (k means)

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Missing variables - strategy We have a problem with parameters, missing variables This suggests: Iterate until convergence –replace missing variable with expected values, given fixed values of parameters –fix missing variables, choose parameters to maximise likelihood given fixed values of missing variables e.g., iterate till convergence –allocate each point to a line with a weight, which is the probability of the point given the line –refit lines to the weighted set of points Converges to local extremum Somewhat more general form is available

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Lines and robustness We have one line, and n points Some come from the line, some from “noise” This is a mixture model: We wish to determine –line parameters –p(comes from line)

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Estimating the mixture model Introduce a set of hidden variables, , one for each point. They are one when the point is on the line, and zero when off. If these are known, the negative log-likelihood becomes (the line’s parameters are  c): Here K is a normalising constant, kn is the noise intensity (we’ll choose this later).

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Substituting for delta We shall substitute the expected value of , for a given  recall  =( , c, ) E(  _i)=1. P(  _i=1|  ) Notice that if kn is small and positive, then if distance is small, this value is close to 1 and if it is large, close to zero

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Algorithm for line fitting Obtain some start point Now compute  ’s using formula above Now compute maximum likelihood estimate of – , c come from fitting to weighted points – comes by counting Iterate to convergence

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth The expected values of the deltas at the maximum (notice the one value close to zero).

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Closeup of the fit

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Choosing parameters What about the noise parameter, and the sigma for the line? –several methods from first principles knowledge of the problem (seldom really possible) play around with a few examples and choose (usually quite effective, as precise choice doesn’t matter much) –notice that if kn is large, this says that points very seldom come from noise, however far from the line they lie usually biases the fit, by pushing outliers into the line rule of thumb; its better to fit to the better fitting points, within reason; if this is hard to do, then the model could be a problem

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Other examples Segmentation –a segment is a gaussian that emits feature vectors (which could contain colour; or colour and position; or colour, texture and position). –segment parameters are mean and (perhaps) covariance –if we knew which segment each point belonged to, estimating these parameters would be easy –rest is on same lines as fitting line Fitting multiple lines –rather like fitting one line, except there are more hidden variables –easiest is to encode as an array of hidden variables, which represent a table with a one where the i’th point comes from the j’th line, zeros otherwise –rest is on same lines as above

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Issues with EM Local maxima –can be a serious nuisance in some problems –no guarantee that we have reached the “right” maximum Starting –k means to cluster the points is often a good idea

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Local maximum

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth which is an excellent fit to some points

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth and the deltas for this maximum

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth A dataset that is well fitted by four lines

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Result of EM fitting, with one line (or at least, one available local maximum).

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Result of EM fitting, with two lines (or at least, one available local maximum).

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Seven lines can produce a rather logical answer

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Figure from “Color and Texture Based Image Segmentation Using EM and Its Application to Content Based Image Retrieval”,S.J. Belongie et al., Proc. Int. Conf. Computer Vision, 1998, c1998, IEEE Segmentation with EM

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Motion segmentation with EM Model image pair (or video sequence) as consisting of regions of parametric motion –affine motion is popular Now we need to –determine which pixels belong to which region –estimate parameters Likelihood –assume Straightforward missing variable problem, rest is calculation

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Three frames from the MPEG “flower garden” sequence Figure from “Representing Images with layers,”, by J. Wang and E.H. Adelson, IEEE Transactions on Image Processing, 1994, c 1994, IEEE

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Grey level shows region no. with highest probability Segments and motion fields associated with them Figure from “Representing Images with layers,”, by J. Wang and E.H. Adelson, IEEE Transactions on Image Processing, 1994, c 1994, IEEE

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth If we use multiple frames to estimate the appearance of a segment, we can fill in occlusions; so we can re-render the sequence with some segments removed. Figure from “Representing Images with layers,”, by J. Wang and E.H. Adelson, IEEE Transactions on Image Processing, 1994, c 1994, IEEE

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Some generalities Many, but not all problems that can be attacked with EM can also be attacked with RANSAC –need to be able to get a parameter estimate with a manageably small number of random choices. –RANSAC is usually better Didn’t present in the most general form –in the general form, the likelihood may not be a linear function of the missing variables –in this case, one takes an expectation of the likelihood, rather than substituting expected values of missing variables –Issue doesn’t seem to arise in vision applications.

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Model Selection We wish to choose a model to fit to data –e.g. is it a line or a circle? –e.g is this a perspective or orthographic camera? –e.g. is there an aeroplane there or is it noise? Issue –In general, models with more parameters will fit a dataset better, but are poorer at prediction –This means we can’t simply look at the negative log- likelihood (or fitting error)

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Top is not necessarily a better fit than bottom (actually, almost always worse)

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth We can discount the fitting error with some term in the number of parameters in the model.

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Discounts AIC (an information criterion) –choose model with smallest value of –p is the number of parameters BIC (Bayes information criterion) –choose model with smallest value of –N is the number of data points Minimum description length –same criterion as BIC, but derived in a completely different way

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Cross-validation Split data set into two pieces, fit to one, and compute negative log-likelihood on the other Average over multiple different splits Choose the model with the smallest value of this average The difference in averages for two different models is an estimate of the difference in KL divergence of the models from the source of the data

Computer Vision - A Modern Approach Set: Probability in segmentation Slides by D.A. Forsyth Model averaging Very often, it is smarter to use multiple models for prediction than just one e.g. motion capture data –there are a small number of schemes that are used to put markers on the body –given we know the scheme S and the measurements D, we can estimate the configuration of the body X We want If it is obvious what the scheme is from the data, then averaging makes little difference If it isn’t, then not averaging underestimates the variance of X --- we think we have a more precise estimate than we do.