Unsupervised Learning of Visual Taxonomies IEEE conference on CVPR 2008 Evgeniy Bart – Caltech Ian Porteous – UC Irvine Pietro Perona – Caltech Max Welling – UC Irvine


1 Unsupervised Learning of Visual Taxonomies IEEE conference on CVPR 2008 Evgeniy Bart – Caltech Ian Porteous – UC Irvine Pietro Perona – Caltech Max Welling – UC Irvine

2 Introduction Recent progress in visual recognition has dealt with up to 256 categories. The current organization is an unordered ‘laundry list’ of names and associated category models. A tree structure describes not only the ‘atomic’ categories but also higher-level, broader categories in a hierarchical fashion. Why worry about taxonomies?

3 TAX Model Images are represented as bags of visual words. Each visual word is a cluster of visually similar image patches and is the basic unit of the model. A topic represents a set of words that co-occur in images; typically, this corresponds to a coherent visual structure, such as sky or sand. A category is represented as a multinomial distribution over all the topics.
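The bag-of-visual-words representation above can be sketched in a few lines. This is a minimal illustration, not the authors' pipeline: the 2-D descriptors, the three-entry codebook, and the function names `nearest_word` and `bag_of_words` are all hypothetical, and the cluster centers are assumed to have been learned beforehand.

```python
from collections import Counter


def nearest_word(patch, codebook):
    """Return the index of the codebook center closest to `patch`."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda k: sq_dist(patch, codebook[k]))


def bag_of_words(patches, codebook):
    """Map each patch descriptor to a visual word and count occurrences."""
    counts = Counter(nearest_word(p, codebook) for p in patches)
    return [counts.get(k, 0) for k in range(len(codebook))]


# Hypothetical toy data: 2-D patch descriptors and a 3-word codebook.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
patches = [(0.1, -0.2), (0.9, 1.2), (4.8, 5.1), (5.2, 4.9), (1.1, 0.8)]

histogram = bag_of_words(patches, codebook)
print(histogram)  # one count per visual word; spatial order is discarded
```

The resulting histogram is the only thing a topic model sees of the image, which is why co-occurrence of words, rather than their layout, drives the topics.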

4 TAX Model Shared information is represented at nodes
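One way to read "shared information is represented at nodes" is sketched below. It assumes, beyond what the slide states, that each image follows a root-to-leaf path in the tree and that each of its visual words is generated from the topic distribution of some node on that path, so images under a common ancestor share that ancestor's topics. The tree, the node names (`root`, `c1`, `c2`), and all probabilities are hypothetical toy values.

```python
import random

# Hypothetical toy taxonomy: a root with two leaf categories.
# Each node holds a multinomial distribution over 3 topics.
node_topics = {
    "root": [0.8, 0.1, 0.1],  # shared by every image below the root
    "c1": [0.1, 0.8, 0.1],
    "c2": [0.1, 0.1, 0.8],
}
parent = {"root": None, "c1": "root", "c2": "root"}

# Each topic is a multinomial distribution over 4 visual words.
topic_words = [
    [0.7, 0.1, 0.1, 0.1],
    [0.1, 0.7, 0.1, 0.1],
    [0.1, 0.1, 0.4, 0.4],
]


def path_to_root(leaf):
    """Return the list of nodes from the root down to `leaf`."""
    path = []
    node = leaf
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def generate_image(leaf, n_words, rng):
    """Draw visual words for one image assigned to `leaf` (assumed process)."""
    path = path_to_root(leaf)
    words = []
    for _ in range(n_words):
        node = rng.choice(path)  # assumed: uniform choice of level on the path
        topic = rng.choices(range(len(topic_words)), weights=node_topics[node])[0]
        word = rng.choices(range(len(topic_words[topic])), weights=topic_words[topic])[0]
        words.append(word)
    return words


rng = random.Random(0)
image_words = generate_image("c1", 20, rng)
print(path_to_root("c1"), image_words)
```

Under this reading, an image in `c1` and an image in `c2` both draw part of their words from the `root` distribution, which is where the shared structure lives.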

5 Inference The goal is to learn the structure of the taxonomy and to estimate the parameters of the model. Inference uses Gibbs sampling, which draws samples from the posterior distribution of the model’s parameters given the data. The taxonomy structure and other parameters of interest can then be estimated from these samples.
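To make the Gibbs sampling step concrete, here is a minimal collapsed Gibbs sampler for a flat topic model (LDA), not for TAX itself: a TAX sampler would also have to resample the taxonomy-related variables, which this sketch omits. The function name `gibbs_lda`, the hyperparameters `alpha` and `beta`, and the toy documents are assumptions made for illustration.

```python
import random


def gibbs_lda(docs, n_topics, n_words, alpha=0.5, beta=0.1, n_iters=50, seed=0):
    """Collapsed Gibbs sampling of per-token topic assignments in flat LDA."""
    rng = random.Random(seed)
    # Random initial topic for every word token.
    z = [[rng.randrange(n_topics) for _ in doc] for doc in docs]
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * n_words for _ in range(n_topics)]
    topic_total = [0] * n_topics
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            k = z[d][i]
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current token from all counts.
                k = z[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Conditional of this token's topic given all other assignments.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                # Add the token back under its newly sampled topic.
                z[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return z, doc_topic, topic_word


# Hypothetical toy corpus: each "image" is a list of visual-word indices.
docs = [[0, 0, 1, 1, 0], [2, 3, 3, 2, 2], [0, 1, 2, 3]]
z, doc_topic, topic_word = gibbs_lda(docs, n_topics=2, n_words=4)
print(doc_topic)
```

Each sweep visits every token once; parameters of interest are then estimated from the counts collected over the later sweeps.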

6 Inference To perform sampling, we calculate the conditional distributions
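The conditional distributions for TAX are not reproduced in this transcript. As a point of reference only, the well-known collapsed conditional for the topic of a single word token in a flat topic model (LDA) has the form below; the symbols are assumptions introduced here, with $n^{-i}$ denoting counts that exclude token $i$, $W$ the vocabulary size, and $\alpha$, $\beta$ symmetric Dirichlet hyperparameters.

```latex
p(z_i = k \mid \mathbf{z}_{-i}, \mathbf{w}) \;\propto\;
\left(n_{d,k}^{-i} + \alpha\right)
\frac{n_{k,w_i}^{-i} + \beta}{n_{k}^{-i} + W\beta}
```

Here $n_{d,k}^{-i}$ counts tokens in image $d$ assigned to topic $k$, $n_{k,w_i}^{-i}$ counts assignments of word $w_i$ to topic $k$ across all images, and $n_{k}^{-i}$ is the total number of tokens assigned to topic $k$.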

7 Experiment 1 : Corel 300 color images were picked from the Corel dataset. ‘Space-color histograms’ define the visual words (2048 visual words in total). 500 pixels were sampled from each image and encoded using the space-color histograms.
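The encoding step can be sketched as follows. The slide does not give the quantization, so this example assumes a 2×2 spatial grid and 8 bins per RGB channel, chosen only because 2·2·8³ = 2048 matches the stated vocabulary size. The function names and the synthetic image are hypothetical.

```python
import random

GRID = 2   # assumed: 2x2 spatial grid
BINS = 8   # assumed: 8 bins per color channel
N_WORDS = GRID * GRID * BINS ** 3  # 2048 under these assumptions


def space_color_word(x, y, r, g, b, width, height):
    """Quantize a pixel's position and RGB color into one visual-word index."""
    cell_x = min(x * GRID // width, GRID - 1)
    cell_y = min(y * GRID // height, GRID - 1)
    rq, gq, bq = (c * BINS // 256 for c in (r, g, b))
    color = (rq * BINS + gq) * BINS + bq
    return (cell_y * GRID + cell_x) * BINS ** 3 + color


def encode_image(pixels, width, height, n_samples=500, seed=0):
    """Sample pixels uniformly at random and count their visual words.

    `pixels[y][x]` is an (r, g, b) tuple with channel values in 0..255.
    """
    rng = random.Random(seed)
    counts = [0] * N_WORDS
    for _ in range(n_samples):
        x = rng.randrange(width)
        y = rng.randrange(height)
        r, g, b = pixels[y][x]
        counts[space_color_word(x, y, r, g, b, width, height)] += 1
    return counts


# Hypothetical synthetic image: a 64x48 color gradient.
width, height = 64, 48
pixels = [[((x * 4) % 256, (y * 5) % 256, 128) for x in range(width)]
          for y in range(height)]

counts = encode_image(pixels, width, height)
print(sum(counts), sum(1 for c in counts if c > 0))
```

Because position is part of the word, the same color in the top-left and bottom-right of an image maps to two different visual words.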

8 Experiment 1 : Corel

9 [Figure: taxonomy tree; node labels A, B, R and 1 to 9]

10 [Figure; labels A and B]

11 Experiment 2 : 13 scenes

12

13

14 Evaluation of Experiment 2

15 Conclusion Supervised TAX outperforms supervised LDA, which suggests that a hierarchical organization better fits the natural structure of image patches. The main limitation of TAX is the speed of training: for example, with 1300 training images, learning took 24 hours.

