Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד.

Slides:



Advertisements
Similar presentations
DECISION TREES. Decision trees  One possible representation for hypotheses.
Advertisements

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Real-Time Human Pose Recognition in Parts from Single Depth Images
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Kinect Case Study CSE P 576 Larry Zitnick
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Ensemble Learning: An Introduction
Three kinds of learning
Redaction: redaction: PANAKOS ANDREAS. An Interactive Tool for Color Segmentation. An Interactive Tool for Color Segmentation. What is color segmentation?
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Using Crowdsourcing & Big Data to Understand Agriculture in Sub-Saharan Africa Dee Luo.
12/01/11 How the Kinect Works Computational Photography Derek Hoiem, University of Illinois Photo frame-grabbed from:
Bayesian Decision Theory Making Decisions Under uncertainty 1.
Computational Photography lecture 19 – How the Kinect 1 works? CS 590 Spring 2014 Prof. Alex Berg (Credits to many other folks on individual slides)
Human tracking and counting using the KINECT range sensor based on Adaboost and Kalman Filter ISVC 2013.
A true story of trees, forests & papers
CSE 185 Introduction to Computer Vision Pattern Recognition.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 35 – Review for midterm.
Analysis: TextonBoost and Semantic Texton Forests
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Presented by Tienwei Tsai July, 2005
Miguel Reyes 1,2, Gabriel Dominguez 2, Sergio Escalera 1,2 Computer Vision Center (CVC) 1, University of Barcelona (UB) 2
Lecture 06 06/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Object Detection Using the Statistics of Parts Presented by Nicholas Chan – Advanced Perception Robust Real-time Object Detection Henry Schneiderman.
Chapter 9 – Classification and Regression Trees
Image Classification 영상분류
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Automated Target Recognition Using Mathematical Morphology Prof. Robert Haralick Ilknur Icke José Hanchi Computer Science Dept. The Graduate Center of.
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
Human pose recognition from depth image MS Research Cambridge.
MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.
Associative Hierarchical CRFs for Object Class Image Segmentation
Lecture 09 03/01/2012 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Classification (slides adapted from Rob Schapire) Eran Segal Weizmann Institute.
Lecture 08 27/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Random Forests Ujjwol Subedi. Introduction What is Random Tree? ◦ Is a tree constructed randomly from a set of possible trees having K random features.
11/05/15 How the Kinect Works Computational Photography Derek Hoiem, University of Illinois Photo frame-grabbed from:
A New Method for Crater Detection Heather Dunlop November 2, 2006.
Semi-supervised Mesh Segmentation and Labeling
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton ; University of Cambridge J. Jinn,
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
M ACHINE L EARNING : F OUNDATIONS C OURSE TAU – 2012A P ROF. Y ISHAY M ANSOUR o TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class.
Lecture 05 29/11/2011 Shai Avidan Roy Josef Jevnisek הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
DISCRIMINATIVELY TRAINED DENSE SURFACE NORMAL ESTIMATION ANDREW SHARP.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
CS 548 Spring 2016 Model and Regression Trees Showcase by Yanran Ma, Thanaporn Patikorn, Boya Zhou Showcasing work by Gabriele Fanelli, Juergen Gall, and.
Color Image Segmentation Mentor : Dr. Rajeev Srivastava Students: Achit Kumar Ojha Aseem Kumar Akshay Tyagi.
Presenter: Jae Sung Park
Leaves Recognition By Zakir Mohammed Indiana State University Computer Science.
Machine Learning: Ensemble Methods
Course : T Computer Vision
2. Skin - color filtering.
Nonparametric Semantic Segmentation
Real-Time Human Pose Recognition in Parts from Single Depth Image
Multiple Instance Learning: applications to computer vision
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Ensemble learning.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Grouping/Segmentation
MY BODY DICTIONARY.
Using simple machine learning for image segmentation
Presentation transcript:

Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Semantic Texton Forests Input: Input: 1. Training: Images with pixel level ground truth classification MSRC 21 Database

Semantic Texton Forests Input: Input: 1. Training: Images with pixel level ground truth classification. 2. Testing: Images Output: Output: A classification of each pixel in the test images to an object class.

Main Mathematical Tools 1. Conditional Random Fields (CRF) 2. Textons ( Convolution with Filter Banks ) 3. K-Means 4. Joint Boost Randomized Decision Forests

Decision Trees

Decision Rules (Split Functions) 1.Choose one or two pixels near pixel of interest 2.Calculate simple function of their values: -Difference -Sum -Absolute Difference -Or just the pixel valuere To a hreshold Motivation: very small number of computer operations. Easy to implement on GPU.

Training the tree For each node: 1.Randomly generate a few decision rules 2.Choose the one that maximally improves the ability of the tree to separate between classes. (E(I) – entropy) Stop when tree reached pre-defined depth, or when no further improvement in classification can be achieved.

Decision Forests For added strength, create several trees instead of just one. Each tree is trained using a different subset of the training data.

Classifying at test time For each pixel in the test image: Apply the segmentation forest – marking a path in each tree (yellow). Each leaf is associated with a histogram of classes. Average the histograms from all tree, achieving a vector of probabilities for this pixel belonging to each class:

Classifying at test time The probability vectors derived from the Decision Forests can be used to classify pixels to classes, by assigning to each pixel the label that is most likely. The results are very noisy

Paths on trees represent Texture The texture of an area around a pixel can be represented by a vector comprised of all the nodes in the decision forest that belong to paths traversed when applying the forest to this pixel. In the above example, this would be the vector: ( 1, 3, 6, 10, 17, …, 84, 85, 87, 91 ) This vector is called a Semantic Texton.

Example of Semantic Textons A visualization of leaf nodes from one tree (distance d = 21 pixels). Each patch is the average of all patches in the training images assigned to a particular leaf node. Features evident include color, horizontal, vertical and diagonal edges, blobs, ridges and corners.

Example of Semantic Textons

Second Randomized Decision Forest The split functions at the nodes are based on the Texture-Layout filter: Two types of Texture-Layout filters are used: 1.Count the occurrence of a certain node in the semantic textons of pixels in a rectangle. (, )

Second Randomized Decision Forest The split functions at the nodes are based on the Texture-Layout filter: Two types of Texture-Layout filters are used: 1.Count the occurrence of a certain node N in the semantic textons of pixels in a rectangle. 2.Sum the probabilities of belonging to a certain class K for all pixels in a rectangle. This is semantic context. (, )

STF - Results Overall Accuracy: 72% Though less aesthetic, these results are quantitatively almost as good as those of TextonBoost.

STF - Summary To speed up calculation, this algorithm uses Radomized Decision Forests instead of other mathemaical tools used in TextonBoost. One RDF is used to calculate texture. A second RDF is used to classifiy pixels to object classes. The results are almost as good (quantitatively) as those of TextonBoost. The algorithm is much quicker than TextonBoost.

Real-Time Human Pose Recognition in Parts from Single Depth Images J. Shotton, A. Fitzgibbon, M. Cook, T. Sharp, M. Finocchio, R. Moore, A. Kipman, A. Blake

Overview

Classification Classify each pixel to one of 31 body part classes: – Left Upper / Right Upper / Left Lower / Right Lower head – neck – Left / Right shoulder – Left Upper / Right Upper / Left Lower / Right Lower arm – Left /Right elbow – Left /Right wrist – Left /Right hand – Left Upper / Right Upper / Left Lower / Right Lower torso – Left Upper / Right Upper / Left Lower / Right Lower leg – Left /Right knee – Left /Right ankle – Left /Right foot

Randomized Decision Forests The Randomized Decision Forests are very deep (depth = 20). => A very strong ability to classify correctly on the Training Set. => A risk of over fitting. This risk is averted by using a very (very) large training set, containing examples of many poses we wish to recognize. Most of this training set is artificially generated.

Decision Rules

Classification → body part locations Separately for each of 31 Body Part Classes: – Create Probability map – Project to 3D space – Find modes ( using mean-shift ) – The modes (with a little post processing ) are the suggested body part locations (the output of this algorithm).