Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics.

Slides:



Advertisements
Similar presentations
Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Advertisements

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Loris Bazzani*, Marco Cristani*†, Alessandro Perina*, Michela Farenzena*, Vittorio Murino*† *Computer Science Department, University of Verona, Italy †Istituto.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Introduction To Tracking
Patch to the Future: Unsupervised Visual Prediction
電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.
Computer and Robot Vision I
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Local Descriptors for Spatio-Temporal Recognition
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
3-D Depth Reconstruction from a Single Still Image 何開暘
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
Face Recognition Based on 3D Shape Estimation
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
A Probabilistic Framework For Segmentation And Tracking Of Multiple Non Rigid Objects For Video Surveillance Aleksandar Ivanovic, Tomas S. Huang ICIP 2004.
Object Class Recognition using Images of Abstract Regions Yi Li, Jeff A. Bilmes, and Linda G. Shapiro Department of Computer Science and Engineering Department.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
A M P C CC C Automatic Contextual Pattern Modeling Pengyu Hong Beckman Institute for Advanced Science and Technology University of Illinois at Urbana Champaign.
Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to.
Information Retrieval in Practice
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Decision-Making on Robots Using POMDPs and Answer Set Programming Introduction Robots are an integral part of many sectors such as medicine, disaster rescue.
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
A General Framework for Tracking Multiple People from a Moving Camera
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Stereo Object Detection and Tracking Using Clustering and Bayesian Filtering Texas Tech University 2011 NSF Research Experiences for Undergraduates Site.
Object Detection with Discriminatively Trained Part Based Models
Markov Random Fields Probabilistic Models for Images
Kevin Cherry Robert Firth Manohar Karki. Accurate detection of moving objects within scenes with dynamic background, in scenarios where the camera is.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Semantic Alignment Spring 2009 Ben-Gurion University of the Negev.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
Image segmentation.
Object-based Classification
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
CSSE463: Image Recognition Day 21
Unsupervised Riemannian Clustering of Probability Density Functions
Nonparametric Semantic Segmentation
Saliency detection Donghun Yeo CV Lab..
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Content-Based Image Retrieval
Content-Based Image Retrieval
CSSE463: Image Recognition Day 23
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
Learning Layered Motion Segmentations of Video
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Brief Review of Recognition + Context
Matching Words with Pictures
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
“Traditional” image segmentation
Presentation transcript:

Autonomous Learning of Object Models on Mobile Robots Xiang Li Ph.D. student supervised by Dr. Mohan Sridharan Stochastic Estimation and Autonomous Robotics Lab (SEARL) Department of Computer Science Texas Tech University October 23,

Object Recognition 2 Learn Object Model color, texture or shape Test match Challenges: How to identify ROI in the image (Region Of Interest) ? What features to extract in ROI (Object model) ? Efficient Implementation. ROI

Related Work Object model O. Bjorn, PAMI12; R. Fergus, CVPR03; P. Felzenszwalb, IJCV04; N.E.K. Roman, AR10. Mobile robot C. Guo, ICRA11; M. Sridharan, IROS07; D. Parikh, PAMI12 ; J. Hoey, CVIU10. Local image feature D. Lowe, IJCV05; J. Matas, BMVC02; S. Park, IROS09; M. Calonder, ECCV10. 3

Motivation 4 Goal Learn and recognize objects autonomously in dynamic environments. Our Work  Identify ROIs autonomously based on a limited number of images with moving objects.  Build probabilistic object models using the complementary properties of different visual cues.  Fuse the information by generative model and energy minimization algorithm.

Autonomously Learning Supervised Learning  Images with the labeled regions Unsupervised Learning  Images without any labeled regions Track and cluster local image gradient features [128D vector]  A short sequence of images (motion cue) 5 ROI tt+1 ROI

Object Model Given ROI (autonomous or manual) Use the complementary properties of different visual cues 6

Motivation of SCV and Undirected Graph The difference between correct and incorrect match  The spatial arrangements of local features  The connection between the local features 7 Match by Nearest Neighbor algorithm(shortest Euclidean distance). correct Incorrect

SCV from gradient features The individual gradient features may not be unique. The spatial arrangement of local gradient features corresponding to a specific object is difficult to duplicate. 8

Connection Potentials Connection potential is computed as the color distribution of pixels between gradient features in the image ROI. 9 Build an undirected graph of connection potentials to model the neighborhood relationships between gradient features.

Parts from image segments Considers the arrangement of object parts made up of image segments. Pixels within a part have similar values, while pixels in neighboring parts have dissimilar values. 10

Color-based Representation Computes the distance between every pair of pdfs and models the distribution of distances as a Gaussian. 11 Second order color distribution statisticsColor histogram (pdf) …

Local context from image segments Probabilistic mixture models Relative positions (on, under, beside) 12

Information Fusion 13 Generative model  considers the relationship between the components Energy minimization algorithm  Identifies the ROI for recognizing the stationary objects

Robot Platforms: Erratic 1.6GHz Core2 Duo CPU 2 cameras (monocular & stereo) Laser range finder 640 × 480 Resolution Wi-Fi On board computation 14

Experimental Trial 15 Test Image Match Probabilities Net Match

Experimental Trial (Cont) 16 Test Image Match Probabilities Net Match

Object Categories 17

The Classification Accuracy 18

Conclusions Mobile robot can identify interesting objects based on motion cues, and autonomously and efficiently learn object models that exploit the complementary properties of appearance- based and contextual visual cues. Exploiting the complementary properties of these visual cues enables the robot to use generative model and energy minimization algorithms to reliably and efficiently recognize the learned. 19

Future Work Consider image sequences with multiple moving objects. Add Shape representation into object model. Extend to a team of robots collaborating in dynamic environments. 20

Q & A 21

Convex Hull The convex hull of a set of points is the smallest convex set that contains the points. Quickhull algorithm 1 computes the convex hull. 1. Barber, C.B., Dobkin, D.P., and Huhdanpaa, H.T., “The Quickhull algorithm for convex hulls”, ACM Trans. on Mathematical Software, 22(4): , Dec 1996,

Gamma distribution Context Probability Context Probability Using Gamma MatchedNot Matched context0.28± ±0.03 Other components MatchedNot Matched context0.5± ±0.10 Other components0.70.3

Experiment Example Learned robot model (Corridor) Testing Corridor(1) Prob = 0.89 Corridor(2) Prob = 0.77 Office Prob = 0.12

Generative Model Randomly generating observable data Typically given some hidden parameters Specifies a joint probability distribution over observation and label sequences. 25

Generative model (Cont) 26