Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Slides:



Advertisements
Similar presentations
Aggregating local image descriptors into compact codes
Advertisements

Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
An Adaptive Image Enhancement Algorithm for Face Detection By Lizuo Jin, Shin’ichi Satoh, and Masao Sakauchi. ECE 738 In Young Chung.
Fırat Onur Alsaran, Neşe Alyüz, Melike Esma İlter, Mürsel Taşgın, Ahmet Burak Yoldemir CmpE 58Z Term Project.
Vision Based Control Motion Matt Baker Kevin VanDyke.
Foreground Background detection from video Foreground Background detection from video מאת : אבישג אנגרמן.
Robust Foreground Detection in Video Using Pixel Layers Kedar A. Patwardhan, Guillermoo Sapire, and Vassilios Morellas IEEE TRANSACTION ON PATTERN ANAYLSIS.
Foreground Modeling The Shape of Things that Came Nathan Jacobs Advisor: Robert Pless Computer Science Washington University in St. Louis.
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.
Different Tracking Techniques  1.Gaussian Mixture Model:  1.Construct the model of the Background.  2.Given sequence of background images find the.
AlgirdasBeinaravičius Gediminas Mazrimas Salman Mosslem.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Kernel Similarity Modeling of Texture Pattern Flow for Motion Detection in Complex Background 2011 IEEE transection on CSVT Baochang Zhang, Yongsheng Gao,
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
International Conference on Image Analysis and Recognition (ICIAR’09). Halifax, Canada, 6-8 July Video Compression and Retrieval of Moving Object.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Background Estimation with Gaussian Distribution for Image Segmentation, a fast approach Gianluca Bailo, Massimo Bariani, Paivi Ijas, Marco Raggio IEEE.
Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Region-Level Motion- Based Background Modeling and Subtraction Using MRFs Shih-Shinh Huang Li-Chen Fu Pei-Yung Hsiao 2007 IEEE.
Incremental Learning of Temporally-Coherent Gaussian Mixture Models Ognjen Arandjelović, Roberto Cipolla Engineering Department, University of Cambridge.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Illumination Normalization with Time-Dependent Intrinsic Images for Video Surveillance Yasuyuki Matsushita, Member, IEEE, Ko Nishino, Member, IEEE, Katsushi.
MULTIPLE MOVING OBJECTS TRACKING FOR VIDEO SURVEILLANCE SYSTEMS.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Shadow Detection In Video Submitted by: Hisham Abu saleh.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Tricolor Attenuation Model for Shadow Detection. INTRODUCTION Shadows may cause some undesirable problems in many computer vision and image analysis tasks,
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Mean-shift and its application for object tracking
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Hierarchical Distributed Genetic Algorithm for Image Segmentation Hanchuan Peng, Fuhui Long*, Zheru Chi, and Wanshi Siu {fhlong, phc,
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
1 Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications.
1 Webcam Mouse Using Face and Eye Tracking in Various Illumination Environments Yuan-Pin Lin et al. Proceedings of the 2005 IEEE Y.S. Lee.
Algirdas Beinaravičius Gediminas Mazrimas Salman Mosslem.
Background Subtraction based on Cooccurrence of Image Variations Seki, Wada, Fujiwara & Sumi Presented by: Alon Pakash & Gilad Karni.
Kevin Cherry Robert Firth Manohar Karki. Accurate detection of moving objects within scenes with dynamic background, in scenarios where the camera is.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Expectation-Maximization (EM) Case Studies
Jiu XU, Axel BEAUGENDRE and Satoshi GOTO Computer Sciences and Convergence Information Technology (ICCIT), th International Conference on 1 Real-time.
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Student Name: Honghao Chen Supervisor: Dr Jimmy Li Co-Supervisor: Dr Sherry Randhawa.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Timo Ahonen, Abdenour Hadid, and Matti Pietikainen
Instructor: Mircea Nicolescu Lecture 5 CS 485 / 685 Computer Vision.
Learning and Removing Cast Shadows through a Multidistribution Approach Nicolas Martel-Brisson, Andre Zaccarin IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
A. M. R. R. Bandara & L. Ranathunga
PRESENTED BY Yang Jiao Timo Ahonen, Matti Pietikainen
Face recognition using improved local texture pattern
Image Segmentation Techniques
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
CS654: Digital Image Analysis
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Fourier Transform of Boundaries
Presented by Xu Miao April 20, 2005
DIGITAL IMAGE PROCESSING Elective 3 (5th Sem.)
Presentation transcript:

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu, Matti Pietikainen, Stan Z. Li

Outline Introduction Scale Invariant Local Ternary Pattern (SILTP) Background modeling Multiscale Block-based SILTP Experimental results Conclusion

Introduction Moving object detection in video sequences is one of the main tasks in many computer vision applications. Its output is as an input to a higher level process, such as object categorization, tracking or action recognition. Background subtraction is an common approach for this task.

Introduction (cont.) The popular idea is to model temporal samples in multi-modal distribution, in either parametric or nonparametric way Parametric : GMM , Nonparametric : KDE GMM is the most popular technique. Each pixel is modeled independently using a mixture of Gaussians and updated by an online approximation. Don’t handle cast shadow and dynamic scenes well A number of existing works handle illumination such as moving shadows in special color spaces, but the learned parameters are not adaptable .

GMM Background Modeling Initial background model The first N frames of the input sequence K-means clustering The history of each pixel, , is modeled by three independent Gaussian distributions, and X is the color value, i.e. . For computational reasons, we assume the red, green, and blue pixel values are independent. 我們先用最先的N張frame來建initial的background model 為了避免這N張frame中 不是純粹只有background的景色 而會有foreground在裡面走動 我們用k-means clustering去找出存在最久的value來當作background 然後對於每個pixel的這段history 我們用三個independent的Gaussian distributions去model他 也就是RGB color value 我們在這裡假設RGB是independent的

GMM Background Modeling The probability of the observing pixel value and : the mean value and the covariance matrix of the Gaussian at time t, where . η: the Gaussian probability density function Update of and : ρ: the update rate and its value is between 0 and 1. 所以現在一個pixel value Xt 的probability可以寫成這樣 因為是RGB三個independent的Gaussian distribution 然後我們的update 是用一個update rate去update他的mean跟variance

Introduction (cont.) Besides color based background modeling, LBP based method models each pixel by a group of LBP histograms Moving shadow could not be handled well Sensitive to noise In this paper, they extend LBP to a scale invariant local ternary pattern (SILTP) operator to handle illumination variations. They propose a Pattern KDE technique to effectively model probability distribution for handling complex dynamic background In addition, they develop a multi-scale fusion scheme to consider the spatial scale information

Scale Invariant Local Ternary Pattern LBP is proved to be a powerful local image descriptor. Not robust to local image noises when neighboring pixels are similar Tan and Triggs proposed a LTP (Local Ternary Pattern) operator for face recognition. It is extended from LBP by simply adding a small offset value for comparison It’s not invariant under scale transform of intensity values

Scale Invariant Local Ternary Pattern The intensity scale invariant property of a local comparison operator is very important The illumination variations, either global or local, often cause sudden changes of gray scale intensities of neighboring pixels, which would approximately be a scale transform Given any pixel location (Xc,Yc) , SILTP encodes it as Ic is the gray intensity value of the center pixel, Ik are that of its N neighborhood pixel, denotes concatenation operator of binary strings, 𝜏 is a scale factor

The advantage of SILTP operator It is computationally efficient By introducing a tolerable range like LTP, the SILTP operator is robust to local image noises within a range. Especially in the shadowed area, the region is darker and contain more noises, in which SILTP is tolerable while local comparison result of LBP would be affected more The scale invariance property makes SILTP robust to illumination changes The illumination is suddenly changed from darker to brighter

Comparison of LBP, LTP and SILTP Black block white block

KDE of Local Patterns Given a gray scale video sequence, the pixel process with the local pattern observations over time 1,2,…,t at a pixel location(X0,Y0) is defined as where F is the texture image sequence. all background patterns within a pixel process, either uni-tary or dynamic, are distributed at just several possible bins

Pattern KDE Traditional numerical value based methods can not be used directly for modeling local patterns into background They develop a pattern KDE technique with a particular local pattern kernel that is suitable for descriptors like LBP,LTP, and SILTP First they define distance function d(p,q) as the number of different bits between two local patterns p and q Then they derive the local pattern kernel as where g is a weighting function that can typically be a Gaussian

Pattern KDE (cont.) The probability density function can be estimated smoothly as Where ci are weighting coefficients For example, the distribution of LTP pixel process with unitary back-ground is: 20(00010100)/432 hits, 21(00010101)/4 hits, 84(01010100)/6 hits, and 148(10010100)/58 hits. If no kernel technique id used, the patterns 21 and 84 might be regarded as foregrounds and the pattern 148 might be considered to be another modal In their pattern, all patterns will be considered in the same distribution

Modeling background with local patterns Given an estimated density function at time t-1 and a new coming background pattern pt, we update the new density function as (𝛼 is a learning rate) For multimodality of local pattern, they estimate K density functions for each pixel process via PKDE method, with 𝜔 𝑘,𝑡 , k= 1,2,…,K being the corresponding weights and normalized to 1 Afterwards, we estimate the probability of a new pattern pt being background as

Multiscale Block-based SILTP(MB-SILTP) Fuse multiscale spatial information to achieve better performance MB-SILTP encodes in a way similar with (1) , where Ic and Ik are replaced with mean values of corresponding blocks, of which the size 𝜔 x 𝜔 indicate the scale. To implement the MB-SILTP based background subtraction efficiently, first the original video frame is downsample by 𝜔 Afterwards, MB-SILTP can be calculated, and the proposed background model can be applied on the reduced space

Multiscale Block-based SILTP(MB-SILTP) Finally, the resulted probabilities are upsampled bilinearly to the original space and generate the foreground / background segmentation result In the way, the speed is much faster with larger scale, whereas the precision is generally lower. The background probabilities at each scale can be fused on the original space Adopt the geometric average of probabilities at each scale as the fusion score

Experimental Results Nine dataset containing complex backgrounds , such as busy human flows, moving cast shadows, etc. The proposed approach was compared with existing state-of-the-art online background subtraction algorithms Mixture of Gaussian (MoG) ACMMM03 Blockwise LBP histogram based (LBP-B) Pixelwise LBP histogram based (LBP-P) PKDE based background subtraction with LTP (PKDEltp) PKDE based background subtraction with SILTP (PKDEsiltp) MB-SILTP with 𝜔 = 3 ( ) MB-SILTP with 𝜔 = 1,2,3 ( )

Experimental Results All tested algorithm were implemented in c++ and ran on standard PC with 2.4GHz CPU, 2G memory For all algorithms, a standard OpenCV postprocessing was used which eliminates small pieces less than 15 pixels.

Experimental Results Quantitative evaluation

Conclusion They proposed an improved local image descriptor called SILTP, and demonstrated its power for background subtraction They have also proposed a multiscale block-based SILTP operator for considering the spatial scale information. A Pattern Kernel Density Estimation technique was proposed and based on it we have developed a multimodal background modeling framework