Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Slides:



Advertisements
Similar presentations
Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.
Advertisements

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
1 Challenge the future HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences Omar Oreifej Zicheng Liu CVPR 2013.
Patch to the Future: Unsupervised Visual Prediction
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
Lecture 31: Modern object recognition
LPP-HOG: A New Local Image Descriptor for Fast Human Detection Andy Qing Jun Wang and Ru Bo Zhang IEEE International Symposium.
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
Robust Object Tracking via Sparsity-based Collaborative Model
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Local Descriptors for Spatio-Temporal Recognition
IEEE TCSVT 2011 Wonjun Kim Chanho Jung Changick Kim
Bag of Features Approach: recent work, using geometric information.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Personal Driving Diary: Constructing a Video Archive of Everyday Driving Events IEEE workshop on Motion and Video Computing ( WMVC) 2011 IEEE Workshop.
A Study of Approaches for Object Recognition
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
Exercise Session 10 – Image Categorization
Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.
Bag of Video-Words Video Representation
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Flow Based Action Recognition Papers to discuss: The Representation and Recognition of Action Using Temporal Templates (Bobbick & Davis 2001) Recognizing.
Action recognition with improved trajectories
Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.
Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Spatio-temporal constraints for recognizing 3D objects in videos Nicoletta Noceti Università degli Studi di Genova.
Beauty is Here! Evaluating Aesthetics in Videos Using Multimodal Features and Free Training Data Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang School of.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Mentor: Salman Khokhar Action Recognition in Crowds Week 7.
Pedestrian Detection and Localization
UCF REU: Weeks 1 & 2. Gradient Code Gradient Direction of the Gradient: Calculating theta.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Recognizing Action at a Distance Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik Computer Science Division, UC Berkeley Presented by Pundik.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs CVPR 2005 Another Descriptor.
Sean M. Ficht.  Problem Definition  Previous Work  Methods & Theory  Results.
First-Person Activity Recognition: What Are They Doing to Me? M. S. Ryoo and Larry Matthies Jet Propulsion Laboratory, California Institute of Technology,
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
A Tutorial on using SIFT Presented by Jimmy Huff (Slightly modified by Josiah Yoder for Winter )
Week 10 Emily Hand UNR.
WEEK4 RESEARCH Amari Lewis Aidean Sharghi. PREPARING THE DATASET  Cars – 83 samples  3 images for each sample when x=0  7 images for each sample when.
Object Recognizing. Object Classes Individual Recognition.
Line Matching Jonghee Park GIST CV-Lab..  Lines –Fundamental feature in many computer vision fields 3D reconstruction, SLAM, motion estimation –Useful.
Goggle Gist on the Google Phone A Content-based image retrieval system for the Google phone Manu Viswanathan Chin-Kai Chang Ji Hyun Moon.
SUN Database: Large-scale Scene Recognition from Abbey to Zoo Jianxiong Xiao *James Haysy Krista A. Ehinger Aude Oliva Antonio Torralba Massachusetts Institute.
Motion Features for Action Recognition YeHao 3/11/2014.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.
Week 5 Emily Hand UNR. AdaBoost For our previous detector, we used SVM.  Color Histogram We decided to try AdaBoost  Mean Blocks.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Face recognition using Histograms of Oriented Gradients
A. M. R. R. Bandara & L. Ranathunga
Data Driven Attributes for Action Detection
Performance of Computer Vision
A Tutorial on HOG Human Detection
Action Recognition in Temporally Untrimmed Videos
Computer Vision James Hays
Two-Stream Convolutional Networks for Action Recognition in Videos
Multiple Feature Learning for Action Classification
Visualization of computational image feature descriptors.
Paper Reading Dalong Du April.08, 2011.
Presentation transcript:

Activity Recognition Aneeq Zia

Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features for action recognition”, Heng Wang et all “Action Recognition by Dense Trajectories”, Heng Wang et all Summary References

Typical methods used for action recognition

Evaluation of local spatio- temporal features for action recognition

Result

Action Recognition by Dense Trajectories

Dense Trajectories Feature trajectories have shown to be efficient for representing videos Extracted using KLT tracker or matching SIFT descriptors between frames However, the quantity and quality is generally not enough This paper proposes an approach to describe videos by dense trajectories

Dense Trajectories The trajectories are obtained by tracking densely sampled points using optical flow fields A local descriptor is introduced that overcomes the problem of camera motion The descriptor extends the motion coding scheme based motion motion boundaries developed in the context of human detection

Dense Trajectories Feature points are sampled on a grid spaced by W (=5) pixels and tracked in each scale separately 8 spatial scales used Each point in a certain frame is tracked to the next frame using median filtering in a dense optical flow field

Tracking Points of subsequent frames are concatenated to form a trajectory Trajectories are limited to ‘L’ frames in order to avoid drift from their initial location The shape of a trajectory of length ‘L’ is described by the sequence where The resulting vector is normalized by

Trajectory descriptors Histogram of Oriented Gradient (HOG) Histogram of Optical Flow (HOF) HOGHOF Motion Boundary Histogram (MBH) Take local gradients of x-y flow components and compute HOG as in static images

Bag of Features Codebook of descriptors (trajectories, HOG, HOF, MBH) constructed Number of visual words = ,000 randomly selected training features used Each video described by a histogram of visual word occurances Non-linear SVM with Chi-Square kernel used to classify the actions

Results

Summary Action recognition using HMM’s Temporal Template Matching Spatio Temporal Interest Points Bag of Visual Words Technique for action recognition Dense Trajectories

References “ Evaluation of local spatio-temporal features for action recognition”,Heng Wang et all “Action Recognition by Dense Trajectories”, Heng Wang et all CVPR 2011 tutorial on “Human Activity Analysis” CVPR 2014 tutorial on “Emerging topics in Human Activity recognition”