Weakly Supervised Action Recognition

Slides:

Advertisements

Similar presentations

Latent SVMs for Human Detection with a Locally Affine Deformation Field Ľubor Ladický 1 Phil Torr 2 Andrew Zisserman 1 1 University of Oxford 2 Oxford.

Advertisements

Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.

CVPR2013 Poster Modeling Actions through State Changes.

A Discriminative Key Pose Sequence Model for Recognizing Human Interactions Arash Vahdat, Bo Gao, Mani Ranjbar, and Greg Mori ICCV2011.

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.

A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.

Maximum Margin Markov Network Ben Taskar, Carlos Guestrin Daphne Koller 2004.

Structured SVM Chen-Tse Tsai and Siddharth Gupta.

Structured Hough Voting for Vision-based Highway Border Detection

INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.

Structural Human Action Recognition from Still Images Moin Nabi Computer Vision Lab. ©IPM - Oct

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.

GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.

Detecting Pedestrians by Learning Shapelet Features

Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.

Beyond Actions: Discriminative Models for Contextual Group Activities Tian Lan School of Computing Science Simon Fraser University August 12, 2010 M.Sc.

Robust Higher Order Potentials For Enforcing Label Consistency

CVR05 University of California Berkeley 1 Familiar Configuration Enables Figure/Ground Assignment in Natural Scenes Xiaofeng Ren, Charless Fowlkes, Jitendra.

Learning to Segment from Diverse Data M. Pawan Kumar Daphne KollerHaithem TurkiDan Preston.

Multi-view Exploratory Learning for AKBC Problems Bhavana Dalvi and William W. Cohen School Of Computer Science, Carnegie Mellon University Motivation.

Hierarchical Subquery Evaluation for Active Learning on a Graph Oisin Mac Aodha, Neill Campbell, Jan Kautz, Gabriel Brostow CVPR 2014 University College.

Latent Boosting for Action Recognition Zhi Feng Huang et al. BMVC Jeany Son.

Bag of Video-Words Video Representation

Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.

Multiple Instance Real Boosting with Aggregation Functions Hossein Hajimirsadeghi and Greg Mori School of Computing Science Simon Fraser University International.

Week 9 Presented by Christina Peterson. Recognition Accuracies on UCF Sports data set Method Accuracy (%)DivingGolfingKickingLiftingRidingRunningSkating.

SVM Support Vector Machines Presented by: Anas Assiri Supervisor Prof. Dr. Mohamed Batouche.

Optimizing Average Precision using Weakly Supervised Data Aseem Behl IIIT Hyderabad Under supervision of: Dr. M. Pawan Kumar (INRIA Paris), Prof. C.V.

Semantic Embedding Space for Zero Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.

INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.

Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.

Associative Hierarchical CRFs for Object Class Image Segmentation

Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.

Ariadna Quattoni Xavier Carreras An Efficient Projection for l 1,∞ Regularization Michael Collins Trevor Darrell MIT CSAIL.

CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Describing People: A Poselet-Based Approach to Attribute Classification.

Convolutional Restricted Boltzmann Machines for Feature Learning Mohammad Norouzi Advisor: Dr. Greg Mori Simon Fraser University 27 Nov

6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.

1 Bernard Ng 1, Arash Vahdat 2, Ghassan Hamarneh 3, Rafeef Abugharbieh 1 Contact 1 Biomedical Signal and Image Computing Lab,

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

LECTURE 20: SUPPORT VECTOR MACHINES PT. 1 April 11, 2016 SDS 293 Machine Learning.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

A Hierarchical Deep Temporal Model for Group Activity Recognition

Support Vector Machine Slides from Andrew Moore and Mingyue Tan.

Hybrid Deep Learning for Reflectance Confocal Microscopy Skin Images

Human Action Recognition Week 10

From Vision to Grasping: Adapting Visual Networks

Data Driven Attributes for Action Detection

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Recognizing Deformable Shapes

Action Recognition in the Presence of One

Cold-Start Heterogeneous-Device Wireless Localization

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Machine Learning Week 1.

Attributes and Simile Classifiers for Face Verification

Machine Learning Week 2.

Bilinear Classifiers for Visual Recognition

Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.

Jia-Bin Huang Virginia Tech ECE 6554 Advanced Computer Vision

Xiaodan Liang Sun Yat-Sen University

Data Driven Attributes for Action Detection

Human Action Recognition Week 8

Discriminative Probabilistic Models for Relational Data

MAS 622J Course Project Classification of Affective States - GP Semi-Supervised Learning, SVM and kNN Hyungil Ahn

Presentation transcript:

Weakly Supervised Action Recognition Simon Fraser University Vision and Media Lab Weakly Supervised Action Recognition Nataliya Shapovalova, Arash Vahdat, Kevin Cannons, Tian Lan, and Greg Mori PROBLEM Perform action classification while: – localizing the evidence from the video that led to the classification decision – encouraging consistency of latent variables across all the training data Contribution: A novel Similarity Constrained Latent SVM that considers pairwise similarity of latent variables across all the training data MODEL FORMULATION The scoring function for image feature x, latent region h and action label y is defined as: Latent variable h is a collection of similar regions across all the frames of the video Action label Video Training videos Test video Output Diving video-action potential latent region-action potential SIMILARITY CONSTRAINED LSVM Extends the Latent SVM, adding one more slack variable: SCLSVM learning requires inference of h, which is challenging due to the added constraint that links all the latent variables for all videos in the training set. new term, penalty for dissimilarity of latent variables constraint on similarity; linking all the latent variables together pairwise dissimilarity between selected latent region of video i and latent region of video j EXPERIMENTS Dataset: UCF-sports Quantitative results of classification accuracy and regions similarity: Qualitative examples of classification and evidence localization: BoW LSVM SCLSVM Lan et al. ICCV11 Accuracy 65.4 70.4 75.3 73.3 Regions Similarity – 0.1928 0.2322 Examples of correctly classified testing videos Misclassified videos