Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.

Slides:

Advertisements

Similar presentations

Distinctive Image Features from Scale-Invariant Keypoints

Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.

Presented by Xinyu Chang

Content-Based Image Retrieval

- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -

LARGE-SCALE IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill road building car sky.

Tracking Learning Detection

Patch to the Future: Unsupervised Visual Prediction

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Transferable Dictionary Pair based Cross-view Action Recognition Lin Hong.

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

IBBT – Ugent – Telin – IPI Dimitri Van Cauwelaert A study of the 2D - SIFT algorithm Dimitri Van Cauwelaert.

Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures.

Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.

Robust and large-scale alignment Image from

Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.

Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

A Study of Approaches for Object Recognition

LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.

CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.

Vigilant Real-time storage and intelligent retrieval of visual surveillance data Dr Graeme A. Jones.

Scale Invariant Feature Transform (SIFT)

1 Invariant Local Feature for Object Recognition Presented by Wyman 2/05/2006.

Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

A String Matching Approach for Visual Retrieval and Classification Mei-Chen Yeh* and Kwang-Ting Cheng Learning-Based Multimedia Lab Department of Electrical.

DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.

Image Processing David Kauchak cs458 Fall 2012 Empirical Evaluation of Dissimilarity Measures for Color and Texture Jan Puzicha, Joachim M. Buhmann, Yossi.

Large Scale Recognition and Retrieval. What does the world look like? High level image statistics Object Recognition for large-scale search Focus on scaling.

Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Efficient Image Search and Retrieval using Compact Binary Codes

Recognizing Action at a Distance A.A. Efros, A.C. Berg, G. Mori, J. Malik UC Berkeley.

Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.

Bag of Video-Words Video Representation

Computer vision.

Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning

Flow Based Action Recognition Papers to discuss: The Representation and Recognition of Action Using Temporal Templates (Bobbick & Davis 2001) Recognizing.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Classification 2: discriminative models

Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.

Computational Photography Tamara Berg Features. Representing Images Keep all the pixels! Pros? Cons?

CAP 5415 Computer Vision Fall 2004

Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)

SVM-KNN Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alex Berg, Michael Maire, Jitendra Malik.

Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.

INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.

Modeling the Shape of a Scene: Seeing the trees as a forest Scene Understanding Seminar

Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.

Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.

Image Classification for Automatic Annotation

Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.

Indexing Time Series. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases Time Series databases Text databases.

Imran N. Junejo, Omar Javed and Mubarak Shah University of Central Florida, Proceedings of the 17th International Conference on Pattern Recognition, pp ,

SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

SIFT Scale-Invariant Feature Transform David Lowe

CS262: Computer Vision Lect 09: SIFT Descriptors

Data Driven Attributes for Action Detection

Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.

A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology

Nonparametric Semantic Segmentation

Tracking parameter optimization

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Image Segmentation Techniques

CS 1674: Intro to Computer Vision Scene Recognition

Aim of the project Take your image Submit it to the search engine

Brief Review of Recognition + Context

Presentation transcript:

Event prediction CS 590v

Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent environments – Healthcare: fall detector – Healthcare: hand-washing prompter

Common approaches Compare keypoint (joint) trajectory shapes Various spatio-temporal features with classifier

Trajectories of keypoints Yilmaz and Shah, 2005 – UCF Joint trajectories in XYT space Compare trajectory shapes to classify actions

Trajectories of keypoints Another approach: modeling trajectories likelihood – Flag low likelihood events

Scene context Idea: transfer event information only from similar images Use context to determine expected motion – E.g. climber on a rock wall vs. climber on a building

Features for matching images: Gist Oliva and Torralba, orientations 4 scales x 16 bins 512 dimensions Apply oriented Gabor filters over different scales Average filter energy in each bin Used for scene recognition Similar to SIFT (Lowe 1999)

Retrieving similar images from large image databases Hays and Efros, 2007 Image completion using Flickr images

Transfer of knowledge: SIFT-flow carsidewalk tree System C. Liu, J. Yuen, A. Torralba, 2009 Can “copy and paste” segmentation labels from similar labeled example to unknown

Task Predict motion from static images Predict semantic event from static images

Approach: high-level Model video as trajectories of keypoints – Cluster of trajectories for each object Global similarity measure

Tracking key points KLT feature tracker – Solve for the displacement d that optimizes a dissimilarity metric, per pair of consecutive frames Cluster trajectories into objects – By average distance between them

Comparing track clusters Create spatial histogram of cluster velocities Sum of histogram intersections is the similarity score 1-level spatial pyramid matching

Predicting local motion Average motion over N nearest neighbors

Event prediction Match query with similar track clusters from database Cluster retrieved tracks to reduce redundant results

Results: local motion prediction

Results: event prediction

Unusual events

Overall discussion Still need training data from scenes similar to target Possible corruption by unusual events in the training set? Applications – Unusual event detector – Knowledge transference between environments

The end