A Practical Approach to Recognizing Physical Activities Jonathan Lester Tanzeem Choudhury Gaetano Borriello.

Slides:



Advertisements
Similar presentations
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Advertisements

Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
AdaBoost & Its Applications
Face detection Many slides adapted from P. Viola.
Detecting Pedestrians by Learning Shapelet Features
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Accelerometer-based Transportation Mode Detection on Smartphones
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.
Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,
A Practical Approach to Recognizing Physical Activities Jonathan Lester, Tanzeem Choudhury, and Gaetano Borriello In Proceedings of the Fourth International.
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Online classifier construction algorithm for human activity detection using a tri-axial accelerometer Yen-Ping Chen, Jhun-Ying Yang, Shun-Nan Liou, Gwo-Yun.
Robust Real-Time Object Detection Paul Viola & Michael Jones.
Viola and Jones Object Detector Ruxandra Paun EE/CS/CNS Presentation
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Viewpoint Tracking for 3D Display Systems A look at the system proposed by Yusuf Bediz, Gözde Bozdağı Akar.
Activity Recognition from User- Annotated Acceleration Data Presented by James Reinebold CSCI 546.
© Phil Hurvitz, 2006Slide 1 (of 26) Validation of New Technologies and Methodologies for Measuring Physical Activity and Location in Real Time-Space Phil.
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
David Rogers, Stu Andrzejewski, Kelly Desmond, Brad Garrod.
Feature Extraction Spring Semester, Accelerometer Based Gestural Control of Browser Applications M. Kauppila et al., In Proc. of Int. Workshop on.
Face Detection using the Viola-Jones Method
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Sensing Meets Mobile Social Networks: The Design, Implementation and Evaluation of the CenceMe Application Emiliano Miluzzo†, Nicholas D. Lane†, Kristóf.
Operated by Los Alamos National Security, LLC for NNSA U N C L A S S I F I E D Slide 1 Sensor network based vehicle classification and license plate identification.
SoundSense by Andrius Andrijauskas. Introduction  Today’s mobile phones come with various embedded sensors such as GPS, WiFi, compass, etc.  Arguably,
Amin Rasekh, Chien-An Chen, Yan Lu CSCE 666 Project Presentation.
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
MARS: A Muscle Activity Recognition System Enabling Self-configuring Musculoskeletal Sensor Networks IPSN 2013 NSLab study group 2013/06/17 Presented by:
EWatch: A Wearable Sensor and Notification Platform Paper By: Uwe Maurer, Anthony Rowe, Asim Smailagic, Daniel P. Siewiorek Presenter: Ke Gao.
Keystroke Recognition using WiFi Signals
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Human Activity Recognition Using Accelerometer on Smartphones
Sound Detection Derek Hoiem Rahul Sukthankar (mentor) August 24, 2004.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Physical-layer Identification of UHF RFID Tags Authors: Davide Zanetti, Boris Danev and Srdjan Capkun Presented by Zhitao Yang 1.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Limitations of Cotemporary Classification Algorithms Major limitations of classification algorithms like Adaboost, SVMs, or Naïve Bayes include, Requirement.
Power-Accuracy Tradeoffs in Human Activity Transition Detection Prepared for DATE 2010 Dresden, Germany Jeffrey Boyd, Hari Sundaram, Aviral Shrivastava.
Training Conditional Random Fields using Virtual Evidence Boosting Lin Liao, Tanzeem Choudhury †, Dieter Fox, and Henry Kautz University of Washington.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, VOL. 19, NO
Providing User Context for Mobile and Social Networking Applications A. C. Santos et al., Pervasive and Mobile Computing, vol. 6, no. 1, pp , 2010.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
CSE 473 Ensemble Learning. © CSE AI Faculty 2 Ensemble Learning Sometimes each learning technique yields a different hypothesis (or function) But no perfect.
Performance Comparison of Speaker and Emotion Recognition
Hand Gesture Recognition Using Haar-Like Features and a Stochastic Context-Free Grammar IEEE 高裕凱 陳思安.
Saisakul Chernbumroong, Shuang Cang, Anthony Atkins, Hongnian Yu Expert Systems with Applications 40 (2013) 1662–1674 Elderly activities recognition and.
Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.
Sensor Analysis – Part II A literature based exploration Thomas Plötz [material taken from the original papers and John Krumm “Ubiquitous Computing Fundamentals”
A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.
Hand Detection with a Cascade of Boosted Classifiers Using Haar-like Features Qing Chen Discover Lab, SITE, University of Ottawa May 2, 2006.
Behavior Recognition Based on Machine Learning Algorithms for a Wireless Canine Machine Interface Students: Avichay Ben Naim Lucie Levy 14 May, 2014 Ort.
Using the Fisher kernel method to detect remote protein homologies Tommi Jaakkola, Mark Diekhams, David Haussler ISMB’ 99 Talk by O, Jangmin (2001/01/16)
Session 7: Face Detection (cont.)
ADABOOST(Adaptative Boosting)
Keystroke Recognition using Wi-Fi Signals
Power-Accuracy Tradeoffs in Human Activity Transition Detection
AUthor:Liang WanG, Tao Gu, Xianping Tao, Jian Lu reporter:何知涵
Presentation transcript:

A Practical Approach to Recognizing Physical Activities Jonathan Lester Tanzeem Choudhury Gaetano Borriello

The Idea What am I doing?

Requirements, and intriguing questions Single point of location on the body and location independence – Does it matter? Work for any person Personalization only increases accuracy – How much variation across users? Cost Effective – How many sensors are needed?

Sensors & Locations - Single versus Multiple Majority of research with single sensor modality, at multiple locations Sensor placement and data collection becomes cumbersome Use of single sensor (single location) reduced accuracy by 35% [1] Compensate for accuracy lost using multiple sensor modalities [2] More comfortable for user to wear at single location Can integrate into existing mobile platforms [1] Bao, L., Intille, S.: Activity Recognition from User-Annotated Acceleration Data. In: Proc. Proc. Pervasive (2004) 1-17 [2] Choudhury, T., Lester, J., Kern, N., Borriello, G., Hannaford, B.. A Hybrid Discriminative Discriminative/Generative Approach for Modeling Human Activities.

Components Three main components Sensing Module  Gathers low level information about activities  microphone, accelerometer, light sensors, etc Feature Processing and Selection Module  Process raw data from sensors into features  Features discriminate activities Classification Module  Tags the activity being performed from predefined types

The Experimental System A Multi-Mode Sensor Board (MSB) Bluetooth Intel Mote (iMote) USB rechargeable battery board

Methodology Data collected across 12 individuals performing 8 activities 8 in mid twenties, 4 in their thirties 2/3 of data collected in a computer science building, 1/3 collected in office building Data collected from 3 MSBs located at shoulder strap, side of waist, right wrist Given a sequence of activities - sitting followed by climbing stairs followed by brushing teeth Data collected on laptop, annotated by an observer using iPaq

Feature Extraction About 18,000 samples per second How to bring out the important details – compute features Features – linear, log scale FFT frequency coefficients, spectral entropy, band pass filter coefficients, correlations, integrals, means, etc Not every feature for every sensor Total of 651 features computed Need to pick few important features to avoid confusing classification algorithm

Training the Classifiers Have all the data with the features extracted Need to separate the data into training and testing sections Data separated into 4 folds 3 folds used for training the classifier 1 fold used for testing Done with all combinations of 3 of 4 folds Temperature and humidity sensor data not used

Classifier Structure Modified version of AdaBoost [1] used to select best features and rank them based on classification performance Most discriminative sub-set of features selected to learn static classification of activities Usage of Hidden Markov model (HMM) as a second layer using the class probabilities Discriminative classifier tuned to make activities more distinguishable, while HMM ensures temporal smoothness [1] Viola, P., Jones, M.: Rapid Object Detection using a Boosted Cascade of Simple Features. In: Proc. Computer Vision and Pattern Recognition (2001)

Confusion Matrix – The Results Table: Confusion matrix for the static and HMM classifier trained using a single stream of sensor data from all three locations on the body

Location Sensitivity Sensors at different locations on body give better classification Want to make the sensors invisible to the user Ideally classification should work accurately with data from different locations on the body Allows user to carry at any convenient location How to determine which location is best to place the sensor board? Train four sets of classifiers – (1) data from all 3 locations (2) data from shoulder (3) data from waist (4) data from wrist

Location Sensitivity Results Precision = True Positive / (True Positive + False Positive) Recall = True Positive / (True Positive + False Negative) Overall accuracy = (True Positive + True Negative) / Total number of example Table: Overall precision/recall for the static and HMM classifiers trained/tested on all locations (top row) and a single location (bottom rows).

Variation across users Is there any need for training period for particular end user? User would want it to work immediately upon purchase Good if it gives better results with more usage but reasonable performance to begin with Train the classifier with lot of data collected from diverse group of people Use the classifier for new individual – No need to collect any training data, no need to retrain the classifier

User Variation – Test 1 Data collected from 12 users. Select N (N = 1 to 12) users’ data for training. Test on all 12 users’ data Idea – To show that accuracy improves with increase in training data

User Variation – Test 2 Data collected from 12 users. Select N (N = 1 to 12) users’ data for training. Test on remaining 12 – N users’ data Idea – To show improvement in accuracy due to general classifier and not increased training data size

Sensors Necessary Do not necessarily need all sensors on MSB to accurately classify Results shown here do not use temperature and humidity sensors Reducing number of sensors makes system less susceptible to environmental changes Increasing number of sensors increases complexity, power and computational requirements, thus higher costs Fewer sensors means smaller sizes, more practical to use 3 important sensors – accelerometer, audio and barometric pressure sensor

Single Versus Multiple Sensors Table: Classifiers trained using a single sensor. Overall accuracies approximately 65% Table: Classifiers trained using three sensors. Overall accuracies approximately 90%

Conclusion Accurate recognition of range of activities can be achieved Have answered the three questions  Single-board activity recognition system generalizes well. No need to learn location specific activity models  For the data set, no need for customization to specific individuals  Although 7 different modalities on board, but three main – audio, barometric pressure and accelerometer Participants were young healthy individuals How to handle activities that do not fall into predefined classes? How to handle ambiguities associated with compound activities?