Relative Hidden Markov Models Qiang Zhang, Baoxin Li Arizona State University.

Slides:

Advertisements

Similar presentations

We consider situations in which the object is unknown the only way of doing pose estimation is then building a map between image measurements (features)

Advertisements

FEATURE PERFORMANCE COMPARISON FEATURE PERFORMANCE COMPARISON y SC is a training set of k-dimensional observations with labels S and C b C is a parameter.

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.

Hidden Markov Models (HMM) Rabiner’s Paper

Ziming Zhang, Yucheng Zhao and Yiwen Wan.  Introduction&Motivation  Problem Statement  Paper Summeries  Discussion and Conclusions.

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.

Carol E. Reiley 1 Henry C. Lin 1, Balakrishnan Varadarajan 2, Balazs Vagvolgyi 1, Sanjeev Khudanpur 2, David D. Yuh 3, Gregory D. Hager 1 1 Engineering.

Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.

Patch to the Future: Unsupervised Visual Prediction

3D Human Body Pose Estimation from Monocular Video Moin Nabi Computer Vision Group Institute for Research in Fundamental Sciences (IPM)

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Hidden Markov Models Theory By Johan Walters (SR 2003)

Foundations of Statistical NLP Chapter 9. Markov Models 한 기 덕한 기 덕.

Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.

ACM Multimedia th Annual Conference, October , 2004

Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.

Recognition of Human Gait From Video Rong Zhang, C. Vogler, and D. Metaxas Computational Biomedicine Imaging and Modeling Center Rutgers University.

Incremental Learning of Temporally-Coherent Gaussian Mixture Models Ognjen Arandjelović, Roberto Cipolla Engineering Department, University of Cambridge.

1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.

1 Hidden Markov Model Instructor : Saeed Shiry  CHAPTER 13 ETHEM ALPAYDIN © The MIT Press, 2004.

Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.

Presented by Zeehasham Rasheed

AN ANALYSIS OF SINGLE- LAYER NETWORKS IN UNSUPERVISED FEATURE LEARNING [1] Yani Chen 10/14/

Visual scan path analysis using eye tracking data

Visual Speech Recognition Using Hidden Markov Models Kofi A. Boakye CS280 Course Project.

Real-Time Decentralized Articulated Motion Analysis and Object Tracking From Videos Wei Qu, Member, IEEE, and Dan Schonfeld, Senior Member, IEEE.

Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.

Enhancing Fundamentals of Laparoscopic Surgery Trainer Box via Designing A Multi-Sensor Feedback System Qiongjie Tian, Lin Chen and Baoxin Li {Qiongjie.Tian,

Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.

MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.

Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.

Alignment and classification of time series gene expression in clinical studies Tien-ho Lin, Naftali Kaminski and Ziv Bar-Joseph.

3D Motion Capture Assisted Video human motion recognition based on the Layered HMM Myunghoon Suk & Ashok Ramadass Advisor : Dr. B. Prabhakaran Multimedia.

Ganesh Sankaranarayanan PhD April 24, 2013 Orlando/ASE 2013 The Learning Plateau and the Learning Rate for the VBLaST PT© compared to the FLS simulator.

Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.

1 Multimodal Group Action Clustering in Meetings Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, Guillaume Lathoud IDIAP Research Institute.

A General Framework for Tracking Multiple People from a Moving Camera

Segmental Hidden Markov Models with Random Effects for Waveform Modeling Author: Seyoung Kim & Padhraic Smyth Presentor: Lu Ren.

International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.

Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.

A Regression Approach to Music Emotion Recognition Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, and Homer H. Chen, Fellow, IEEE IEEE TRANSACTIONS ON AUDIO,

An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.

Structure Discovery of Pop Music Using HHMM E6820 Project Jessie Hsu 03/09/05.

Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.

Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.

Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.

Conditional Random Fields for ASR Jeremy Morris July 25, 2006.

Image Classification for Automatic Annotation

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.

Chapter 8. Learning of Gestures by Imitation in a Humanoid Robot in Imitation and Social Learning in Robots, Calinon and Billard. Course: Robots Learning.

Journal of Visual Communication and Image Representation

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.

Statistical Models for Automatic Speech Recognition Lukáš Burget.

Statistical techniques for video analysis and searching chapter Anton Korotygin.

Constraint-Based Motion Planning for Multiple Agents Luv Kohli COMP259 March 5, 2003.

By: Nicole Cappella. Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved.

Graphical Models for Segmenting and Labeling Sequence Data Manoj Kumar Chinnakotla NLP-AI Seminar.

Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.

Visual Recognition Tutorial1 Markov models Hidden Markov models Forward/Backward algorithm Viterbi algorithm Baum-Welch estimation algorithm Hidden.

Spectral Algorithms for Learning HMMs and Tree HMMs for Epigenetics Data Kevin C. Chen Rutgers University joint work with Jimin Song (Rutgers/Palentir),

Visual Information Retrieval

Restricted Boltzmann Machines for Classification

Computational NeuroEngineering Lab

Video-based human motion recognition using 3D mocap data

Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks

Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method.

Presentation transcript:

Relative Hidden Markov Models Qiang Zhang, Baoxin Li Arizona State University

Introduction Understanding human motion is an important task in many ﬁelds: –sports, rehabilitation, surgery, computer animation and dance; One key problem in such applications is the analysis of skills associated with body motion. Many computational methods have been developed for this purpose: –A popular choice is HMM based.

Motivation One practical difficulty: they require the skill labels for the training data; Labeling the skill of a trainee is currently done by senior surgeons; –a costly practice; –subjective and less quantifiable; Sufficient and consistent skill label for a large amount of data—difficult, if not impossible.

Relative Label Instead of Absolute Label It is hard to say whether (b) is smiling or not. But it is easy to find (b) is less smiling than (a) but more than (c). We use similar idea in our motion analysis: given two videos, we only need to know which one is better.

Proposed Method

Proposed Method Cont’d

Measuring the Skills

Proposed Method

Proposed Method Cont’d

Update the Model

Update the Model Cont’d

Sub-problem 2

Algorithms

Relationship to Latent SVM Latent Variable State Path Pair as Latent Var.

Relationship to Latent SVM

Synthetic Experiment Randomly generate 6 HMMs and order them randomly; 200 sequences from each HMM: –50 for training and 150 for testing; From 50x6 sequences, 1000 pairs are randomly selected.

Convergence Behavior

Performances with # Training Pairs

Log Likelihood of Data

Parameter Selection C

Experiment Videos captured from FLS trainer box: –546 in total from 18 subjects; –Duration of four weeks, 3 sessions/week; Assumption: the skills of the subject get improved during training process –the scores of videos from the last session are better than those at the first session for each subject. Result: MethodHMMBaselineImproved # pairs Accuracy79.39%77.54%87.25%

Experiments: Skill Curve

Experiments: Learned Models

Experiment: Emotion Recognition Recognizing the emotional state of the speakers is very important; –Human computer interaction Existing methods try to classify the audio to predefined labels or levels: –Labeled training data is required; We can leverage the power of pairwise comparison via the proposed method;

Emotion Recognition with RHMM Extract MFCC Bag of Words RHMM Models Pairwise Rank Training Data 991 audios, 6 emotions at 7 levels, half for training and 1000 randomly selected pair for input.

Experiment Results DimensionImprovedBaselineHMM Pleasantness77.30%57.96%75.05% Arousal86.95%55.74%69.55% Dominance87.95%63.04%77.32% Credibility76.68%55.11%71.74% Interest81.90%62.56%78.07% Positivity74.99%67.84%70.36% Average81.28%53.14%73.72%

Future Work Theoretic analysis of the learned model; Allowing more types of observation models; Modeling multiple relative attributes jointly via multi-task learning framework; Modeling multiple attributes jointly can be also made possible by utilizing hierarchical Dirichlet Process.

Related Publications Qiang Zhang and Baoxin Li, Relative Hidden Markov Models for Evaluating Motion Skills, IEEE Computer Vision and Pattern Recognition (CVPR) 2013, Portland, OR Lin Chen, Qiongjie Tian, Qiang Zhang and Baoxin Li. Learning Skill-Defining Latent Space in Video-Based Analysis of Surgical Expertise: A Multi-Stream Fusion Approach. NextMed/MMVR20. San Diego, CA, Qiongjie Tian, Lin Chen, Qiang Zhang and Baoxin Li. Enhancing Fundamentals of Laparoscopic Surgery Trainer Box via Designing A Multi-Sensor Feedback System. NextMed/MMVR20. San Diego, CA, Qiang Zhang, Lin Chen, Qiongjie Tian and Baoxin Li. Video-based analysis of motion skills in simulation-based surgical training. SPIE Multimedia Content Access: Algorithms and Systems VII. San Francisco, CA, Qiang Zhang and Baoxin Li. Video-based motion expertise analysis in simulationbased surgical training using hierarchical dirichlet process hidden markov model. In Proceedings of the 2011 international ACM workshop on Medical multimedia analysis and retrieval (MMAR ’11). ACM, New York, NY, USA, Zhang, Qiang and Li, Baoxin, Towards Computational Understanding of Skill Levels in Simulation-Based Surgical Training via Automatic Video Analysis, International Symposium on Visual Computing (ISVC) 2010, Las egas, NV Qiang Zhang, Baoxin Li, “Relative Hidden Markov Models for Video-based Evaluation of Motion Skills in Surgical Training,” Pattern Analysis and Machine Intelligence, IEEE Transactions on [under review]