Tim Sheerman-Chase, Eng-Jon Ong and Richard Bowden

Slides:



Advertisements
Similar presentations
Shape Matching and Object Recognition using Low Distortion Correspondence Alexander C. Berg, Tamara L. Berg, Jitendra Malik U.C. Berkeley.
Advertisements

Active Appearance Models
Ke Chen 1, Shaogang Gong 1, Tao Xiang 1, Chen Change Loy 2 1. Queen Mary, University of London 2. The Chinese University of Hong Kong VGG reading group.
Face Alignment by Explicit Shape Regression
Loris Bazzani*, Marco Cristani*†, Alessandro Perina*, Michela Farenzena*, Vittorio Murino*† *Computer Science Department, University of Verona, Italy †Istituto.
Kien A. Hua Division of Computer Science University of Central Florida.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
Patch to the Future: Unsupervised Visual Prediction
Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction Atul Kanaujia, CBIM, Rutgers Cristian Sminchisescu, TTI-C Dimitris Metaxas,CBIM, Rutgers.
Face Alignment by Explicit Shape Regression
Introduction to Data-driven Animation Jinxiang Chai Computer Science and Engineering Texas A&M University.
Crowdsourcing research data UMBC ebiquity,
4EyesFace-Realtime face detection, tracking, alignment and recognition Changbo Hu, Rogerio Feris and Matthew Turk.
Feature Extraction for Outlier Detection in High- Dimensional Spaces Hoang Vu Nguyen Vivekanand Gopalkrishnan.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
The guessability of traffic signs: Effects of prospective-user factors and sign design features Author: Annie W.Y.Ng Alan H.S. Chan Accident Analysis and.
3D Fingertip and Palm Tracking in Depth Image Sequences
Multimodal Interaction Dr. Mike Spann
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Graphite 2004 Statistical Synthesis of Facial Expressions for the Portrayal of Emotion Lisa Gralewski Bristol University United Kingdom
Page 1 CSISS Center for Spatial Information Science and Systems Design and Implementation of CWIC Metrics Weiguo Han, Liping Di, Yuanzheng Shao, Lingjun.
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
Perceptual Analysis of Talking Avatar Head Movements: A Quantitative Perspective Xiaohan Ma, Binh H. Le, and Zhigang Deng Department of Computer Science.
Page 1 CSISS Center for Spatial Information Science and Systems CWIC Metrics: Current and Future Weiguo Han, Liping Di, Yuanzheng Shao, Lingjun Kang Center.
Interactive Learning of the Acoustic Properties of Objects by a Robot
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR
3D Face Recognition Using Range Images
WCRP Extremes Workshop Sept 2010 Detecting human influence on extreme daily temperature at regional scales Photo: F. Zwiers (Long-tailed Jaeger)
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Crowd-based mining of reusable process model patterns Carlos Rodríguez, Florian Daniel, Fabio Casati BPM 2014, September 9th 2014, Eindhoven, The Netherlands.
PRESENTED BY KE CHEN DEPARTMENT OF SIGNAL PROCESSING TAMPERE UNIVERSITY OF TECHNOLOGY, FINLAND CUMULATIVE ATTRIBUTE SPACE FOR AGE AND CROWD DENSITY ESTIMATION.
Descriptive Statistics The means for all but the C 3 features exhibit a significant difference between both classes. On the other hand, the variances for.
Meta-analysis Overview
Descriptive Statistics ( )
Why Model? Make predictions or forecasts where we don’t have data.
Final Projects The final project is expected to be an independent project of your own devising addressing a topic related to cognition or neuroscience.
CSC2535: Computation in Neural Networks Lecture 11 Extracting coherent properties by maximizing mutual information across space or time Geoffrey Hinton.
University of Rochester
Lecture 1: Introduction and the Boolean Model Information Retrieval
Guillaume-Alexandre Bilodeau
Attention Components and Creative Potential: An ERP Exploration
Gait Recognition Gökhan ŞENGÜL.
Instance Based Learning
Who is the Expert? Combining Intention and Knowledge of Online Discussants in Collaborative RE Tasks Itzel Morales-Ramirez1,2, Matthieu Vergne1,2, Mirko.
General Linear Model & Classical Inference
Face Recognition and Feature Subspaces
Principal Component Analysis
Anastassia Loukina, Klaus Zechner, James Bruno, Beata Beigman Klebanov
Learning Gender with Support Faces
BUS173: Applied Statistics
Data Mining 資料探勘 分群分析 (Cluster Analysis) Min-Yuh Day 戴敏育
Ying Dai Faculty of software and information science,
UNODC-UNECE Manual on Victimization Surveys: Content
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Presenter: Simon de Leon Date: March 2, 2006 Course: MUMT611
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Ying Dai Faculty of software and information science,
Statistics II: An Overview of Statistics
Dialogue State Tracking & Dialogue Corpus Survey
J. Ellis, F. Jenet, & M. McLaughlin
15.1 The Role of Statistics in the Research Process
Introduction to Sensor Interpretation
Ying Dai Faculty of software and information science,
Introduction to Sensor Interpretation
Measuring Learning During Search: Differences in Interactions, Eye-Gaze, and Semantic Similarity to Expert Knowledge Florian Groß Mai
CVPR 2019 Poster.
Presentation transcript:

Cultural Factors in the Regression of Non-verbal Communication Perception Tim Sheerman-Chase, Eng-Jon Ong and Richard Bowden CVSSP, University of Surrey, UK Workshop on Human Interaction in Computer Vision ICCV 2011, 12 November 2011, Barcelona

Introduction State of NVC and data annotation TwoTalk Corpus Data annotation using crowd sourcing Automatic Regression of NVC Feature Extraction Testing and Performance Future work

Background Non-verbal communication in HCI Useful for novel interfaces Most emotion/NVC datasets are acted Difficulty in processing naturalistic data Annotation is time consuming and tedious A single or limited number of annotators Single cultural view point

Background Difference between acted and posed Cultural differences in expressing and perceiving NVC/emotions Diagram for cultural encoding and decoding rules?

TwoTalk Corpus Aim: minimum constraints on natural conversation Two PAL cameras, two participants Seated opposite across table 4 conversations, 12 minutes each Selected 527 files, 37 min total

Questionnaire Categorical vs. continuous, Exemplars vs. abstraction How it is presented, cultural impact Commonly seen NVC signals Agreeing, thinking, questioning, understanding

Crowdsourcing Suitable for large tasks that can be split into simple steps Web based, usually browser based Motivation by money/altruism/challenge Quality control Crowdflower, Mechanical Turk (Amazon), Samasource, different demographics

Annotation Results 711 annotators, 79130 questions answered Annotations are sparse Three main cultures identified by IP address India, Kenya and UK Many annotators did not cooperate Random results need to be removed

Annotation Quality Uncooperative workers Erroneous work may be rejected Prevention No way to pre-screen workers Workers are almost anonymous (apart from IP address and timing) Sanity questions Filtering results, during work or after

Annotation Filtering Cooperative annotators Pearson's correlation with some ideal standard Correlation: 1 (or -1) perfect correlation, 0 uncorrelated Use mode in culture to find robust consensus Remove annotators below 0.2 correlation Take mean of remaining annotators 𝜌 𝑋,𝑌 = 𝑐𝑜𝑣 𝑋,𝑌 𝜎 𝑋 𝜎 𝑌 Covariance of X w.r.t. Y Population variance

Annotation Filtering

Frequency of Correlation Correlation with own culture mode Discard annotators with correlation < 0.2

Cultural Patterns in Annotation Check for cultural differences in annotation For each culture, For each clip, Concatenate culture consensus into one 4D vector Flatten space into 2D using Sammon mapping Attempts to preserve distances

Cultural Patterns in Annotation Cultures occupy different areas of space, caused either by differences in perceiving NVC or differences in using questionnaire.

Compare Annotators to Consensus Compare filtered annotators with their culture mean consensus Better to use specialised culture model rather than ignoring culture (global mean) Correlation of Annotators with Mean Consensus

Overview of System Track Facial Features LP flock trackers (Ong et al. 2009) Extract features Distances between pairs of trackers Train regressor, ν-SVR (Schölkopf et al. 2009) 8 fold person independent testing

Position of Trackers 46 position

Feature Extraction Euclidean distance between pairs 1035 pairs between 46 trackers Features are whitened and centred Removes face shape information

System Overview

Results Correlation performances are relatively low Extreme difficulty of task Low inter-annotator agreement Questioning is lowest, verbal component

Results Training and testing on same culture is optimal Performance is worse if test data is different to training data

Results Typical results for a single NVC category Thinking, UK annotation Correlation 0.46

Results

Conclusions Crowd sourcing annotation data is effective if quality problems are managed Naturalistic NVC Regression is possible But challanging Specialising regressor for cultural annotations is better then ignoring culture

Future Work Using mean and variance of clip discards information Temporal Some frames are more important that others Multiple Instance Learning Record participants from multiple cultures then do multi-annotation, social factors Applications

Summary NVC and data annotation TwoTalk Corpus Data annotation using crowd sourcing Automatic Regression of NVC Feature Extraction Testing and Performance Future work