Cultural Differences and Similarities in Emotion Recognition
Vladimir Kurbalija, Mirjana Ivanović, Miloš Radovanović, Zoltan Geler, Dejan Mitrović, Weihui Dai, Weidong Zhao, Marija Semnic

Presentation transcript:

Cultural Differences and Similarities in Emotion Recognition
Vladimir Kurbalija, Mirjana Ivanović, Miloš Radovanović, Zoltan Geler, Dejan Mitrović, Weihui Dai, Weidong Zhao, Marija Semnic

AGENDA
- INTRODUCTION - Emotional Intelligence
- RELATED WORK
- DATA PREPROCESSING: EXTRACTING FREQUENCY BANDS
- EXPERIMENTAL SETUP
- EMOTION RECOGNITION BASED ON CLASSIFICATION ACCURACIES
- CONCLUSION

Introduction: Emotional Intelligence
Emotional Intelligence is a new discipline of knowledge dealing with the modeling, recognition, and control of human emotions.
It is an interdisciplinary research area related to machine learning, natural language processing, psychology…
Cognitive Model and Propagation Model

Introduction: Cognitive Model
Human emotion is caused by specific situations. An emotional change triggers a series of physiological responses through the nervous system and forms a unique subjective experience, which may cause changes in external expression: gesture, action, language…
The objective of the cognitive model is to reason about, predict, and understand human emotions, and to process emotions and respond in an appropriate way.
3 levels: words, sentence, text

Introduction
Emotions are a multi-disciplinary topic traditionally studied by philosophy, psychology, sociology, neuroscience, medicine, etc.
Recently, emotion recognition and simulation have become an important research topic in man-machine communication.
Emotional intelligence deals with the modeling, recognition, and control of human emotions, with applications in a new generation of intelligent information processing, intelligent services, and similar areas.
Contemporary research addresses emotion recognition in the human voice, applying speech signal processing and pattern recognition techniques such as cepstrum analysis, dynamic time warping, and hidden Markov modeling.

Introduction
Essential domains of EI have been recognized: knowing one’s emotions, managing emotions, motivating oneself, recognizing others’ emotions, and handling relationships.
Proper emotion detection is important, which calls for intensified research efforts and experiments on numerous appropriate data sets.
We conducted experiments with Serbian and Chinese colleagues: brain signals were measured as reactions to short vocal sentences in different emotional states (happiness, jealousy, anger, and sadness, all pronounced by a native Mandarin speaker).
Intention: to investigate whether language understanding has an impact on emotion perception, and whether people of different nationalities “feel” the emotions differently.


RELATED WORK
Emotion recognition from EEG signals is in its relative infancy.
Several papers indicate that the frontal lobes carry potentially useful information for emotion recognition.
At first, research on emotion recognition from EEG signals focused on stimuli from a single source, mostly visual and sometimes audio.

RELATED WORK
Research questions:
(1) Is the modality of the stimulus recognizable from the recorded brain signals?
(2) Are emotions recognizable from an EEG?
(3) What is the influence of the modality of the stimulus on the recognition rate?
(4) When using only five electrodes, what would be a good montage for emotion recognition?
(5) What features are interesting for emotion recognition?

RELATED WORK
Features can be extracted from the time series obtained from EEG signals in various ways.
Successful approaches to emotion recognition include the discrete wavelet transform, fractal dimension, and crossings features, used together with support vector machines (one of the most successful classifiers).

RELATED WORK
We explore classification accuracies of EEG signals with respect to emotion classification alone and to the combination of emotion and native language.
EEG signals are obtained using only audio stimuli.
The same set of sentences is uttered by professional actors with different emotional overtones.
We explore whether different individuals “feel” the same emotions in the same way, and whether language understanding has an impact on emotion perception.


DATA PREPROCESSING: EXTRACTING FREQUENCY BANDS
The usual approach in emotion detection algorithms is to decompose EEG signals into distinct frequency bands and then work on each band separately.
Most commonly used frequency bands: theta (4-8 Hz), alpha (8-14 Hz), beta (14-32 Hz), and gamma (32-64 Hz).
Lower frequencies (delta, 1-4 Hz) are associated with babies and sleeping adults; higher frequencies (i.e. above 64 Hz) represent noise.
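
The band limits listed on this slide can be written down as a small lookup table. The sketch below is only illustrative: the names `EEG_BANDS` and `band_of` are hypothetical, and treating each range as lower-inclusive and upper-exclusive is an assumption made here so that shared boundaries such as 8 Hz belong to exactly one band.

```python
# Band edges in Hz as listed on the slide.
# Assumption: lower edge inclusive, upper edge exclusive.
EEG_BANDS = {
    "delta": (1.0, 4.0),
    "theta": (4.0, 8.0),
    "alpha": (8.0, 14.0),
    "beta": (14.0, 32.0),
    "gamma": (32.0, 64.0),
}


def band_of(frequency_hz):
    """Return the name of the band containing frequency_hz, or None."""
    for name, (low, high) in EEG_BANDS.items():
        if low <= frequency_hz < high:
            return name
    return None  # below 1 Hz, or 64 Hz and above (treated as noise on the slide)
```

For example, `band_of(10.0)` falls in the alpha band, while `band_of(70.0)` returns `None`.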

DATA PREPROCESSING: EXTRACTING FREQUENCY BANDS
Frequency bands can be extracted from raw EEG signals using band-pass audio filters.
A band-pass filter passes the frequencies within a given range and attenuates those outside it.
To pre-process the data and extract the frequency bands, we used a two-pole Butterworth filter as the concrete implementation of the band-pass filter.
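
As a rough illustration of two-pole band-pass filtering, the sketch below builds a second-order band-pass section from the standard bilinear-transform biquad formulas and applies it sample by sample. It is not the authors' implementation: the slides give neither the filter design details nor the sampling rate, so the sampling rate passed in, the choice of the geometric mean of the band edges as center frequency, and the function names `bandpass_coefficients` and `apply_filter` are all assumptions.

```python
import math


def bandpass_coefficients(low_hz, high_hz, fs_hz):
    """Coefficients of a two-pole (second-order) band-pass section.

    Assumptions: center frequency is the geometric mean of the band edges,
    Q = center / bandwidth, and the gain at the center frequency is 1.
    """
    center = math.sqrt(low_hz * high_hz)
    q = center / (high_hz - low_hz)
    w0 = 2.0 * math.pi * center / fs_hz
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (alpha / a0, 0.0, -alpha / a0)
    a = (1.0, -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)
    return b, a


def apply_filter(b, a, signal):
    """Run the difference equation y[n] = sum(b*x) - sum(a*y) over the signal."""
    x1 = x2 = y1 = y2 = 0.0
    filtered = []
    for x0 in signal:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        filtered.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return filtered


def extract_band(signal, low_hz, high_hz, fs_hz):
    """Extract one frequency band, e.g. theta with low_hz=4 and high_hz=8."""
    b, a = bandpass_coefficients(low_hz, high_hz, fs_hz)
    return apply_filter(b, a, signal)
```

With a hypothetical sampling rate of 256 Hz, `extract_band(raw, 4.0, 8.0, 256.0)` would keep a 6 Hz component almost unchanged while strongly attenuating a 40 Hz component. A single second-order section has gentle roll-off, so neighboring bands overlap noticeably.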

DATA PREPROCESSING: EXTRACTING FREQUENCY BANDS
Figure. A part of a raw EEG signal (bottom) and the corresponding theta (middle) and alpha (top) frequency bands


EXPERIMENTAL SETUP
IDEA: discover whether there are differences in emotion recognition between people from different socio-cultural settings (data acquired using a standard procedure).
There were six participants: four native Mandarin speakers and two native Serbian speakers.
Scalp electrodes were applied using the system (one of the most widely used international standards).

EXPERIMENTAL SETUP
Electrodes of the international standard

EXPERIMENTAL SETUP
Instead of acquiring data from electrodes directly, it is common to use the bipolar model, which measures the difference between a pair of electrodes.
8 channels (C1 to C8): C1: Fp1-T3, C2: Fp2-T4, C3: T3-O1, C4: T4-O2, C5: Fp1-C3, C6: Fp2-C4, C7: C3-O1, C8: C4-O2.
Experiments were performed in a relatively controlled environment.
Chinese and Serbian participants listened to audio clips expressing different kinds of emotions.
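
The bipolar model described on this slide amounts to a per-sample subtraction between two electrode signals. The following sketch uses the eight electrode pairs listed above; the function name `bipolar_montage` and the input layout (a dictionary from electrode name to a list of voltage samples) are assumptions made for illustration.

```python
# Electrode pairs for channels C1..C8, copied from the slide.
BIPOLAR_CHANNELS = {
    "C1": ("Fp1", "T3"),
    "C2": ("Fp2", "T4"),
    "C3": ("T3", "O1"),
    "C4": ("T4", "O2"),
    "C5": ("Fp1", "C3"),
    "C6": ("Fp2", "C4"),
    "C7": ("C3", "O1"),
    "C8": ("C4", "O2"),
}


def bipolar_montage(electrode_samples):
    """Derive bipolar channels as sample-wise differences of electrode pairs.

    electrode_samples: dict mapping electrode name -> list of voltages
    (hypothetical layout; all lists are assumed to have equal length).
    """
    channels = {}
    for channel, (first, second) in BIPOLAR_CHANNELS.items():
        channels[channel] = [
            u - v
            for u, v in zip(electrode_samples[first], electrode_samples[second])
        ]
    return channels
```

Note that the labels C3 and C4 appear both as electrode names and as channel names, which is why the sketch keeps electrodes and derived channels in separate dictionaries.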

EXPERIMENTAL SETUP
The audio clips were uniformly structured.

EXPERIMENTAL SETUP
The same set of sentences is reproduced with appropriate intonation in all six emotions.
The purpose: to discover whether the manner of pronunciation affects the perception of emotions regardless of speech understanding.
The last four events were different music clips (traditional, jazz, rock), used to investigate whether the same music causes the same brain activity, or emotion, in different participants.


EXPERIMENTAL SETUP
A part of the recorded data for one participant.
All 10 events were presented to each of the six participants, and brain reactions were recorded.
The number of samples (rows) varies between participants.
EEG signals from each channel are first decomposed into their alpha, beta, gamma, and theta frequency bands.
Each signal from each channel and each frequency band is then split into 10 time series, one per event.

EXPERIMENTAL SETUP
Finally, we obtained a labeled dataset consisting of 1920 time series.
The label of each time series is constructed in two different ways:
(1) the event number (emotion) from the table;
(2) a combination of the event number and the nationality of the participant.
We investigated the possibility of applying techniques of time-series data mining to emotion recognition in the human voice.
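
The figure of 1920 time series follows from the setup described so far: 6 participants × 8 channels × 4 frequency bands × 10 events. The sketch below enumerates that grid and attaches both kinds of label. The participant identifiers, the record layout, and the `"event-nationality"` label format are hypothetical; only the counts and the 4:2 split of Chinese and Serbian participants come from the slides.

```python
from itertools import product

# Hypothetical participant IDs; four Chinese and two Serbian participants.
PARTICIPANTS = {
    "P1": "Chinese", "P2": "Chinese", "P3": "Chinese", "P4": "Chinese",
    "P5": "Serbian", "P6": "Serbian",
}
CHANNELS = [f"C{i}" for i in range(1, 9)]      # C1..C8
BANDS = ["theta", "alpha", "beta", "gamma"]
EVENTS = range(1, 11)                          # event numbers 1..10


def build_index():
    """List one record per (participant, channel, band, event) time series."""
    records = []
    for (pid, nationality), channel, band, event in product(
        PARTICIPANTS.items(), CHANNELS, BANDS, EVENTS
    ):
        records.append({
            "participant": pid,
            "channel": channel,
            "band": band,
            # Labeling scheme 1: event number only.
            "label_event": event,
            # Labeling scheme 2: event number combined with nationality.
            "label_event_nationality": f"{event}-{nationality}",
        })
    return records
```

Under these assumptions `len(build_index())` is 1920, with 10 distinct labels under the first scheme and 20 under the second.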


EMOTION RECOGNITION BASED ON CLASSIFICATION ACCURACIES
With the time series we performed two sets of experiments.
First set: the time series were labeled solely by the emotion (the event number), regardless of the participant.
The aim was to examine whether there are similarities or common patterns between the time series of different participants for the same emotion.
Such patterns could indicate that different people experience the same emotion in a similar way, independently of their nationality or language understanding.

EMOTION RECOGNITION BASED ON CLASSIFICATION ACCURACIES
Second set: we incorporated information about the nationality of the participants into the class labels.
Motivation: the assumption that there may be significant differences in the responses of participants of different nationalities, leading to considerably dissimilar time series.

EMOTION RECOGNITION BASED ON CLASSIFICATION ACCURACIES
The common methodology for estimating the similarities between time series is to measure the classification accuracy on labeled time series.
As the evaluation technique we used 10 runs of 10-fold stratified cross-validation (SCV10x10).
The classifier was the simple 1NN (one-nearest-neighbor) classifier.
All the experiments were performed using the FAP system.
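
The evaluation scheme named on this slide (10 runs of 10-fold stratified cross-validation with a 1NN classifier) can be sketched in a few lines. This is not the FAP system and does not reproduce the authors' experiments: Euclidean distance is an assumption, since the slides do not say which distance measure was used, and all function names as well as the synthetic `toy_dataset` are hypothetical.

```python
import math
import random
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two equal-length series (assumed measure)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def predict_1nn(train, query, distance=euclidean):
    """Return the label of the training series closest to the query."""
    return min(train, key=lambda item: distance(item[0], query))[1]


def stratified_folds(labels, k, rng):
    """Split indices into k folds, spreading each class evenly across folds."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    folds = [[] for _ in range(k)]
    position = 0
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        for index in indices:
            folds[position % k].append(index)  # deal indices round-robin
            position += 1
    return folds


def repeated_stratified_cv(series, labels, k=10, runs=10, seed=0):
    """Mean 1NN accuracy over `runs` repetitions of k-fold stratified CV."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(runs):
        for fold in stratified_folds(labels, k, rng):
            if not fold:
                continue
            held_out = set(fold)
            train = [(series[i], labels[i])
                     for i in range(len(series)) if i not in held_out]
            correct = sum(predict_1nn(train, series[i]) == labels[i]
                          for i in fold)
            accuracies.append(correct / len(fold))
    return sum(accuracies) / len(accuracies)


def toy_dataset(per_class=20, length=8, seed=1):
    """Synthetic, well-separated two-class data for demonstration only."""
    rng = random.Random(seed)
    series, labels = [], []
    for label, level in (("class_a", 0.0), ("class_b", 5.0)):
        for _ in range(per_class):
            series.append([level + rng.uniform(-0.5, 0.5)
                           for _ in range(length)])
            labels.append(label)
    return series, labels
```

On the synthetic data the classes are far apart, so `repeated_stratified_cv(*toy_dataset())` classifies every held-out series correctly. High accuracy under this scheme indicates that series with the same label are close to each other, which is the sense in which classification accuracy serves as a proxy for similarity.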

EMOTION RECOGNITION BASED ON CLASSIFICATION ACCURACIES
With this evaluation we intended to investigate whether language understanding has an impact on emotion perception and whether different nationalities “feel” the emotions differently.
Conclusion: the brain signals recorded while listening to audio tracks of different emotions are not considerably different.
Some potentially useful findings from the perspective of emotion detection:
- Channel C3 gives better results in most cases.
- The gamma frequency band generally gives the worst results.
These findings are not formally grounded, and more investigation is needed to confirm them.


CONCLUSIONS
The results are largely negative, but some conclusions can be drawn:
- Signals corresponding to the same emotion produced by different people are very different.
- The literature suggests that channels associated with the frontal lobe carry the most useful information for the emotion recognition task.
- In our measurements the frontal lobe was only partially covered, which may be another reason for the high error rates observed.

CONCLUSIONS
In future work we will construct data sets containing multiple instances of signals originating from one individual, which we expect to yield much better classification accuracy.
We will also investigate discrimination of nationality based on the same EEG signals.
Preliminary results are positive: some channels and bands that were not adequate for emotion recognition could carry useful information for classification based on nationality.
