Recognition of meeting actions using information obtained from different modalities Natasa Jovanovic TKI University of Twente.

Slides:



Advertisements
Similar presentations
Qualities of a good facilitator
Advertisements

An overview of EMMA— Extensible MultiModal Annotation Michael Johnston AT&T Labs Research 8/9/2006.
A Meeting Browser that Learns Patrick Ehlen * Matthew Purver * John Niekrasz Computational Semantics Laboratory Center for the Study of Language and Information.
ENTERFACE’08 Multimodal Communication with Robots and Virtual Agents.
Introduction to Linguistics for lawyers
Content-based Video Indexing, Classification & Retrieval Presented by HOI, Chu Hong Nov. 27, 2002.
Chapter 3 Nonverbal Communication. What is nonverbal communication? “Everything that communicates a message but does not use words” Facial expressions,
John Hu Nov. 9, 2004 Multimodal Interfaces Oviatt, S. Multimodal interfaces Mankoff, J., Hudson, S.E., & Abowd, G.D. Interaction techniques for ambiguity.
Intonation and Communication Martha C. Pennington Martha C. Pennington Professor of Writing and Linguistics.
Unsupervised Clustering in Multimodal Multiparty Meeting Analysis.
Observational Studies Observing in the Field. Two types of observation Nonparticipant observation. Researcher is not part of the activity taking place,
Video Mining Learning Patterns of Behaviour via an Intelligent Image Analysis System.
OH 3-1 Agenda Review articles from Chapter 2 A little humor………. Chapter 3 – Communicating Effectively as a Leader and a Manager.
1 IUT de Montreuil Université Paris 8 Emotion in Interaction: Embodied Conversational Agents Catherine Pelachaud.
Building the Design Studio of the Future Aaron Adler Jacob Eisenstein Michael Oltmans Lisa Guttentag Randall Davis October 23, 2004.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
COMMUNICATION in Nursing Concepts of Nursing NUR 123.
Susan Bock Solutions 2007 Communication What is Communication? Communication Styles Communication Spectrum.
Chapter7 Symbolic Communication and Language. Chapter Outline  Language and Verbal Communication  Nonverbal Communication  Social Structure and Communications.
PREPARING YOUR DISSERTATION ENGLISH III Listening and Speaking Prof. Jonathan Vela.
Center for Human Computer Communication Department of Computer Science, OG I 1 Designing Robust Multimodal Systems for Diverse Users and Mobile Environments.
Working group on multimodal meaning representation Dagstuhl workshop, Oct
1 Multimodal Group Action Clustering in Meetings Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, Guillaume Lathoud IDIAP Research Institute.
Chapter 7. BEAT: the Behavior Expression Animation Toolkit
APML, a Markup Language for Believable Behavior Generation Soft computing Laboratory Yonsei University October 25, 2004.
Teaching Productive Skills Which ones are they? Writing… and… Speaking They have similarities and Differences.
Aim: How can we analyze different types of communication?
Different settings for communication
Towards multimodal meaning representation Harry Bunt & Laurent Romary LREC Workshop on standards for language resources Las Palmas, May 2002.
Overview Learn about effective communication
1 Language and Social Variation. 2 1.Introduction: In the previous lecture, we focused on the variation in language use in different geographical areas.
Turn-taking Discourse and Dialogue CS 359 November 6, 2001.
ENTERFACE 08 Project 2 “multimodal high-level data integration” Mid-term presentation August 19th, 2008.
Communication Skills FCNS 426 Strategic Management Joan E. Quinn, M.Ed., RD.
Design Concepts By Deepika Chaudhary.
Issues in Multiparty Dialogues Ronak Patel. Current Trend  Only two-party case (a person and a Dialog system  Multi party (more than two persons Ex.
Towards a Theoretical Framework for the Integration of Dialogue Models into Human-Agent Interaction John R. Lee Assistive Intelligence Inc. Andrew B. Williams.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Non-verbal Communication. How necessary is it to use and interpret it?
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
© 2011 Cengage Learning. All Rights Reserved. CHAPTER 1 Welcome to Public Speaking 1.1Understand Public Speaking 1.2Give Effective and Enthusiastic Speeches.
Communication Additional Notes. Communication Achievements 7% of all communication is accomplished Verbally. 55% of all communication is achieved through.
1 1. Representing and Parameterizing Agent Behaviors Jan Allbeck and Norm Badler 연세대학교 컴퓨터과학과 로봇 공학 특강 학기 유 지 오.
4 November 2000Bridging the Gap Workshop 1 Control of avatar gestures Francesca Barrientos Computer Science Division UC Berkeley.
Communication Vocabulary
Speaking while monitoring addressees for understanding Torsten Jachmann Herbert H. Clark and Meredyth A. Krych Seminar „Gaze as function of.
Communication. What is Communication? The process of exchanging information, ideas, and feelings between a sender and a receiver.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
Nonverbal Communication. Communication in general is process of sending and receiving messages that enables humans to share knowledge, attitudes, and.
O The Feedback Culture. o Theories of Communication. o Barriers. REVIEW.
ENTERFACE 08 Project #1 “ MultiParty Communication with a Tour Guide ECA” Final presentation August 29th, 2008.
SEESCOASEESCOA SEESCOA Meeting Activities of LUC 9 May 2003.
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
Verbal Communication. Oral Communication involves what?
ENTERFACE’08 Multimodal Communication with Robots and Virtual Agents mid-term presentation.
Conversational role assignment problem in multi-party dialogues Natasa Jovanovic Dennis Reidsma Rutger Rienks TKI group University of Twente.
Segmentation and Recognition of Meeting Events M4 – Meeting Munich 23. September 2004 Stephan Reiter.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
WP6 Emotion in Interaction Embodied Conversational Agents WP6 core task: describe an interactive ECA system with capabilities beyond those of present day.
Antar Abdellah. What is speaking? Producing comprehendible sounds.
Communication between cultures 8TH EDITION Chapter 8 Verbal Messages: Exchanging Ideas Through Language © Cengage 20121Chapter 8 Verbal Messages: Exchanging.
Communication. What is communication? Communication is successfully making your wants, needs, feelings and ideas known to other people. There are three.
OVERVIEW Learn about effective communication Learn how to plan, organize, and write a variety of speeches Deliver speeches on a variety of topics You will.
Interpreting Ambiguous Emotional Expressions Speech Analysis and Interpretation Laboratory ACII 2009.
UNDERSTANDING COMMUNICATIONS. Communication is: Sharing (information) Using (symbols) Sending and (receiving) messages.
Stuart Hall ENCODING/DECODING MODEL OF COMMUNICATION.
KRISTINA Consortium Presented by: Mónica Domínguez (UPF-TALN)
Unit 1 Notes: Communication
Presented by: Mónica Domínguez
The Communication Process
Presentation transcript:

Recognition of meeting actions using information obtained from different modalities Natasa Jovanovic TKI University of Twente

2 Outline  Social psychology aspect of joint activities, joint and individual actions  Meeting as a sequence of meeting actions Semantic approach in modeling meetings  Lexicon of meeting actions  Other aspects of meetings  Semantic model  Conclusions and future directions

3 Joint activities (Social psychology aspect)  Activity types: time-bounded event (football game) or an ongoing process (teaching)  Joint activity- an activity with more than one participant. Discourse ( language has dominate role), football game, weeding ceremony, meeting  Dimensions of joint activities: formality, scriptedness, verbalness, cooperativness  Aspects of joint activities: participants, activity roles, public goals, private goals, hierarchies, boundaries, dynamics etc.  Joint activity advance through joint actions

4 Individual and joint actions (Social psychology aspect)  Joint action – a group of people doing things in coordination ( e.g speaking and listening,passing a ball in basketball etc.).  Coordination of both content and processes  Individual actions: Autonomous actions Participatory actions (individual acts performed only as the part of a joint action)  A person’s processes may be very different in individual and joint actions even when they appear identical  In joint actions participants often perform different individual actions

5 Meeting as a sequence of meeting actions (I)  Meeting is a dynamic process which consists of group interaction ( joint actions) between meeting participants -meeting actions (meeting events)  Meeting actions:monologue, discussion, note taking, presentation, consensus, disagreement etc.  Meeting actions are determined by the participants’ individual actions Beh=f(P,E) P-person; E-environment

6 Meeting as a sequence of meeting actions(II)  Multimodal human-human interaction in the meeting (natural humans behavior)  Communication channels: speech, face expressions, gestures, body movements, gaze etc.  Combination of verbal and non-verbal elements

7 Semantic approach in modeling meeting (I)  Our idea: Semantic approach in modeling meeting as a sequence of meeting actions using information obtained from different modalities  Why do we need a semantic approach?

8 Semantic approach in modeling meeting(II)  Multidimensional (multilevel) problem in meeting modeling. participant level : integration of information obtained from different modalities in order to recognize multimodal participants behavior meeting action level:recognition of meeting actions as a combination of the multimodal participants behavior

9 Lexicon of meeting actions(I)  The first step in meeting modeling is to describe a lexicon of meeting actions  Each meeting action has something like a micro grammar  Structure of lexicon: definition of a meeting action characteristics: number of speakers, time, boundaries, topics, speaker behavior, participants behavior, duration constraint etc.

10 Lexicon of meeting actions(II)  Set of 17 meeting actions divided in three groups: Single speaker dominate meeting actions Multi speaker meeting actions Non-verbal dominate meeting actions  Hierarchical organization of meeting actions

11 Meeting actions Non-verbal dominate Multi-speaker Single speaker dominate PresentationMonologue Opening Introduction White-boardLecturing EndingDiscussion Multi discussion Consensus Disagreement BreakVote Applause Note taking SilenceLaugh Lexicon of meeting actions (III)

12 Other aspects of meeting ( User profile )  Meeting is more than a sequence of meeting actions.  User profile: age, gender, native-English speaker, profession, membership to specific group, role, speech style etc.  The user profile can be explicitly specified during the registration process or be learned during the processing of the recorded meetings  Knowledge about user may be useful on individual and group level of meeting modeling.

13 Other aspects of meeting ( Background knowledge )  Background knowledge play an important role at each level of abstraction  Background knowledge may include : agenda, written notes, presentation slides, content of white-board number of meeting participants etc.

14 Other aspects of meeting ( Target detection )  ”What John said to Peter about the programming standards?“ contains three very important aspects of the meeting.  source of the messages (John)  discussed topic (programming standards)  target (addressee) of the message (Peter)

15 Other aspects of meeting ( Target detection )  Target ( addressee) detection needs a multimodal approach (speech,gaze, gesture) “What do you think about my idea?” Gaze detection ( speaker focus of attention) or pointing at the person may help to resolve this target ambiguity  Name detection as a method for target detection  Target of the message can be a particular person, group of participants or all participants

16 Other aspects of meetings (Target detection) speakeraddresseeside participant all participants bystander eavesdropper all listener Herbert. H. Clark – Using Language

17 Semantic model  Our idea is to develop a modular multimodal system which will use semantic approach on participant level and meeting action level.  Inputs:results of recognition process (WP2) Speech Recognition Gesture/Action Recognition Gaze detection Emotion detection Multimodal person identification and tracking  Output: annotated sequence of meeting actions

18 Meeting Actions Recognition Module Semantic model VideoAudio Gaze detection Action/Gesture Recognition Speech Recognition Person /Speaker ID and Tracking Unimodal Interpreters Multimodal Interpreters Sequence of meeting actions Multimodal recognizers Multimodal Fusion Participant Level Modality units Participants multimodal behavior Background Knowledge

19 Multimodal fusion on a participant level Gaze Interpreter Action/Gesture Interpreter Speech Interpreter Modality Fusion Additional Inference Multimodal recognizers Gaze detection Action/Gesture Recognition Speech Recognition Person /Speaker ID and Tracking Unimodal Interpreters Multimodal Interpreter Participants multimodal behavior Modality units

20 Multimodal fusion on a participant level  Unimodal Interpreters  Unimodal Interpreters modality units 1) Action/Gesture Interpreter participant states (sitting, standing, walking etc.) activities ( silent, talking, laughing,voting etc.) 2) Gaze interpreter ( look at X, look away) 3) Speech Interpreter turn-taking behavior is a basis for social interaction. meaning representation on turn level ( turn array level) features of an array: topic (subtopics), dialog acts (DAMSL), addressees, key words, speech form, overlapping indicator etc.

21 Multimodal fusion on a participant level  Multimodal Interpreter  Multimodal Interpreter Multimodal participants behavior 1) Modality fusion (semantic level) Typed feature structure for meaning representation Unification or/and rule-based approach for fusion 2) Additional inference Use additional information from user profile or background knowledge in order to obtain missing data or resolve ambiguity.

22 Meeting actions recognition module  Hidden Markov Models states: meeting actions observations: semantic features from participant’s behavior representation  Participant dependent features (state, activity, talking duration, dialogue acts etc.) and common features (previous dialogue act, previous key-words etc.)  IDIAP meeting data corpus

23 Conclusions and future direction  The main goal of our approach is to encode more semantic details at each level in other to enable browsing and querying of an archive of recorded meetings.  Larger and more natural meeting data corpus in order to prove our approach for low-level and high-level meeting actions.  Extraction of a set semantic features  Testing approach using techniques different than HMM.