Speaking while monitoring addressees for understanding. Torsten Jachmann, 16.12.2013. Herbert H. Clark and Meredyth A. Krych. Seminar „Gaze as function of instructions - and vice versa“

Presentation transcript:

Speaking while monitoring addressees for understanding Torsten Jachmann Herbert H. Clark and Meredyth A. Krych Seminar „Gaze as function of instructions - and vice versa“

Research Question: Speaking and listening in dialog. Unilateral: speakers and listeners act autonomously; no interaction. Bilateral: speakers and listeners monitor their respective partner; joint activity. What do speakers monitor? How do they use that information?

Grounding: Level 1: attend to vocalization. Level 2: identify words, phrases and sentences. Level 3: understand the meaning. Level 4: consider answering.

Grounding (example):
A: Were you there when they erected the new signs?
B: Th… which new signs? (Level 3)
A: Little notice boards, indicating where you had to go for everything.
B: No.
This supports the bilateral account.

Monitoring: Voices: attending to the partner's utterances. Faces: gaze and facial expressions as indicators of understanding. Workspaces: the region in front of the body; manual gestures (but also games, etc.).

Monitoring: Bodies: head and torso movements as indicators. Shared scenes: the scenery beyond the workspace. Signals vs. symptoms: signals are constructed to get meaning across; symptoms are not intentionally created.

Least joint effort: Opportunistic: speakers select, from the available methods, those that take the least effort to produce. “Tailored”: overhearers (not monitored by the speaker) may misunderstand utterances.

Method: Pairs of directors and builders: 76 students (34 male / 42 female). Instructions to build 10 simple Lego models. 2 x 2 design (interactive): 28 pairs. Additional non-interactive condition: 10 pairs. Video and audio analyses.

Interactive: Mixed design. Workspace (between subjects): visible or invisible. Faces (within subjects): visible or invisible. No restrictions on time or talk.
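The design described on the slides above can be written out as a small sanity check. The sketch below is illustrative only: it enumerates the four interactive cells and confirms that the reported pair counts add up to the 76 students. The variable names are made up for this example, and the split of the 28 pairs over the cells is not stated on the slides, so it is left out.

```python
from itertools import product

# Factor levels as named on the slide (the identifiers are hypothetical).
workspace_levels = ["visible", "invisible"]  # between subjects
face_levels = ["visible", "invisible"]       # within subjects

# The 2 x 2 interactive design: every workspace level crossed with every face level.
cells = list(product(workspace_levels, face_levels))

# Pair counts reported on the Method slide.
interactive_pairs = 28
non_interactive_pairs = 10

# Each pair consists of one director and one builder.
total_students = 2 * (interactive_pairs + non_interactive_pairs)

for workspace, faces in cells:
    print(f"workspace {workspace}, faces {faces}")
print(f"{total_students} students in total")
```

The total agrees with the 34 male and 42 female students reported for the study.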

Non-interactive: Only one condition. The director records the instructions: no time or talk constraints; the prototype can be examined for as long as wanted before recording. Builders listen to the instructions: no constraints on actions (start, stop, rewind).

Results: Efficiency. Turns. Gestures and grounding: deictic expressions, gestures by addressees, cross-timing of actions, timing strategies, visual monitoring.

Efficiency: Visibility of the workspace improves efficiency.

Efficiency (non-interactive): Building took much longer (245 s non-interactive vs. 183 s interactive). Strong drop in accuracy (inadequate instructions).
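To put the two building times in proportion, the following minimal sketch computes the absolute and relative difference. It uses only the two values reported on the slide; whether they are means or some other summary is not stated there, and the variable names are hypothetical.

```python
# Building times reported on the slide, in seconds.
non_interactive_s = 245
interactive_s = 183

# Absolute and relative extra time needed without interaction.
extra_s = non_interactive_s - interactive_s
relative_increase = extra_s / interactive_s

print(f"{extra_s} s longer, i.e. {relative_increase:.1%} more time")
```

Under these numbers, the non-interactive builders needed 62 s more, roughly a third longer than the interactive ones.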

Turns: Builders take fewer spoken turns when the workspace is visible.

Deictic expressions: Mainly unusable when the workspace is hidden. They require joint attention; otherwise they can only refer to a previously mentioned situation.

Gestures by addressees: Mostly accompanied by deictic utterances (if any). An explicit verdict is usually given only on such utterances (otherwise continuing).

Cross-timing: Gestural signals reflect understanding at that moment.

Cross-timing: Overlapping signals usually do not occur in spoken dialog; they start once “sufficient information” is available.

Cross-timing: Projecting: prediction of the following actions or instructions.

Cross-timing: Initiation time: waiting until the partner is able to attend to the following utterance.

Cross-timing: Time uptake: responses have to be timed exactly to the action and situation.

Timing strategies: Self-interruption: dealing with evidence from the addressee; the interrupted utterance is usually not continued.

Timing strategies: Collaborative references: deictic references rely on the addressee's actions.

Visual monitoring: Mainly used when the director runs into a problem. Eye gaze serves as support.

Conclusion: Grounding is fundamental. A visible workspace speeds up grounding. In task-oriented dialogs, faces are not important. Compensation is possible (only if some form of monitoring is available).

Conclusion: Updating common ground: increments are determined jointly. There is much evidence for the bilateral account: addressees provide statements about their current understanding, and speakers monitor them to update and change their utterances.

Conclusion: Opportunistic process: offering options, self-interruptions, waiting, instant revision. Multi-modal process: speech and gestures are combined if possible; speech alone takes more time.

Remarks: Gaze is only important for certain types of tasks. The measurement of time may be outdated (“old” study). No contradicting studies (to some extent common sense).

Gaze and Turn-Taking Behavior in Casual Conversation Interactions Kristiina Jokinen, Hirohisa Furukawa, Masafumi Nishida and Seiichi Yamamoto

Differences: Three-party dialogue. No instructional task. Stronger focus on eye gaze.

Research Question: How well can eye gaze help in predicting turn-taking? What is the role of eye gaze when the speaker holds the turn? Is the role of eye gaze as important in three-party dialogue as in two-party dialogue?

Hypothesis: In group discussions, eye gaze is important in turn management (especially in turn-holding cases). The speaker is more influential than the other partners in coordinating interactions (selects the next speaker).

Method: Three-person conversational eye gaze corpus: natural conversations; balanced familiarity (50% familiar, 50% unfamiliar); balanced gender (male-only, female-only, mixed).

Method: 28 conversations among Japanese students in their early 20s, with three participants each. Each conversation lasted about 10 minutes. Eye gaze was recorded for one participant.

Method: The eye tracker was fixed on the table to preserve naturalness.

Method

Used data: Estimated over the last 300 ms of an utterance, if the utterance is followed by a 500 ms pause.
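The selection rule on this slide can be sketched in code. This is a minimal illustration under stated assumptions, not the authors' procedure: the `Utterance` type and the `analysis_windows` function are hypothetical, a pause of "at least" 500 ms is assumed to qualify, utterances shorter than 300 ms are used in full, and the final utterance is skipped because no following pause can be measured.

```python
from dataclasses import dataclass
from typing import List, Tuple

WINDOW_MS = 300     # length of the analysis window at the end of an utterance
MIN_PAUSE_MS = 500  # assumed threshold: a pause of at least this length qualifies


@dataclass
class Utterance:
    """Hypothetical record of one utterance, with times in milliseconds."""
    start_ms: int
    end_ms: int


def analysis_windows(utterances: List[Utterance]) -> List[Tuple[int, int]]:
    """Return (start, end) windows covering the last 300 ms of each
    utterance that is followed by a sufficiently long pause."""
    windows = []
    # Pair every utterance with the one that follows it.
    for current, following in zip(utterances, utterances[1:]):
        pause_ms = following.start_ms - current.end_ms
        if pause_ms >= MIN_PAUSE_MS:
            # Clip the window so it never starts before the utterance does.
            window_start = max(current.start_ms, current.end_ms - WINDOW_MS)
            windows.append((window_start, current.end_ms))
    return windows


example = [Utterance(0, 1200), Utterance(1900, 2500), Utterance(2600, 4000)]
windows = analysis_windows(example)
print(windows)
```

In the example, only the first utterance is followed by a long enough pause (700 ms), so only its last 300 ms are selected.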

Used data: Dialog acts. Speech features (values of F0, etc.). Eye gaze.

Results

Conclusion: The speaker uses eye gaze to signal whether he intends to give the turn or hold it (fixating the listener vs. focusing attention somewhere else). Eye gaze is as important in multi-participant conversation as in two-participant conversations.

Conclusion: Eye gaze is used to select the next speaker (seems to be correct). The Japanese data may interfere with the value of the speech data (comparison study?). Listeners focus on the speaker, not vice versa.

Remarks: Vague information and data presentation: although various data exist, the interaction of factors is not presented, and some conclusions rely on the aforementioned point. The setup takes only one participant into consideration. Much of the data was unused (lacking in quality and in the way it was created).

Remarks: The study is based on data collected for another study, so the setup is not optimal. The design is realistic, yet it contains biasing flaws (situation of the participants, only one eye tracker).

Comparison: Clark and Krych present interesting ideas, but eye gaze is only rarely handled. How could this be altered? Jokinen et al. focus on eye gaze in a (more or less) natural situation but are lacking in scientific results and setup. Which points and ideas of this setup could be beneficial?