Defence Research and Development Canada Recherche et développement pour la défense Canada Canada Spatialized Audio in the Common Operating Perspective.

Slides:

Advertisements

Similar presentations

Some Reflections on Augmented Cognition Eric Horvitz ISAT & Microsoft Research November 2000 Some Reflections on Augmented Cognition Eric Horvitz ISAT.

Advertisements

AVQ Automatic Volume and eQqualization control Interactive White Paper v1.6.

Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),

Created By: Lauren Snyder, Juliana Gerard, Dom Williams, and Ryan Holsopple.

EE2F2 - Music Technology 2. Stereo and Multi-track Recording.

Perceptual Processes: Attention & Consciousness Dr. Claudia J. Stanny EXP 4507 Memory & Cognition Spring 2009.

Listening and Communication Enhancement. LACE Agenda How Auditory Training (AT) changes the hearing aid practice LACE: how it works; results it produces.

Damian Gordon Consider the Users Andrea Curley. Nature of User Many different categories of users, impossible to consider all Can you group users?

Overview What is iLs? How does iLs work? What is the science behind the method? What is the equipment like? How long is the iLs program? Is there supporting.

Multimodal feedback : an assessment of performance and mental workload (NASA-TLX) 남종용.

Command Visualisation NATO Workshop on Visualisation of Massive Military Multimedia Datasets, DREV Justin G. Hollands Human-Computer Interaction Group,

OHT 10.1 Galin, SQA from theory to implementation © Pearson Education Limited 2004 The testing process Determining the test methodology phase Planning.

Cognitive Processes PSY 334 Chapter 3 – Attention July 8, 2003.

ICS 463, Intro to Human Computer Interaction Design: 9 “Theory”. Input and Output Dan Suthers.

Evaluating Non-Visual Feedback Cues for Touch Input Device Selina Sharmin Project for the course New Interaction Techniques Spring.

Zhengyou Zhang, Qin Cai, Jay Stokes

Theoretical Foundations of Multimedia Chapter 3 Virtual Reality Devices Non interactive Slow image update rate Simple image Nonengaging content and presentation.

1 Dong Lu, Peter A. Dinda Prescience Laboratory Computer Science Department Northwestern University Virtualized.

Lecture 4: Perception and Cognition in Immersive Virtual Environments Dr. Xiangyu WANG.

Binaural Sound Localization and Filtering By: Dan Hauer Advisor: Dr. Brian D. Huggins 6 December 2005.

Call Center – What Really Makes Sense? Call Center – ce este cu adevarat important?

Biointelligence Laboratory School of Computer Science and Engineering Seoul National University Cognitive Robots © 2014, SNU CSE Biointelligence Lab.,

Virtual Reality: How Much Immersion Is Enough? Angela McCarthy CP5080, SP

Transformed Social Interaction – TSI Theory (Bailenson et al. 2008) To describe the transformation of interaction in mediated communication environments.

Allyn Romanow Mark Duckworth ) Andy Pepperell Brian Baldino

California Common Operating Picture (Cal COP) for Public Safety

Hearing Actual perception and processing of sound.

Fall 2002CS/PSY On-Speech Audio Area Overview Will it be heard ? Will it be identified ? Will it be understood Four Areas Uses of Non-speech Audio.

Interaction Media & Communication, Department of Computer Science, Queen Mary University of London THE INFLUENCE.

1 Chapter 8: Displays System Display (the represented system) Mental model Senses Attention Perception.

Computational Perception Li Liu. Course 10 lectures 2 exercises 2 labs 1 project 1 written examination.

ENTERFACE ‘08: Project4 Design and Usability Issues for multimodal cues in Interface Design/ Virtual Environments eNTERFACE ‘08| Project 4.

Cognitive demands of hands-free- phone conversation while driving Professor : Liu Student: Ruby.

CMPD273 Multimedia System Prepared by Nazrita Ibrahim © UNITEN2002 Multimedia System Characteristic Reference: F. Fluckiger: “Understanding networked multimedia,

Chapter 4 Finding out about tasks and work. Terminology GOAL: End result or objective TASK: An activity that a person has to do to accomplish a goal ACTION:

Dr. Gallimore10/18/20151 Cognitive Issues in VR Chapter 13 Wickens & Baker.

Vocabularies for Description of Accessibility Issues in MMUI Željko Obrenović, Raphaël Troncy, Lynda Hardman Semantic Media Interfaces, CWI, Amsterdam.

HPN: IFSS1 Intelligent Flight Support System (IFSS) A Real-Time Intelligent Decision Support Prototype PRESENTER/COTR Anthony Bruins (X37071) HPN Software.

ICT 1 A multimodal context aware mobile maintenance terminal for noisy environments Fredrik Vraalsen Research scientist SINTEF MOBIS’04 – Oslo, 15/9-04.

New Developments in Hearing Technology Dave Gordey, M. Sc. AUD (c)

Teachers Discovering Computers Integrating Technology and Digital Media in the Classroom 5 th Edition Let’s Review Lesson 2! Who Wants to Be a Computer.

Current Assistive Technologies Available for Orientation and Mobility Purposes: Applications, Limitations, and Criteria for Successful Use Ed Gervasoni,

Audio Streamer Exploiting simultaneity for listening Chris Schmandt and Atty Mullins MIT Media Laboratory.

Users’ Quality Ratings of Handheld devices: Supervisor: Dr. Gary Burnett Student: Hsin-Wei Chen Investigating the Most Important Sense among Vision, Hearing.

Understanding Users Cognition & Cognitive Frameworks

Fundamentals of Information Systems, Third Edition1 The Knowledge Base Stores all relevant information, data, rules, cases, and relationships used by the.

1 ISE 412 ATTENTION!!! From page 147 of Wickens et al. ATTENTION RESOURCES.

U SER I NTERFACE L ABORATORY Situation Awareness a state of knowledge, from the processes used to achieve that state (situation assessment) not encompass.

Change Blind Information Display for Ubiquitous Computing Environments Professor: Liu Student: Ruby.

Transitioning from Implicit to Explicit, Public to Personal, Interaction with Multiple Users Daniel Vogel, Ravin Balakrishnan Department of Computer Science.

MASSIVE “ Model, Architecture and System for Spatial Interaction in Virtual Environments ” a Distributed Virtual Reality System Incorporating Spatial Trading.

Stanford hci group / cs376 u Jeffrey Heer · 19 May 2009 Speech & Multimodal Interfaces.

C ONTEXT AWARE SMART PHONE YOGITHA N. & PREETHI G.D. 6 th SEM, B.E.(C.S.E) SIDDAGANGA INSTITUTE OF TECHNOLOGY TUMKUR

Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.

What can we expect of cochlear implants for listening to speech in noisy environments? Andrew Faulkner: UCL Speech Hearing and Phonetic Sciences.

Submitted To: Submitted By: Seminar On Digital Audio Broadcasting.

King Saud University College of Engineering IE – 341: “Human Factors” Spring – 2016 (2 nd Sem H) Chapter 3. Information Input and Processing Part.

Perceptive Computing Democracy Communism Architecture The Steam Engine WheelFire Zero Domestication Iron Ships Electricity The Vacuum tube E=mc 2 The.

SIE 515 Universal Design Lecture 9.

AVQ Automatic Volume and eQualization control

Assist. Prof. Dr. Ilmiye Seçer Fall

Precedence-based speech segregation in a virtual auditory environment

AVQ Automatic Volume and eQqualization control

Spatial Audio - Spatial Sphere Demo Explained

seeing unfamiliar voices

SENSATION AND PERCEPTION

SENSATION AND PERCEPTION

Treatment : Media and Techniques

Treatment : Media and Techniques

SENSATION AND PERCEPTION

Presentation transcript:

Defence Research and Development Canada Recherche et développement pour la défense Canada Canada Spatialized Audio in the Common Operating Perspective Ryan Kilgore Mark Chignell University of Toronto Interactive Media Lab Capt Stephen Boyne Nada Pavlovic Defence Research and Development Canada NATO RTO VizCOP Sep 04 Toronto, ON

Overview Motivate use of Spatial Audio in COP Human Factors Issues and Sample Uses Introduction to spatial audio The Vocal Village: A spatialized VoIP system Experimental Results Proposals for Future Use

“Common Operating Perspective”? The perspective is not a picture, it is a mental model built in the head of the commander/operator based on information inputs “Picture” ignores the role of audio and other senses. Humans are multi-modal creatures. “COP is a shared mental image of what is going on” Audio should be part of the perspective

Requirements Support close interaction of a “mixed expert team” viewing the same picture Give the involved people “close to real-time” situation awareness Be intuitively understandable without special training Minimize the need for mental transformations and cognitive effort Available now or in near future “let’s try stuff out and see what works” 70% working solution is better than 110% unavailable

Human Factors Issues Stress and Performance Workload and Time stress Attentional Resource Theory Situation Awareness, Monitoring Alerting and Orienting Discrimination of signals in noise

Stress and Performance

Multiple Attentional Resources

Audio Applications for COP Cueing for multiple screen displays Monitoring multiple audio streams simultaneously Different audio info dependent upon listener’s location in CP Development of an always on audio space with spatialized components

Usage Considerations Allow personalization of audio space Avoid mental transformations Locate different auditory displays and communication channels in different spatial positions Distributed Speaker arrays can be used to provide required spatial audio effects

spatial audio | explanation Similar to binocular vision Result of the brain’s ability to perceive relative differences between signals picked up by the left and right ears (time, volume, frequency) Allows people with binaural hearing to locate sound sources in three-dimensional space Spatialized audio: worth considering for COP?

Laboratory experimentation and realistic scenarios (e.g., use by pilots) have demonstrated numerous cognitive benefits of spatial audio when compared to non-spatial audio, including: increased intelligibility of speech improved detection of signals at lower signal-to- noise ratios improving memory, comprehension, and speaker identification in audioconference environments spatial audio | benefits Spatialized audio: worth considering for COP?

the Vocal Village | overview Real-time VoIP audioconferencing through client/server architecture Low-fidelity binaural cues allow for the presentation of individual conferee voices from different apparent locations, low bandwidth requirements Voice locations may be automatically controlled or manually customized by individual participants through a Graphical User Interface Minimal requirements placed on client for maximum portability Spatialized audio: worth considering for COP?

experiment | design Subjects listened to a series of four pre-recorded audioconferences held between the same four women. The conferences were presented in four different formats, using a within-subjects design: Nonspatialized audio (Mono) Low-fidelity spatialized audio, with conferees in random positions (Random) Low-fidelity spatialized audio, with conferee location determined by subject (Vocal Village) Audio spatialized using commercial CoolEdit 3D software, with speakers in random positions (CoolEdit) Spatialized audio: worth considering for COP?

experiment | results User Preference: Audio format had a significant effect on User Preference, with the personalized, spatial Vocal Village format being the most preferred (1 is “best” in the table below). OrderMean RatingFormat 11.50VocalVillage (Spatial) 22.05Random (Spatial) 32.41CoolEdit (Spatial) 43.40Mono F[3,66] = 18.77, p < Spatialized audio: worth considering for COP?

experiment | results Perceived Attention Allocation: Varied significantly by audio format (F[3,66] = 6.572, p = 0.001). Personalized spatial display best Spatialized conference better than mono Spatialized audio: worth considering for COP?

experiment | results Perceived Speaker Identification Difficulty: F[3,66] = 7.44, p < Easier for spatialized conference Spatialized audio: worth considering for COP?

experiment | conclusions Low-fidelity, “within-the-head” spatialization techniques implemented within the Vocal Village tended to increase objective performance (not significant) Personalized spatialization provided by the Vocal Village significantly improves user perceptions of performance (increased confidence and ease of speaker identification) The Vocal Village environment was significantly preferred by users to traditional audioconferencing Spatialized audio: worth considering for COP?

spatialized audio and situation awareness Study looked at spatialized radio messages in MOUT environment 3 Levels of fidelity examined Stereo Generic HRTF Free Field

Results Spatial Audio improved message location identification Lack of head tracking appears to diminish effect of spatialized audio

Proposal/Conclusions Technology for presenting spatial audio in the COP is available Picking up the phone to communicate should be replaced with sophisticated audio spaces that promote awareness and offload visual processing Spatial Audio can also be used to simplify the task of orienting to alerts and messages To try out spatialized audioconferencing download the client from: for more info on the Vocal Village

Spatialized audio in situation awareness Study to examine use of spatialized audio cues in target localization and memory for spatial layout In support of 3D sensor views and 2D topographic displays Different audio configurations may aid different tasks