Speech User Interfaces Katherine Everitt CSE 490 JL Section Wednesday, Oct 27.

Slides:



Advertisements
Similar presentations
Communication Transferring information from one person to another. Communication is used to instruct, clarify interpret, notify, warn, receive feedback,
Advertisements

Natural Language Systems
C MU U sable P rivacy and S ecurity Laboratory Sensor-Based Interactions Kami Vaniea.
Lead Black Slide. © 2001 Business & Information Systems 2/e2 Chapter 9 Group Collaboration.
Chapter Lead Black Slide Powered by DeSiaMore Powered by DeSiaMore.
COMP 3715 Spring 05. Computer Interface Interaction between human and computer Has to deal with two things  User’s mental model Different user has different.
Dialog Styles. The Five Primary Styles of Interaction 4 Menu selection 4 Form fill-in 4 Command language 4 Natural language 4 Direct manipulation.
Class 6 LBSC 690 Information Technology Human Computer Interaction and Usability.
Dialog Styles. The Six Primary Styles of Interaction n Q & A n Menu selection n Form fill-in n Command language n Natural language n Direct manipulation.
HCI Issues in eXtreme Computing James A. Landay Endeavour-DARPA Meeting, 9/21/99.
Interface Design for ICT4B Speech, Dialects, and Interfaces Prof. Dan Klein and Prof. Marti Hearst.
ISTD 2003, Audio / Speech Interactive Systems Technical Design Seminar work: Audio / Speech Ville-Mikko Rautio Timo Salminen Vesa Hyvönen.
Single Display Groupware Ana Zanella - CPSC
1 Speech User Interfaces 2 Outline Review Review Motivation for speech UIs Motivation for speech UIs Speech recognition Speech recognition UI problems.
ITCS 6010 Speech Guidelines 1. Errors VUIs are error-prone due to speech recognition. Humans aren’t perfect speech recognizers, therefore, machines aren’t.
Human-Computer Interaction for Universal Computing James A. Landay EECS Dept., CS Division UC Berkeley Endeavor Mini Retreat, 5/25/99 Task Support.
Speech User Interfaces
CSC450 Software Engineering
Human Resources. To understand what are meant by effective communication and feedback Analyse the advantages and disadvantages of different communication.
]. Website Must-Haves Know your audience Good design Clear navigation Clear messaging Web friendly content Good marketing strategy.
Speech Guidelines 2 of Errors VUIs are error-prone due to speech recognition. Humans aren’t perfect speech recognizers, therefore, machines aren’t.
Revision Lesson : DESIGNING COMPUTER-BASED INFORMATION SYSTEMS.
Computer Organization
Assistive Technology Marla Roll, MS, OTR December 15, 2010 Denver Options.
Chapter 11: Interaction Styles. Interaction Styles Introduction: Interaction styles are primarily different ways in which a user and computer system can.
RERC on Telecommunications Access Overview: Accessibility of Voice Systems and Services.
Towards a Unified Interaction Framework for Ubicomp User Interfaces Jason I. Hong Scott Lederer Mark W. Newman G r o u p f o r User Interface Research.
Speaking to Computers Alex Acero Manager, Speech Research Group Microsoft Research Feb 14 th 2003.
Introduction To Computer System
1 WIA Section 188 Disability Checklist Element 5.5.
User Interface in the Digital Decade Kai-Fu Lee Corporate Vice President Microsoft Corporation.
Lecture 6 User Interface Design
Computer Graphics Lecture 28 Fasih ur Rehman. Last Class GUI Attributes – Windows, icons, menus, pointing devices, graphics Advantages Design Process.
The ID process Identifying needs and establishing requirements Developing alternative designs that meet those requirements Building interactive versions.
Fall 2002CS/PSY Pervasive Computing Ubiquitous computing resources Agenda Area overview Four themes Challenges/issues Pervasive/Ubiquitous Computing.
A context-aware communication system Natalia Marmasse advisor: Chris Schmandt Speech Interface Group MIT Media Lab.
Designing Speech and Multimodal Applications for Seniors Deborah Dahl Conversational Technologies SpeechTEK 2009 New York August
1 Chapter 15 User Interface Design. 2 Interface Design Easy to use? Easy to understand? Easy to learn?
Modal Interfaces & Speech User Interfaces Katherine Everitt CSE 490F Section Nov 20 & 21, 2006.
AVI/Psych 358/IE 340: Human Factors Interfaces and Interaction September 22, 2008.
Human-Computer Interaction
1 3132/3192 User Accessibility © University of Stirling /3192 User Accessibility 2.
INFO 355Week #71 Systems Analysis II User and system interface design INFO 355 Glenn Booker.
Speech Interfaces User Interfaces Spring 1998 Drew Roselli.
1 Natural Language Processing Lecture Notes 14 Chapter 19.
TOOL5100: CSCL Issues in CSCW and groupware A. Mørch, Issues in CSCW and Groupware: Anders Mørch TOOL 5100,
Marketing Research Approaches. Research Approaches Observational Research Ethnographic Research Survey Research Experimental Research.
Chapter 5:User Interface Design Concepts Of UI Interface Model Internal an External Design Evaluation Interaction Information Display Software.
Conceptual Model Design Informing the user what to do Lecture # 10 (a) Gabriel Spitz.
Ergonomics/Human Integrated Systems (Project 02)
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
Systems and User Interface Software. Types of Operating System  Single User  Multi User  Multi-tasking  Batch Processing  Interactive  Real Time.
Stanford hci group / cs376 u Jeffrey Heer · 19 May 2009 Speech & Multimodal Interfaces.
6.S196 / PPAT: Principles and Practice of Assistive Technology Wed, 19 Sept Prof. Rob Miller Today: User-Centered Design [C&H Ch. 4]
Communication. What is communication Communication refers to the transmission of information from a sender to a receiver, via a given medium. Two-way.
Speech and multimodal Jesse Cirimele. papers “Multimodal interaction” Sharon Oviatt “Designing SpeechActs” Yankelovich et al.
Speech User Interface 10/26/2010. Pervasive Information Access Information & Services I-Land vision by Streitz, et. al.
Communication.
2. OPERATING SYSTEM 2.1 Operating System Function
Lesson Objectives Aims You should be able to:
System Design Ashima Wadhwa.
Designing Speech and Multimodal Applications for Seniors
Interaction Styles.
Chapter 6: Interfaces and interactions
Introduction to Computers
A vision for learning with Digital technologies
GRAPHICAL USER INTERFACE GITAM GADTAULA. OVERVIEW What is Human Computer Interface (User Interface) principles of user interface design What makes a good.
GRAPHICAL USER INTERFACE GITAM GADTAULA KATHMANDU UNIVERSITY CLASS PRESENTATION.
DATABASE DESIGN & DEVELOPMENT
Accessible Forms Gaby de Jongh, IT Accessibility Specialist
Presentation transcript:

Speech User Interfaces Katherine Everitt CSE 490 JL Section Wednesday, Oct 27

2 Motivation for Speech UIs: Pervasive Information Access Information & Services I-Land vision by Streitz, et. al.

3 UIs in the Pervasive Computing Era Future computing devices won’t have the same UI as current PCs Wide range of devices –Small or embedded in environment –Often with alternative I/O & w/o screens –Information appliances I-Land vision by Streitz, et. al.

4 Information access via speech Read my important

5 Motivation Smaller devices -> difficult I/O –People can talk at ~90 wpm (high speed) “Virtually Unlimited” set of commands Freedom for other body parts –Imagine you are working on your car and need to know something from the manual Natural –Evolutionarily selected for Reading, writing and typing are not (too new)

6 When To Use Speech Mobile: no keyboard/mouse/screen available Hands-busy/eyes-busy Assistive Technologies: GUI not appropriate for user

7 Why are they hard to get right? What is the difference between humans and computers? What is the difference between Visual UIs and Speech UIs?

8 Why are they hard to get right? Speech recognition far from perfect –Imagine inputting commands w/ the mouse & getting the wrong result 5-20% of the time Speech UIs have no visible state –Can’t see what you have done before –Can’t see what affect your commands have had Speech UIs are hard to learn –How do you explore the interface? –How do you find out what you can say?

9 Why are they hard to get right? Isolated, short words difficult Segmentation –Recognize speech versus Wreck a nice beach Spelling –mail vs. male -> need to understand language Context is necessary

10 Speech UIs Require: Speech recognition –the computer understanding what the customer is saying. Speech production (or synthesis) –the computer talking to the customer.

11 Designing Speech UIs Speech UI no-no’s –modes (no feedback) certain commands only work when in specific states –deep hierarchies (aka voice mail hell) Verbose feedback wastes time/patience –only confirm consequential things –use meaningful, short cues Interruption –half-duplex communication (i.e., no barge-in support)

12 Designing Speech UIs Too much speech on the part of customer is tiring Speech takes up space in working memory –can cause problems when problem solving Establish common ground & shared context –Make sure people know what type of tool they are using ex. , calendar, weather, stock quotes –Make sure people know where they are in the conversation

13 Designing Speech UIs Pacing –recognition delays are unnatural, make it clear when this occurs –barge-in lets user interrupt like in real conversations –tapering of prompts –progressive assistance: short error messages at first, longer when user needs more help –implicit confirmation: include confirm in next command

14 Disadvantages of Speech UIS Close to Home John McPherson

15 Disadvantages of Speech UIS Disruptive Privacy Concerns Recognition Errors Multiple Verbal Tasks (Interference) Context Errors

16 Future UIs for Information Access Star Trek style UI –verbally ask the computer for info or services –may be common in mobile/hands-free situations –hard to get to work well since it requires perfect speech recognition & unambiguous language understanding Future:

17 Multimodal interaction Multimodal interfaces use different kinds of input (e.g., pen and speech) together Achieves “put that there” ScanMail Future:

18 Context-Aware Applications Apps are aware of context –User location –What they are doing –Who is around –What is appropriate / relevant Future:

19 Questions When would you use a speech UI? What speech UIs have you encountered? Have they been good? How have speech UIs changed? What are the problems with Speech UIs? [Affective UIs & Prosody].

20 Summary Speech UIs –May permit more natural computer access –Allows us to use computers in more situations –Are hard to get to work well Lack of visible state, tax working memory, recognition problems, etc. Multimodal UIs address some of the problems with pure speech UIs.

21 Exercise Would you use a speech UI? Why or why not? Pros/Cons 1.Banking system 2.Registration/Enrollment for University 3.Internet browser for blind users 4.Remote service manual for traveling repairman 5.Database management system