The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb. 2002 Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.

Slides:



Advertisements
Similar presentations
(1) VoiceXML Overview, Opportunities & Challenges Hitesh Kr. Seth Chief Technology Evangelist SeraNova, Inc OReilly Conference.
Advertisements

INTEGRATION OF VOICE SERVICES IN INTERNET APPLICATIONS By Eduardo Carrillo (lecturer), J. J Samper, J.J. Martínez-Durá Universidad Autónoma de Bucaramanga.
VoiceXML: A Field Evaluation By: Kristy Bradnum Supervisor: Peter Clayton Presented in partial fulfilment of the CS Honours Project.
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.
Collaborative Customer Relationship Management (CCRM) User Group June 23 rd, 2004.
Rob Marchand Genesys Telecommunications
H E L S I N K I U N I V E R S I T Y O F T E C H N O L O G Y G O p r o j e c t : S e r v i c e A r c h i t e c t u r e f o r t h e N o m a d i c I n t e.
Voice XML Team 1 Matt Ganis, Jonathan Hill, Henry Wong Anne I. Mannette-Wright Team 1 Matt Ganis, Jonathan Hill, Henry Wong Anne I. Mannette-Wright.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Basics of HTML What is HTML?  HTML or Hyper Text Markup Language is the standard markup language used to create Web pages.  HTML is.
Objective Understand web-based digital media production methods, software, and hardware. Course Weight : 10%
Understand Web Page Development Software Development Fundamentals LESSON 4.1.
Speech in.NET Sphinx CMU November Presenter casey chesnut brains-N-brawn.com – Web Services – Mobile / Wireless – Speech.
Project 1 Introduction to HTML.
Understand Web Services
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Pace VoiceXML Absentee System Paul Visokey, Ping Gallivan, Yani Mulyani, Lisa Jordan, Elaine Li, George Mathew, Qisheng Hong Presenter Name : Paul Visokey.
VoiceXML and Internet Telephony Kundan Singh and Henning Schulzrinne Columbia University Joint work (in progress) with Daniel,
Template-based framework for building VoiceXML application Jonathan Law.
About VoiceXML 2.0 Stefanie Shriver a lot of this stuff is pulled directly from the 2.0 spec:
Upgrading to XHTML DECO 3001 Tutorial 1 – Part 1 Presented by Ji Soo Yoon 19 February 2004 Slides adopted from
Introducing XHTML: Module B: HTML to XHTML. Goals Understand how XHTML evolved as a language for Web delivery Understand the importance of DTDs Understand.
1st Project Introduction to HTML.
Find The Better Way Expand Your Voice with VXML May 10 th, 2005.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamala Gopal Gupta B. Prabhakaran DEPARTMENT OF COMPUTER SCIENCE THE UNIVERSITY.
2440: 141 Web Site Administration Web Server-Side Programming Professor: Enoch E. Damson.
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
Chapter ONE Introduction to HTML.
Speech Synthesis Markup Language -----Aim at Extension Dr. Jianhua Tao National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese.
Introduction and overview
By: Shawn Li. OUTLINE XML Definition HTML vs. XML Advantage of XML Facts Utilization SAX Definition DOM Definition History Comparison between SAX and.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE.
XForms: A case study Rajiv Shivane & Pavitar Singh.
UWSP Web Speech Research Group Joe Frost Mark Stenerson Professor Dave Gibbs Presentation to AITP Monday, October 17, 2005.
An XML Introduction Extensible Markup Language Describe Structure and Content of Data Sample XML Document.
Conversational Applications Workshop Introduction Jim Larson.
1 © 2004 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Media Resource Control Protocol v2 Sarvi Shanmugham, Editor: MRCP v1/v2.
Multimodal user interfaces: Implementation Chris Vandervelpen
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
3 XHTML.
Integrating VoiceXML with SIP services
1Copyright © PIPEBEACH AB All rights reserved. Scott McGlashan zW3C Voice Browser Dialog Requirements and Specifications zTranscoding WML into VoiceXML.
Speech Technologies and VoiceXML try Department of Computer Science National Cheng-Chi University.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Session: 1. © Aptech Ltd. 2Introduction to the Web / Session 1  Explain the evolution of HTML  Explain the page structure used by HTML  List the drawbacks.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE.
XML, XSL, and SOAP Building Object Systems from Documents CSC/ECE 591o Summer 2000.
Phone Mashups Integrating Telephony & the Web Irv Shapiro CEO, Ifbyphone, Inc.
© 2013 by Larson Technical Services
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
VoiceXML Version 2.0 Jon Pitcherella. What is it? A W3C standard for specifying interactive voice dialogues. Uses a “voice” browser to interpret documents,
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Understanding Web-Based Digital Media Production Methods, Software, and Hardware Objective
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better than web.
Presented By Sharmin Sirajudeen S7 CS Reg No :
HTML PROJECT #1 Project 1 Introduction to HTML. HTML Project 1: Introduction to HTML 2 Project Objectives 1.Describe the Internet and its associated key.
Website Source Code Free Download.
VoiceXML Tutorial: Part 1 Introduction and User Interaction with DTMF
Project 1 Introduction to HTML.
SALT & The Microsoft Speech Application SDK
Teaching slides Chapter 6.
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch

Overview Voice browsers History of voice markup languages W3C Speech Interface Framework Communication Architecture VoiceXML 2.0 Grammars SALT Not WAP/WML, Voice over IP

Voice Browser Device (hardware and software) that interprets voice markup languages to generate voice output and interpret voice input.

Companies

History 1990s: companies developed their own markup languages: PhoneML (AT&T) PhoneML (Lucent) VoxML (Motorola) TalkML (HP Labs) SpeechML (IBM) => VoiceXML Forum : VoiceXML : W3C Voice Browser Workshop

VoiceXML Specification History April 1999 – Initial spec – Request For Comment August 1999 – 0.9 Spec released March 2000 – 1.0 Spec released October 2001 – 2.0 Working Draft (W3C) March 2002 – next Working Draft 4th quarter 2002 – 2.0 Recommendation W3C?

Why Voice Markup Languages? “Voicifying” web pages by adding a few VoiceXML tags is not feasible: –basic design principles that make a good web page are very different from those that make an efficient voice interface –e.g. Raggett & Ben-Natan: “Voice Browsers” (W3C, 1998) … unless you want to create a multimodal interface (cf. SALT) ?

Speech Interface Framework TTS Language Understanding World Wide Web User Telephone System Dialog Manager Language Generation Media Planning Prerecorded audio player ASR DTMF tone recognizer Context Inter- pretation Lexicon Natural Language Semantics ML VoiceXML 2.0 Reusable Components Speech Synthesis ML N-gram Grammar ML Speech Recognition Grammar ML

Communication Architecture

What is VoiceXML? For creating audio dialogs that include Synthesized speech Digitized audio Recognition of spoken and DTMF key input Recording of spoken input Telephony Mixed-initiative conversations Major goal: bring the advantages of web-based development and content delivery to interactive voice response applications.

Advantages of VoiceXML As perceived by Motorola et al: People want a better mobile user interface while on the go Device Independent Open standards create and drive market demand Easy to program since similar to other XML- based languages Utilizes existing web infrastructure

Developing applications To develop VoiceXML applications you have to learn several languages: –VoiceXML –ECMAScript (JavaScript/Jscript) –a grammar format (GSL, JSGF, Speech Recognition Grammar Specification) –a back end scripting language (Perl, Java, …) Web developers are used to this kind of environment

Voice XML 2.0 Features Clears up grammar ambiguity SRGF – Speech Recognition Grammar Format –Grammars can be represented as an XML document. XSLT can generate. XML parsers can parse –Augmented BNF (ABNF) Similar to many current proprietary standards Speech Synthesis Specification Language –based on JSML (Sun)

VoiceXML Basics XML-based More structured then HTML (describes structure and semantics of data, not presentation) –Must close all tags (i.e. ) Structure of language described in a Document Type Description (DTD)

VoiceXML Applications An application consists of a single application root document as well as zero or more other documents The application root document is loaded whenever any other document is accessed The application root document grammars and variables are visible in other application documents Document root Document

VoiceXML Documents Documents can contain two types of dialogs: –forms ( ) –menus ( ) Other elements: – : metadata, defined as name/value pair – : for declaring variables – : for client-side ECMAScript – : for catching events – : transitions to other dialogs

Forms and menus Forms may contain zero or more elements –the user must provide a value for the field before proceeding to the next element in the form –each field may specify a grammar that defines the allowable inputs Menus may contain one or more elements –a menu presents the user with a choice of options and then transitions to another dialog

VoiceXML Example Hello World!

Example with Grammar Would you like coffee, tea, or juice? [coffee tea juice] Your 10 will be ready momentarily

Dynamic VoiceXML #!perl –w print "Content-type: text/x-vxml \n\n"; $HOMEBUFFER = ' Hello World '; print $HOMEBUFFER;

Other Markup Languages JSML: JSpeech Markup Language (Sun) Dialog ML (Dennis Heuer) SABLE (SABLE Consortium) DMML (Dialogue Moves Markup Language) SALT: Speech Application Language Tags (SALT Forum) (CallXML, Telephony Markup Language, …) Progress since March 2000 (VoiceXML 1.0) ?

JSML JSpeech Markup Language (Sun) XML specification for controlling text-to- speech engines includes elements that describe the structure of a document, provide pronunciation of words and phrases, and place markers in the text includes elements that control phrasing, emphasis, pitch, speaking rate, … elements borrowed by VoiceXML

DialogML Dialog Management Language (Dennis Heuer) open source project hosted by Sourceforge XML language for defining dialog-driven setup processes not the “Dialog ML” mentioned in the W3C notes on requirements for voice markup languages!

SABLE SABLE Consortium XML/SGML-based markup language for controlling text-to-speech engines evolved out of work on combining three existing text-to-speech languages (SSML, STML, JSML) (documentation hosted by Bell Labs no longer available; any progress since publication of draft specification?)

SALT Speech Application Language Tags (SALT Forum) SALT Forum founded by Microsoft, Intel, …; 15 October 2001 very simple set of tags for extending existing markup languages (xHTML, XML) specification available Q specification submitted to standards body (W3C??) mid 2002