Automatic Switchboard Operator Luboš Šmídl, Tomáš Valenta Department of Cybernetics Faculty of Applied Sciences University of West Bohemia in Pilsen.

Slides:



Advertisements
Similar presentations
Usability Lab 2007 Corso Laboratorio di Basi Dati II Rome, 2007 Enea Usability Lab - C.R. Casaccia, Roma, Italy User Requirements & Cyber Enterprise Web.
Advertisements

Bachelors Degree in Computer and Electro-techniques Engineering Hugo Monteiro Nº Marco Mouta Nº Final Project towards the Bachelors Degree.
INTEGRATION OF VOICE SERVICES IN INTERNET APPLICATIONS By Eduardo Carrillo (lecturer), J. J Samper, J.J. Martínez-Durá Universidad Autónoma de Bucaramanga.
Facts about Welcome to this video from Ozeki. In this video I will present what makes Ozeki Phone System XE the Worlds best on-site software PBX for Windows.
Tuning Jenny Burr August Discussion Topics What is tuning? What is the process of tuning?
© 2007 Avaya Inc. All rights reserved. Interactive Voice and Video Response Applications Dr. Valentine C. Matula
© Aastra 2012 CMG 7.5 Speech Attendant Sales Presentation.
Filling the gap between users and objects: a multichannel interactive environment Davide Carboni, Gavino Paddeu, Stefano Sanna, Andrea Piras {dcarboni,
Software Development Languages and Environments. Programming languages High level languages are problem orientated contain many English words are easier.
Collaborative Customer Relationship Management (CCRM) User Group June 23 rd, 2004.
Rob Marchand Genesys Telecommunications
TECHNOLOGY FOR MOBILE ADVERTISING SEARCH & COMMERCE © 2007 Apptera Inc. Optimizing Software Architecture for Voice Search SpeechTek 2007.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Pace VoiceXML Absentee System Paul Visokey, Ping Gallivan, Yani Mulyani, Lisa Jordan, Elaine Li, George Mathew, Qisheng Hong Presenter Name : Paul Visokey.
VoiceXML and Internet Telephony Kundan Singh and Henning Schulzrinne Columbia University Joint work (in progress) with Daniel,
Kashif Jalal CA-240 (072) Web Development Using ASP.NET CA – 240 Kashif Jalal Welcome to week – 2 of…
HEX Travel Agent Thorarinn Stefansson CEO Hex software.
Interpret Application Specifications
Distributed Collaborations Using Network Mobile Agents Anand Tripathi, Tanvir Ahmed, Vineet Kakani and Shremattie Jaman Department of computer science.
Application Architectures Vijayan Sugumaran Department of DIS Oakland University.
VoiceXML Basic COCOMO Calculator By Greg Kutcher.
Development of mobile applications using PhoneGap and HTML 5
Web Application Architecture: multi-tier (2-tier, 3-tier) & mvc
The PHP Story. PHP Story PHP is a programming language. Incorporate(join) sophisticated business logic. Widely used general purpose scripting language.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
1 Web Database Processing. Web Database Applications Static Report Publishing a report is prepared from a database application and exported to HTML DB.
Architecture Of ASP.NET. What is ASP?  Server-side scripting technology.  Files containing HTML and scripting code.  Access via HTTP requests.  Scripting.
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
SednaSpace A software development platform for all delivers SOA and BPM.
Conversational Applications Workshop Introduction Jim Larson.
THROUGH DIVERSITY EFFECTIVENESS AIR Forum 2006 May 18, 2006 Dynamic Charts: An approach to making institutional data available through graphical means.
Integrating VoiceXML with SIP services
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
PHP Features. Features Clean syntax. Object-oriented fundamentals. An extensible architecture that encourages innovation. Support for both current and.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Voice User Interface
FlexElink Winter presentation 26 February 2002 Flexible linking (and formatting) management software Hector Sanchez Universitat Jaume I Ing. Informatica.
Syllabus Management System. The Problem There is need for a management system for syllabi that: Provides a simple and effective user interface Allows.
Research Topics CSC Parallel Computing & Compilers CSC 3990.
DAWN: Dynamic Aural Web Navigation Gopal Gupta, S. Sunder Raman, Mike Nichols, H. Reddy, N. Annamalai Department of Computer Science University of Texas.
ASP (Active Server Pages) by Bülent & Resul. Presentation Outline Introduction What is an ASP file? How does ASP work? What can ASP do? Differences Between.
© 2013 by Larson Technical Services
Management System For Graduate Students Projects Day Presentation – June 2011.
CS562 Advanced Java and Internet Application Introduction to the Computer Warehouse Web Application. Java Server Pages (JSP) Technology. By Team Alpha.
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
Adaptive User Interface Modelling for Web-environments T – Antti Martikainen
D R A T D R A T ABSTRACT Every semester each department at Iowa State University has to assign its faculty members and teaching assistants (TAs) to the.
VoiceXML Version 2.0 Jon Pitcherella. What is it? A W3C standard for specifying interactive voice dialogues. Uses a “voice” browser to interpret documents,
The basics of knowing the difference CLIENT VS. SERVER.
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
JavaScript 101 Introduction to Programming. Topics What is programming? The common elements found in most programming languages Introduction to JavaScript.
8 th Semester, Batch 2009 Department Of Computer Science SSUET.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
GROUP PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
ECpE Student Database Team 21 Adviser: Tien Nguyen ECpE and Tony Moore.
VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better than web.
1 Unit E-Guidelines (c) elsaddik SEG 3210 User Interface Design & Implementation Prof. Dr.-Ing. Abdulmotaleb.
A seminar by Ramesh Kumar Raju S CSSE 07121A1547.
PHP / MySQL Introduction
SALT & The Microsoft Speech Application SDK
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Experience with the process automation at SORS
Database Management Systems
AIMS Equipment & Automation monitoring solution
Latest 3314 Practice Test Dumps
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

Automatic Switchboard Operator Luboš Šmídl, Tomáš Valenta Department of Cybernetics Faculty of Applied Sciences University of West Bohemia in Pilsen

Contents A dialogue system The dialogue Automatic speech recognition and speech grammar The data Advanced features Maintenance-free running Experiences and future Interesting facts Other applications

Purpose Automatic Switchboard Operator is a voice application whose purpose is to answer phone calls and transfer callers to requested persons. The caller makes input preferably by voice and the system informs him by voice as well. A voice dialogue system Whole UWB in Pilsen First such a large application of its kind in CZ

A dialogue system Data: MySQL, Oracle, … Document server: PHP Dialogue controller: VoiceXML Interpreter Speech engines: ERIS by SpeechTech and Dpt. of Cybernetics Telephony: SIP or ISDN

The dialogue Experienced vs. newbies Shortcuts Call n-th number Called person specification First name and surname Titles and degrees Department and function Voice or DTMF input Smith → 76484#

Automatic Speech Recognition Methods LVCSR Isolated words Grammars person = ( [(salut function) | (function salut) | salut | function] [degrees] (([firstname] surname) | (surname firstname)) [degrees] [function | department] ) | function;

Speech grammar complexity Prof. Ing. Josef Psutka, CSc., boss of DCy 1. Josef Psutka 2. Engineer Psutka 3. Boss of the Department of Cybernetics 4. Mister Psutka, professor 5. Professor Psutka, the Department of Cybernetics 6. Psutka Josef 7. etc. 26, 042 acceptable utterances

The Data Visual data vs. Aural data Prof. Ing. Psutka professor engineer psutka Generating pronunciations Rules-based, for TTS vs. for ASR Tomáš Tomáš, Thomas, Tom Fields tagging Better grammar matching, faster DB search J(firstname) P(surname) D(department) F(function) T(degree)

Advanced features Web presentation Administration Rules for pronunciations Shortcuts or Direct numbers Callers’ rights Phonebook searching Monitoring Statistics

Maintenance-free running Windows services, daemons Task scheduler 1. Import data 2. Generate pronunciations 3. Generate and compile grammar 4. Optional sanitary restart

Experiences Running since 2008 Extended grammar accepting Hello, Please, Thank you I would like to talk to Optimizing prompts Application made general Future Using statistics for person/number selection More info about employees More features and speed for experienced users New technologies: better TTS and ASR

Interesting numbers 2,095 persons 2,322 telephones 35,566,194 utterances 2.5 hours – grammar compilation time

Other dialogue applications  Entrance exams Since June ,000–5,000 calls a year  Exams Web access alternative  Recent news reading RSS from Categories: general, sport, economics, …  ASR demo Users can test ASR capabilities Web interface, users log in, own set of utterances  and others…

Thank you for attention Do you have any questions?

VoiceXML Mark-up language based upon XML Main advantage Minimizes client/server communication (more interactions in a document) Hides low-lever implementation details from the programmer Enables better portability Designed for content providers, dialogue designers Separates user interface (VoiceXML) from program logic Easy for both simple and complex applications VoiceXML Interpreter (like web browser) Document getter  Document interpreter (dialogue controller)  I/O interface – speech engine: telephony, ASR and TTS units Two kinds of dialogue: forms and menus