Dieter Kopp 9.7.2001 1 Dieter Kopp Alcatel Research & Innovation Distributed Speech Recognition ETSI STQ Aurora Distributed.

Slides:



Advertisements
Similar presentations
1 IP Cablecom and MEDIACOM 2004 Instrumental speech quality measures: market needs and standardisation within the ITU Harald Klaus – T-Systems Rapporteur.
Advertisements

Click to continue Network Protocols. Click to continue Networking Protocols A protocol defines the rules of procedures, which computers must obey when.
4.01 How Web Pages Work.
Rob Marchand Genesys Telecommunications
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Chapter 6 Telecommunications & Networks.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Application layer (continued) Week 4 – Lecture 2.
1 Pertemuan 13 Servers for E-Business Matakuliah: M0284/Teknologi & Infrastruktur E-Business Tahun: 2005 Versi: >
Wireless Application Protocol and i-Mode By Sridevi Madduri Swetha Kucherlapati Sharrmila Jeyachandran.
Voice XML Application Design Issues Darshan Desai And Shreenath Laxman Pace University.
IP Telephony (Article Presentation) by Samir Goswami Source: Rivier College, CS699 Professional Seminar.
SIP vs H323 Over Wireless networks Presented by Srikar Reddy Yeruva Instructor Chin Chin Chang.
Internet Telephony Helen J. Wang Network Reading Group, Jan 27, 99 Acknowledgement: Jimmy, Bhaskar.
V1.00 © 2009 Research In Motion Limited Introduction to Mobile Device Web Development Trainer name Date.
1 Networking A computer network is a collection of computing devices that are connected in various ways in order to communicate and share resources. The.
Communication Network Protocols ----Krishna Priyanka Chebrolu.
Mobile Computing Lecture: 4.
Wireshark Presented By: Hiral Chhaya, Anvita Priyam.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
NETWORK CENTRIC COMPUTING (With included EMBEDDED SYSTEMS)
Hands-On Microsoft Windows Server 2003 Networking Chapter Three TCP/IP Architecture.
ITNW 1380 COOPERATIVE EDUCATION – NETWORKING Spring 2010 Seminar # 4 VOIP Network Solutions.
1 10 THE INTERNET AND THE NEW INFORMATION TECHNOLOGY INFRASTRUCTURE.
VoIP Study and Implementation VoIP Ecosystem and Strategy Version 1.0 – Author : Marc PYBOURDIN / Julien BERTON Last Update : 15/05/2012.
Wireless Application Protocol. . The Two Paradigms W – World W – Wide W -- Web W – World W – Wide W – Wireless W -- Web.
July 13, 2006 © 2006 IBM Corporation Distributed Multimodal Synchronization Protocol (DMSP) Chris Cross IETF 66 July 13, 2006 With Contribution from Gerald.
WAP (Wireless Application Protocol). W – World W – Wide W -- Web W – World W – Wide W – Wireless W -- Web The Two Paradigms.
Copyright © 2002 Pearson Education, Inc. Slide 3-1 CHAPTER 3 Created by, David Zolzer, Northwestern State University—Louisiana The Internet and World Wide.
04/06/ Applications on Wireless Platform Ulpiano Conde González.
Lector: Aliyev H.U. Lecture №15: Telecommun ication network software design multimedia services. TASHKENT UNIVERSITY OF INFORMATION TECHNOLOGIES THE DEPARTMENT.
PHILIPS SPEECH PROCESSING Voic Association Vienna, Reimund Schmald Regional Sales Director GSM
ETSI STQ-Aurora Distributed Speech Recognition (DSR) Bernhard Noé Distributed Speech Recognition.
Lectured By: Vivek Dimri Assistant Professor, CSE Dept. SET, Sharda University, Gr. Noida.
1 Analysis of Push Initiator Tool used for Wireless Application Protocol Taotao Huang Helsinki University of Technology Department of Electrical and Communication.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Network Services Networking for Home and Small Businesses – Chapter 6.
Polycom VideoPlus A New Level In Video Conferencing.
Ericsson Competence Solutions Rev A16/11/011 Mobile Learning Course for R380 and R520 Presented by Michelle Almeida Course Structure Design Guidelines.
H.323 An International Telecommunications Union (ITU) standard. Architecture consisting of several protocols oG.711: Encoding and decoding of speech (other.
Network Monitoring Through Mobile (MOBTOP) Developed By : Akanksha Jain. (102199) Deepika Reddy (102210) Team Name: Beans Guided By: Prof. Robert Zhu SUBMITTED.
Copyright © 2007 Pearson Education, Inc. Slide 3-1 E-commerce Kenneth C. Laudon Carol Guercio Traver business. technology. society. Third Edition.
Overview Web Session 3 Matakuliah: Web Database Tahun: 2008.
March 20, 2006 © 2005 IBM Corporation Distributed Multimodal Synchronization Protocol (DMSP) Chris Cross IETF 65 March 20, 2006 With Contribution from.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
Speech. Understanding. Action. The Voice Web Players Dr. Christian Dugast Director Europe 05/00 The Voice Web Players Dr. Christian Dugast Director Europe.
Internet Architecture and Governance
QoS framework (PR0002) Rev.0.5 (Work in progress).
Web application architecture1 Based on Jim Conallen: Web Applications with UML.
T Research Seminar on Telecommuncations Business II - Unified Interfaces for Messaging Services 1 T Research Seminar on Telecommuncations.
Video – Any Device, Anytime, Anywhere - Motorola Inc.
Page 1Wireless World Research Forum (WWRF) WWRF WG2 Service infrastructure of the wireless world  Chair: Prof. Radu Popescu-Zeletin, Fraunhofer FOKUS,
WAP Architecture Presented by, Nithya Inbamani. WAP Background Wireless Application Protocol – secure specification. Wireless Application Protocol – secure.
Web Technologies Lecture 1 The Internet and HTTP.
AIMS’99 Workshop Heidelberg, May 1999 Assessing Audio Visual Quality P905 - AQUAVIT Assessment of Quality for audio-visual signals over Internet.
March 20, 2006 © 2005 IBM Corporation Distributed Multimodal Synchronization Protocol (DMSP) Chris Cross IETF 65 March 21, 2006 With Contribution from.
VoiceXML Version 2.0 Jon Pitcherella. What is it? A W3C standard for specifying interactive voice dialogues. Uses a “voice” browser to interpret documents,
The basics of knowing the difference CLIENT VS. SERVER.
IMS developments in 3GPP
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Multi-Modal Dialogue in Personal Navigation Systems Arthur Chan.
Computer Network Architecture Lecture 6: OSI Model Layers Examples 1 20/12/2012.
Stefan Arbanowski, FOKUS Wolfgang Kellerer, DoCoMo Euro-Labs WWRF13, Jeju, Korea, Feb.
3/10/2016 Subject Name: Computer Networks - II Subject Code: 10CS64 Prepared By: Madhuleena Das Department: Computer Science & Engineering Date :
Software Group 7-December-2005 | Cross © 2005 IBM Corporation Distributed Multimodal Synchronization Protocol (DMSP) Chris Cross, Multimodal Browser Architect.
Java’s networking capabilities are declared by the classes and interfaces of package java.net, through which Java offers stream-based communications that.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
Forschungszentrum Telekommunikation Wien An initiative of the K plus Programme MONA Mobile Multimodal Next Generation Applications Rudolf Pailer
1 BCMCS Framework TSG-X BCMCS Adhoc August 20, 2003.
Presentation transcript:

Dieter Kopp Dieter Kopp Alcatel Research & Innovation Distributed Speech Recognition ETSI STQ Aurora Distributed Speech Recognition (DSR)

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp DSR system vision

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp ETSI STQ Aurora  Participants  Alcatel, AT&T, British Telecom, Ericsson, France Telecom, Hewlett Packard, Motorola, Nokia, Qualcomm, Siemens, Sony, Texas Instruments, IBM, Conversay, etc.  MEL-Cepstrum DSR Front-End & Compression  Complete - ETSI standard published in February 2000  Advanced Noise Robust DSR Front-End  Current activity - standard expected in 2002  DSR Application & Protocols  Architecture definition, Client /Server protocol specification & contribution to other standardization group

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp ETSI STQ Aurora Front- End Standardization

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp DSR Elements

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Telephone Application& DSR Performance Enhancement with DSR

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Worst performance obtained using speech codec Speech Recognition over IP using DSR has at 50% packet lost only 3% recognition rate degradation compared to 63% for coded speech transmission Benefit of DSR for IP transmission (Simulation done by BT)

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Advanced Noise Robust DSR Front-End  Goals:  Standardization of a Noise Robust DSR Front-End algorithm under following conditions: 50% recognition rate improvement compared to the existing DSR Front-End standard Latency below 250ms Complexity below 17wMOPs  Selection process using:  Aurora database, SpeechDatCar (top 2/3 cluster selection)  Large vocabulary database (final winner)

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp ETSI STQ Aurora Application & Protocols

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Application & Protocols Subgroup  Definition of DSR scenarios for applications  Information applications Voice portals (flight, weather, news, movies) Location-specific information Voice Navigation of maps  Transaction-based applications Finance e-commerce (various)  Information capture Dictation Form filling

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Application & Protocols Subgroup  Specification of the Client /Server architecture  Specification of the communications elements (voice transport interface, synchronization between Client/Server, etc.)  Contribution to other standardization groups  Participants: Alcatel, British Telecommunications, Ericsson, HP, IBM, ICSI, Intel Labs, Motorola, Nokia, Qualcomm, SpeechWorks, Temic/Daimler Chrysler, TI, Verbaltek, WaveMakers, Philips, etc.

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp ETSI/STQ-Aurora Protocol & Application GUI page Graphic I/O Speech output Speech output Voice Recognition Voice page URL DSR Mobile Network Open & establish connection, Capability negotiation Connection to DSR Back-End Server Pre-processing data, Speech output, contents exchange

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Applications for Multi-modal Distributed Speech Recognition  Advanced Applications towards 3G terminals

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Multi-modal User Interaction Service Request Presentation Manager User Profile Capability Application Feedback/ Interaction Input: Speech, Key, Pen, etc. Output: Speech, Display Dependent on the environment (background noise) and the user preferences more or less speech I/O could be used

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Mobile `02 How may I help you? Menu WAP Select Scenario: Personal Information Manager Tell me todays schedule! 1 Tuesday, :30 9:00 MAP TP 4 9:30 phone conference 10:00 10:30 ? M. Hauser 11:00 ? Marketing 11:30 Lunch 12:00 12:30 1:00 department conv. You have meetings at 9, 11:30 and 1 p.m.. You have have two meeting requests. Details: 9 until 10 o’clock, phone-conference MAP 10:30 possible meeting with M. Hauser Marketing, 11:30 until 12:30 lunch :00 e-business O’Neill, Scott Dumont, Denise 5 Invite Jim Mason! 3 Who will participating the 9 o’clock phone call?

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Communication Manager Voice Transport Interface Data Transport Interface Synchronization Interface GUI Browser DOM Wrapper Voice Browser DOM Wrapper MM Shell Conversational Engines DSR encoderGUI driversGUI I/OAudio drivers Audio I/O Content Server HTTP Synchronization Protocols Network Server Audio Codec (s) DSR decoder Network Transport Layer Gateway and router with Voice transport and Synchronization Support Multi-modal Architecture

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp  Voice Transport protocol specification and contribution to 3GPP  Definition of the Multi-modal Shell function. How the synchronization could be managed  Liaison offer to W3C for the standardization of the DOM interface for VoiceXML  Contribution to W3C Multi-modality group with ETSI multi-modal architecture  Common interface to all speech recognizers (IBM activity) P&A next steps

Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp Thank You