Presentation is loading. Please wait.

Presentation is loading. Please wait.

Dieter Kopp 9.7.2001 1 Dieter Kopp Alcatel Research & Innovation Distributed Speech Recognition ETSI STQ Aurora Distributed.

Similar presentations


Presentation on theme: "Dieter Kopp 9.7.2001 1 Dieter Kopp Alcatel Research & Innovation Distributed Speech Recognition ETSI STQ Aurora Distributed."— Presentation transcript:

1 Dieter Kopp 9.7.2001 1 Dieter Kopp Alcatel Research & Innovation email:Dieter.Kopp@alcatel.de Distributed Speech Recognition ETSI STQ Aurora Distributed Speech Recognition (DSR)

2 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 2 DSR system vision

3 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 3 ETSI STQ Aurora  Participants  Alcatel, AT&T, British Telecom, Ericsson, France Telecom, Hewlett Packard, Motorola, Nokia, Qualcomm, Siemens, Sony, Texas Instruments, IBM, Conversay, etc.  MEL-Cepstrum DSR Front-End & Compression  Complete - ETSI standard published in February 2000  Advanced Noise Robust DSR Front-End  Current activity - standard expected in 2002  DSR Application & Protocols  Architecture definition, Client /Server protocol specification & contribution to other standardization group

4 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 4 ETSI STQ Aurora Front- End Standardization

5 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 5 DSR Elements

6 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 6 Telephone Application& DSR Performance Enhancement with DSR

7 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 7 Worst performance obtained using speech codec Speech Recognition over IP using DSR has at 50% packet lost only 3% recognition rate degradation compared to 63% for coded speech transmission Benefit of DSR for IP transmission (Simulation done by BT)

8 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 8 Advanced Noise Robust DSR Front-End  Goals:  Standardization of a Noise Robust DSR Front-End algorithm under following conditions: 50% recognition rate improvement compared to the existing DSR Front-End standard Latency below 250ms Complexity below 17wMOPs  Selection process using:  Aurora database, SpeechDatCar (top 2/3 cluster selection)  Large vocabulary database (final winner)

9 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 9 ETSI STQ Aurora Application & Protocols

10 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 10 Application & Protocols Subgroup  Definition of DSR scenarios for applications  Information applications Voice portals (flight, weather, news, movies) Location-specific information Voice Navigation of maps  Transaction-based applications Finance e-commerce (various)  Information capture Dictation Form filling

11 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 11 Application & Protocols Subgroup  Specification of the Client /Server architecture  Specification of the communications elements (voice transport interface, synchronization between Client/Server, etc.)  Contribution to other standardization groups  Participants: Alcatel, British Telecommunications, Ericsson, HP, IBM, ICSI, Intel Labs, Motorola, Nokia, Qualcomm, SpeechWorks, Temic/Daimler Chrysler, TI, Verbaltek, WaveMakers, Philips, etc.

12 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 12 ETSI/STQ-Aurora Protocol & Application GUI page Graphic I/O Speech output Speech output Voice Recognition Voice page URL DSR Mobile Network Open & establish connection, Capability negotiation Connection to DSR Back-End Server Pre-processing data, Speech output, contents exchange

13 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 13 Applications for Multi-modal Distributed Speech Recognition  Advanced Applications towards 3G terminals

14 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 14 Multi-modal User Interaction Service Request Presentation Manager User Profile Capability Application Feedback/ Interaction Input: Speech, Key, Pen, etc. Output: Speech, Display Dependent on the environment (background noise) and the user preferences more or less speech I/O could be used

15 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 15 Mobile `02 How may I help you? Menu WAP Select Scenario: Personal Information Manager Tell me todays schedule! 1 Tuesday, 26.6.2001 8:30 9:00 MAP TP 4 9:30 phone conference 10:00 10:30 ? M. Hauser 11:00 ? Marketing 11:30 Lunch 12:00 12:30 1:00 department conv. You have meetings at 9, 11:30 and 1 p.m.. You have have two meeting requests. Details: 9 until 10 o’clock, phone-conference MAP 10:30 possible meeting with M. Hauser Marketing, 11:30 until 12:30 lunch... 2 4 9:00 e-business O’Neill, Scott Dumont, Denise 5 Invite Jim Mason! 3 Who will participating the 9 o’clock phone call?

16 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 16 Communication Manager Voice Transport Interface Data Transport Interface Synchronization Interface GUI Browser DOM Wrapper Voice Browser DOM Wrapper MM Shell Conversational Engines DSR encoderGUI driversGUI I/OAudio drivers Audio I/O Content Server HTTP Synchronization Protocols Network Server Audio Codec (s) DSR decoder Network Transport Layer Gateway and router with Voice transport and Synchronization Support Multi-modal Architecture

17 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 17  Voice Transport protocol specification and contribution to 3GPP  Definition of the Multi-modal Shell function. How the synchronization could be managed  Liaison offer to W3C for the standardization of the DOM interface for VoiceXML  Contribution to W3C Multi-modality group with ETSI multi-modal architecture  Common interface to all speech recognizers (IBM activity) P&A next steps

18 Etsi_P&A-multi-modal.ppt Distributed Speech Recognition Dieter Kopp 9.7.2001 18 Thank You


Download ppt "Dieter Kopp 9.7.2001 1 Dieter Kopp Alcatel Research & Innovation Distributed Speech Recognition ETSI STQ Aurora Distributed."

Similar presentations


Ads by Google