Download presentation
Presentation is loading. Please wait.
Published byDwayne Casey Modified over 10 years ago
1
MCL-Collection and Voice Recognition 5 January 2007 Richard Fullard MCL Technologies
2
Why Voice on Mobile Computers? Picking Accuracy and Efficiency !!!
3
DEMO Remote Viewer from Terminal-Screen
4
Demonstration
5
The Drivers behind Voice Increases Mobile Worker efficiency and productivity 15% minimum increase of productivity (paper to voice picking) Simply faster Increased accuracy; 99.9% plus is not uncommon No need to constantly ‘swap’ between paper/terminal and the picking task to perform. Don’t ever lose sight of the actual task Allows hands-free operation Mobile Worker can easily pick the products of the shelf, and easily move or drive the picking cage/pallet etc. Allows eyes-free operation Mobile Worker can focus on other activities such as safe driving Pickers speak and listen while “on the move” (Highly developed multi-tasking skills that humans have been practicing for years)
6
History of Voice Voice recognition systems have been commercially available for over 15 years but only recently ‘crossed the chasm’ (started to work well) Early systems were Voice Independent: A sample of the population was taken as the ‘voice mean’ against which the individual's voice profile was compared Works quite well in white collar environments such as call centres Does not work in Industrial environments Works for dictating systems, Microsoft Office Products & PDA products, IBM’s Via Voice, telephone systems, etc.
7
History of Voice Recently Voice-Dependent systems developed into stable and reliable solutions An individual trains the system on only the words that are used in the application Saves the individual’s voice profile in the system The application compares the voice entry against the voice profile of the relevant individual on the system ensuring high accuracy of recognition Smaller word-set, much higher accuracy Used in industrial data capture applications
8
History of Voice Up until recently, only “vertical” voice solutions available Dedicated, proprietary devices
9
But things have changed! Industry wants Multimodal, it needs efficiency, but accuracy too. It doesn’t want a compromise! RFID when it makes sense Scan when it makes sense Key when it makes sense Speak when it makes sense Eyes and Ears when it makes sense
10
What is MCL-Voice MCL-Voice is a fully standard MCL solution with the addition of Voice recognition and Voice synthesis capabilities
11
MCL- LINK / MCL- NET RS232. Modem. GSM/GPRS. Ethernet. Internet. WiFi 802.11 MCL ODBC Bridge MCL DLL Bridge MCL SAP R/3 Bridge Communication & Dispatching servers MCL – Designer MCL-Client Voice MCL … Bridge … MCL-Client + Vocollect Voice Your Host Systems
12
What is Voice Recognition Voice Recognition is a technology that converts analog signal coming from a microphone (Voice) into a sequence of digital bytes (words) (Also known as “ASR” or “Speech To Text” ) “1” Voice SignalData Word
13
What is Voice Synthesis Voice Synthesis is a technology that converts data words into analog Speech signal in a given language (also named “Text to Speech” or “TTS”) “1” Voice Signal Data Word “One”
14
Embedded Voice Engine Voice Recognition and Voice Synthesis is done in the Terminal (No Server required)
15
System Requirements Terminal Hardware Certified by MCL Technologies Windows operating system: Microsoft® Windows® Pocket PC 2003/CE4/CE5/WM5. > 300MHz XScale processor. Min 64 MB memory. Audio Interface (Jack). MCL Certified headset (with quick release connector) MCL-Voice Client (activated)
16
Certified Devices / Dec 2006 Symbol TechnologiesIntermec MC50 MC70 MC90xx WT40xx CN2B 730-751B
17
Terminal Accessories Quick release connector Operator safety feature Quickly separates headset cable from the terminal. High quality release mechanism supports multiple connections and disconnections for personalized headsets.
18
MCL-Collection Voice Three new MCL-Collection components: MCL-Designer Voice Add-on (for development) Requires an additional License to standard MCL-Designer MCL-Client Voice (for deployment) Requires a specific Voice Client License. MCL-Voice Manager (for deployment) Requires a specific product License.
19
Multimodal Approach
20
What Is Multimodal? Definition - MCL Multimodal access. The combination of multiple data capture technologies in one mobile worker application. Barcode scanners. Imagers. Displays. Touch screens. Signature capture. Keyboards. Weigh scales. Printers. … and now, Voice.
21
What is Multimodal
22
Multimodal Data Capture
23
Voice in Multimodal Applications Consider retail inventory application. Barcode scan item UPC or EAN. Keyboard or voice entry of quantity. Hands full of merchandise – voice more expedient. Consider warehouse receiving application. Radio frequency identification of pallet. Voice acceptance of pallet. Voice receipt of item level goods. Hands free operation. Mobile worker has flexibility to use whichever input method is more convenient and efficient.
24
Voice Recognition Application Vocabularies Voice Templates
25
Application Vocabularies Unique vocabulary for each MCL-Voice application. Action words used by mobile worker to: Enter transaction data. Navigate within the application. Control Voice Parameters MCL-Designer builds application vocabulary word lists. Mobile worker applications typically have small vocabularies. 50 words very typical (+/- 120KB). 100 words unusually large.
26
Application Vocabularies MCL-Voice vocabulary word categories. Action Words: Application words. Task specific words: data entry of transaction data. Such as “Quantity”, “1”, “2”, “3”, “4”. Navigation words: command addressed to the MCL engine Such as “Picking” to branch to the Picking function Global words. Words that you define to be valid on every screen. Commonly used for application navigation words. Such as “Next”, “Back”, “Clear”, “Previous”, “Delete”, “Enter". Control words. Commands to control the voice engine. Such as “Pause”, “Resume”, “Volume UP”, “Volume Down”.
27
Voice Recognition Training Like fingerprints, everybody has a unique voice print. Different accent, different pronunciation of words. Each individual must “train” the voice recognizer to: Understand the application vocabulary for that individual’s voice pattern. Allow worker to speak naturally and be understood. Training generates a “User Voice Template”
28
Voice Recognition Training User voice template. The “User Voice Template” file contains: The selected language. The specific User’s settings (volume, speed, pitch, sensibility). All the application words and their “speaking image” Voice training is performed on the mobile computer. +/-20 minutes per worker typical to train 50 word vocabulary. Voice training result saved in unique template for each worker. Suggest using employee ID or badge ID to create unique, personalized template names. Template can be uploaded to a server and deployed to any other device.
29
Voice Recognition Training How is the voice template used? Mobile worker says a word. Voice recognizer compares spoken word against template. Spoken word translated into template word it matches. The smaller the vocabulary. The faster the comparison. The more accurate the match. Multi-User. Voice templates for several individuals may be: Saved on a mobile computer. –Allows a mobile computer to be shared by several workers. Saved and downloaded from a central server. –Allows a pool of mobile computers to be shared by all mobile workers.
30
Multi-User User Voice Template File & Voice Preferences are loaded in the device (for execution) & on server (for distribution) UVT Download/Upload Server User 1 User 2 Multi-Users
31
Advanced Features
32
Noise Cancellation on MCL Certified Headsets. MCL-Voice operates in noisy, industrial environments: Warehouses, distribution centers, dockyards, transportation hubs, assembly/manufacturing plants… Very effective noise suppression on certified headsets. High voice recognition directly proportional to ability to reduce ambient noise. Impossible task on inferior headsets. To maximize voice recognition: Perform voice recognition trainings in actual work environment. Creates most representative voice templates.
33
Advanced Features Noise sampling. Adjusts microphone pickup levels to compensate for ambient noise level. Noise sampling can be performed: Automatically on boot up. On mobile worker demand.
34
Advanced Features Dynamic voice training. Incremental addition of vocabulary words to existing template. On boot up. Untrained words introduced by new version of application. Prompts mobile worker to train any untrained words. Consider application vocabulary of 50 words. –New version introduces one new word. –Mobile worker trains only the new word, not all 51 words. On demand by mobile worker. Recognizer continually has trouble understanding a given word. Retrain any poorly trained words.
35
Advanced Features Fast voice recognition. Explicit application vocabulary. Very efficient. Performed on the mobile computer. Voice recognition and audio feedback virtually instantaneous.
36
Advanced Features Vocabulary optimization. Vocabulary subset definable on each input field. Limits template choices valid for match. Further decreases voice recognition search times. Further increases word match accuracies.
37
Advanced Features Audio Feedback (Echo) Any input. Mobile worker says “five” into microphone, and immediately hears synthesized “five” in the headset. Immediate verification. Multimodal approach. Any input can be echoed: –barcode scanned data, keyboard entry data, etc. MCL-Voice TTS synthesizer sees all data the same regardless of the original multimodal input source of the data.
38
Advanced Features Talk Over Experienced operators may speak the data before the prompt is given. Prompts are canceled if operator enters a valid response.
39
MCL-Voice Certification Guidelines and Best Practices
40
Why? Voice recognition is not a “black and white” science. How to position MCL-Voice. Benefits of MCL-Voice. Customer competitive advantages.
41
Why? Design and implementation of a successful voice application is done by following guidelines and best practices. Technical training teaches, for example: Avoid vocabulary items like “cherry” and “sherry” on the same data entry field. Avoid words and phrases with common initial words, like “orange” and “orange peel”. Avoid vocabulary words similar to environment background noise. Consider a work area equipped with air compressors. Avoid vocabulary words that end with “S” and “Z”, like “Pass”. The voice recognition engine might interpret the background hiss as a valid vocabulary word.
42
Requirements Training. Successful completion of the MCL-Collection Voice Technical Training course by at least two representatives from your company. Successful completion of the MCL-Collection Voice Sales Training course by at least one individual from your company. Equipment. Purchase of an MCL-Collection Voice Demo Kit by your company.
43
Company Certification Given to companies that satisfy all Certification Requirements. Tied to both the company and the trained individuals. Lost by a company when one or both trained individuals leave the company. Not automatically given to a company that hires individuals with prior MCL-Collection Voice training. Each company must satisfy all Certification Requirements to receive the MCL-Collection Voice Certification.
44
Company Certification Lists your company in an online directory. Entitles your company to purchase MCL-Voice products. Develops best practices technical skills to implement successful voice enabled applications. Provides understanding of the benefits voice brings to mobile workforce deployments. Elicits customer confidence. Gives access to MCL Support. Benefits
45
MCL-Designer Voice Enabling a Project
46
MCL-Designer Voice Enabling a Project Voice Enabling a project At Project level … At Screen level … At Screen Object level … At Process Object level …
47
MCL-Designer Voice Enabling a Project At Project level All settings defined at “Project Level” are used by all programs and processes of the project.
48
MCL-Designer Voice Enabling a project at Project level Terminal Setup Programs Local files System & User Variables Image files Fonts & Styles Keyboard definition Voice Settings MCL Project components The “Voice settings” define the “ASR” and “TTS” parameters. This includes the language and the speed definition, the global field words etc…
49
MCL-Designer Voice Enabling a project at Project level Voice settings
50
MCL-Designer Voice Enabling a project at Project level Settings Parameters: Speech In Speech Out Global Words Control Words Phonetic Substitution Table
51
MCL-Designer Voice Enabling a project at Project level Speech Out TextWill be spoken as ABC “alpha bravo charlie” Morning “morning” A12“alpha one two” A 12“alpha twelve” 25“twenty five” 123“one hundred twenty three” AB12D“alpha bravo one two delta”
52
MCL-Designer Voice Enabling a project at Project level Speech Out Special symbol #pound $dollar €euro &ersand *asterisk +plus -dash .point /slash \back slash =equals
53
MCL-Designer Voice Enabling a project at Project level The Input timers
54
MCL-Designer Voice Enabling a project at Project level Settings Parameters: Speech In Speech Out Global Words Control Words Phonetic Substitution Table
55
MCL-Designer Voice Enabling a project at Project level Settings Parameters: Speech In Speech Out Global Words Control Words Phonetic Substitution Table
56
MCL-Designer Voice Enabling a project at Project level Settings Parameters: Speech In Speech Out Global Words Control Words Phonetic Substitution Table Note: Words are case sensitive
57
MCL-Designer Voice Enabling a project At Screen level All settings defined at “Screen Level” are used by all objects and processes of the selected screen.
58
MCL-Designer Voice Enabling a project at Screen level MCL « Voice » program structure Process-In Process-In lines Voice object definitions Start of screen Options Clear Screen Backlight Screen Label Process-Out Process-Out lines Screen Display data, Input fields, (with «In Screen» processes) Buttons, Menu Etc… Next screen
59
MCL-Designer Voice Enabling a project at Screen level Enabling or Disabling Speech In & Speech Out independently
60
MCL-Designer Voice Enabling a project Screen Object level All settings defined at the level of a specific screen or process object are used by this object only.
61
MCL-Designer Voice Enabling a project at Screen Object Level Output Screen Objects Display Text Display Variable
62
MCL-Designer Voice Enabling at Screen Object Level Input Screen Objects Input Barcode and Keyboard Input Spin Input List Pull Down List Check boxes Radio buttons Text buttons and Image buttons Menu Text and Menu Buttons File Browse
63
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Input Barcode and Keyboard, Input Spin, Input List Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
64
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Input Barcode and Keyboard, Input Spin, Input List Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
65
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Input Barcode and Keyboard, Input Spin, Input List Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
66
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Input Barcode and Keyboard, Input Spin, Input List Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
67
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Pull Down List, Check box, Radio buttons Focus Focus Word Words Word List 1, 2 & 3 Audio Feedback
68
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Pull Down List, Check box, Radio buttons Focus Focus Word Words Word List 1, 2 & 3 Audio Feedback
69
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Pull Down List, Check box, Radio buttons Focus Focus Word Words Word List 1, 2 & 3 Audio Feedback
70
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Text buttons, Image buttons, Focus Focus Word
71
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Menu Text, Menu buttons Focus Prompt Words Word List 1, 2 & 3 Audio Feedback
72
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Menu Text, Menu buttons Focus Prompt Words Word List 1, 2 & 3 Audio Feedback
73
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control Menu Text, Menu buttons Focus Prompt Words Word List 1, 2 & 3 Audio Feedback
74
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control File Browse Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
75
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control File Browse Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
76
MCL-Designer Voice Enabling a project at Screen Object Level Voice Control File Browse Focus Prompt Words Word List 1, 2 & 3 Audio Feedback Completion Prompt
77
MCL-Designer Voice Enabling a project At Process Object level All settings defined at the level of a specific screen or process object are used by this object only.
78
MCL-Designer Voice Enabling a project at Process Object Level Double Click Process-Out Process-In
79
MCL-Designer Voice Enabling a project at Process Object Level In-Screen processes
80
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
81
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
82
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
83
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
84
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
85
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
86
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
87
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
88
MCL-Designer Voice Enabling a project at Process Object Level Voice Processes Speech Input Speech Output Play Sound Wave Noise Sample Voice Training Set Voice State Set Voice Operator Set Recognizer parameters Set Synthesizer parameters
89
MCL Voice: Principles
90
MCL-Client Voice Recognizer: Principle Voice Engine Settings Maximum delay between Words Default Timers For Input Fields NRT = Noise Rejection Timer For Combined Words MWT = Multiple Word Timer
91
MCL-Client Voice Recognizer: Principle Voice Recognizer Settings Sensitivity settings
92
MCL-Client Voice Recognizer: Principle ASR Voice Template Best Match Word Score Word List Threshold Data Signal +- Score
93
MCL-Designer Voice Enabling a project at Process Object Level Original image Same Image? Consider that the template of a word is like an image This is the “template” image This is the image with a certain level of noise YES
94
MCL-Designer Voice Enabling a project at Process Object Level Original image Same Image? This is the “template” image This is the image with the same level of noise NO
95
MCL-Designer Voice Enabling a project at Process Object Level Original image This is the “template” image Same Image? Maybe This is the image with a higher level of noise
96
MCL Voice Manager
97
Voice-Manager Management & Tuning Software for Voice Users & Terminals Log User Voice Data Date & time Terminal ID Spoken Word Word Score Result
98
Voice-Manager Management & Tuning Software for Voice Users & Terminals User Score over Time Average Score Per Hour Word & Noise Volume / Per Hour
99
Voice-Manager Management & Tuning Software for Voice Users & Terminals Statistical Analyzer Score Distribution Scoring Split Analysis Words versus Noise Volume Word Details Analyzer
100
Voice-Manager Management & Tuning Software for Voice Users & Terminals Analysis & Recommendation Suggested Word To Re-Train
101
MCL Client Installation & Activation
102
MCL-Client Activation The “MCL-Voice” client will generally be installed either on the Flash memory or on an SD card Installing on the Flash memory Using Activesync, Install the “MCL-Voice Client” on the device using the.exe file. Activesync will create the necessary folders and install the “MCL-Voice” client Installing on a SD card The.zip file must be unzipped and copied on the SD card. The card will then be placed in the terminal.
103
MCL-Client Activation Start MCL on the device Define the Terminal and Subnet ID’s Go to the Activation screen Enter the License number and the Activation code Notes: The MCL-Client with MCL-Voice does not support the “Demo mode” The terminal activation uses the “Off-Line” activation procedure. The MCL.key file is stored in the main folder of the MCL- Voice client.
104
MCL-Client System Menu System Menu This screen gives access to the different options of the System Menu The « Setup » option is used to access the « Voice » settings menu.
105
Q & A
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.