Presentation on theme: "How to use grammars in a more flexible way 1 Paolo Baggia 1 Feb 22 nd, 2007 SpeechTEK WEST How to Use Grammars in a More Flexible Way Paolo Baggia."— Presentation transcript:
How to use grammars in a more flexible way 1 Paolo Baggia 1 Feb 22 nd, 2007 SpeechTEK WEST How to Use Grammars in a More Flexible Way Paolo Baggia
How to use grammars in a more flexible way 2 Paolo Baggia Overview Inside the ASR Language Constraints Speech Grammars SLMs Pros and Cons More flexible grammars Garbage Techniques Experimental Results Applications Final Remarks
How to use grammars in a more flexible way 3 Paolo Baggia Loquendo Today Global company formed in 2001 as a spin-off from the Telecom Italia R&D center with over 30 years experience in Speech Technologies Complete set of Multilingual speech technologies on a wide spectrum of devices Full support of international standards (VoiceXML, MRCP, VoIP) Ready for challenging future scenarios: Multimodality, Security A Telecom Italia Group company HQ in Italy, Offices in US, Spain, Germany and France, and Worldwide Network of Parters “Best Innovation in Speech Synthesis” Prize AVIOS-SpeechTEK West 2006. “Best Innovation in Multi-Lingual Speech Synthesis” Prize AVIOS-SpeechTEK West 2005.
How to use grammars in a more flexible way 4 Paolo Baggia Inside the ASR engine ASR Acoustic Signal Recognized Words O = o 1, o 2, …, o n W = w 1, w 2, …, w m Likelihood Prior
How to use grammars in a more flexible way 5 Paolo Baggia Two Probabilities P(O|W) Likelihood Acoustic Models ASR engine P(W) Prior probability Language Constraints Application Developer – COSTLY ACTIVITY
How to use grammars in a more flexible way 6 Paolo Baggia Language Constraints On one side: Very constrained speech grammars On the other side: Statistical Language Models Proposal of a new tecniques in the middle Speech Grammars SLMs
How to use grammars in a more flexible way 7 Paolo Baggia Speech Grammars vs. SLMs SPEECH GRAMMARS Method: A compact and yet complete description of the user’s response Constraints: Rules have to correctly represent sentence construction Issues: –Standard syntax (W3C SRGS) –Standard results (Semantic Interpretation – W3C SISR) –Best performance for in grammar utt. –Grammar should cover all possible responses Cost of developing a grammar Cost of fine-tuning grammars (need for tuning tools) STATISTICAL LANGUAGE MODELS Method: Assesses the probability of word occurrence in a sentence Constraints: Probability of 2, 3, …, n preceding words (n-gram) Issues: –Easy to generate, but only with a specific corpus for each application –Difficult to assign probabilities to unforeseen events (smoothing techniques) Very large corpora are needed Time-consuming transcription of field data required to tune the SLMs
How to use grammars in a more flexible way 8 Paolo Baggia More Flexible Grammars How can we simplify grammar creation? We can focus on modeling just the relevant content, and not the rest of the phrase. Eg. (I’d like to travel) from Rome to Venice (please) (I need to go) from Rome to Venice (as soon as possible) (Well…I’m going…er…sorry, a ticket) from Rome to Venice Use a special grammar node to discard the rest of the sentence! Garbage
How to use grammars in a more flexible way 9 Paolo Baggia Simple usage More complex usages are possible. Still under evaluation! Where to put the Garbage rule? Garbage Relevant part Prefix Postfix
How to use grammars in a more flexible way 10 Paolo Baggia Example Grammar with Garbage Rule from out.from = rules.latest(); to out.to = rules.latest(); Rome Venice
How to use grammars in a more flexible way 11 Paolo Baggia Garbage Techniques a.General model Can be trained at an acoustic level as an average model or as the average of the first N-best activated unit A calibration of the general model could be problematic b.Filler words A vocabulary composed of different words from the speech grammar is inserted as a garbage node Efficiency and accuracy performance greatly depends upon the number and choice of the added words c.Phonetic models (Loquendo solution) A node containing a chosen subset of the phonetic units of the acoustic model is selected and used as a garbage node The “power” of the garbage could be easily modulated through the unit subset selection and the garbage node weights Vocabulary size is usually smaller than the list of ‘filler-words’
How to use grammars in a more flexible way 12 Paolo Baggia Experimental Results Garbage grammar accuracy should be analyzed in 2 ways –If the input speech matches the speech grammar The presence of garbage nodes should not affect recognition –If the input speech is partially covered by the speech grammar The garbage should cover several parts of the input speech Laboratory test bed: -0.99%English -0.10%Spanish -1.26%Italian Average % accuracy loss on Built-in Grammars: dates, pin codes, currency amounts, time expressions, …. 300 Filler Words (random selection) 90.5% 300 Filler Words (frequent syllables) 95.3 % 41 Filler Words (oracle -> test word) 98.6 % Phonetic garbage97.3 % Identifying keywords: months of the year, a test on spontaneous date expressions. The phonetic garbage outperforms the filler words technique except in the ORACLE case.
How to use grammars in a more flexible way 14 Paolo Baggia First Real Applications Quatro Rodas - Travel Guide Voice Portal (Brazil) phone number: +55 2140034842 Bizvox's innovative solution allows customers to access information on 16,000 businesses (cost of hotels, restaurants and tourist attractions), by means of a single local call from anywhere in Brazil. The user also receives an SMS with the correct address of the desired location. Automated Police Info Service - (Italy) www.carabinieri.it phone number: +39 06 80985232 An automated service for the police force providing info on joining the force and moving up through the service, on upcoming exams, exam results etc. The caller can speak freely and naturally thanks to an extensive use of garbage nodes. (experimental) Telecom Italia Directory Assistance
How to use grammars in a more flexible way 15 Paolo Baggia Final Remarks Develop ASR application is a costly activity, even if progresses have been done on standard formats and tools A technique based on GARBAGE nodes in grammars: – Greatly simplifies the grammar development – May be used to fast prototype system to be tuned in a second phase – Helps in noisy and chatty environments (garbage covers unwanted speech) – Promotes a more flexible dialog developments (from system guided dialogs to more open prompts, shift of contexts, etc.) First feedbacks from real applications
How to use grammars in a more flexible way 16 Paolo Baggia Thank You! For testing this feature go to: “Silver Bullet” demo In building 3, floor 4, “Unione Square” room #14 For more information please: Visit Loquendo’s booth #314 and try the ASR Keep an eye on: www.loquendo.comwww.loquendo.com Contact us: email@example.com
How to use grammars in a more flexible way 17 Paolo Baggia Live Demo – Corps of Carabinieri Go to www.carabinieri.it Corps of Carabinieri (armed force):www.carabinieri.it Click on “Operatore Virtuale” (Virtual Operator) Call +39 06 80985232 Information service about: Appliance tests, requirements, documents Calendar of the tests Results of the tests (20.000 people)
Your consent to our cookies if you continue to use this website.