OPTIMAL TEXT SELECTION ALGORITHM ASR Project Meetings Dt: 08 June 2004 - Rohit Kumar - LTRC, IIIT Hyderabad.

Slides:



Advertisements
Similar presentations
Information Retrieval in Practice
Advertisements

Longest Common Subsequence
CS252: Systems Programming Ninghui Li Program Interview Questions.
Analysis of Algorithms
Speech Recognition Part 3 Back end processing. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
ACOT Intro/Copyright Succeeding in Business with Microsoft Excel 2010: Chapter1.
Operations Research Assignment Problem.
S. P. Kishore*, Rohit Kumar** and Rajeev Sangal* * Language Technologies Research Center International Institute of Information Technology Hyderabad **
Data Compressor---Huffman Encoding and Decoding. Huffman Encoding Compression Typically, in files and messages, Each character requires 1 byte or 8 bits.
MULTI LINGUAL ISSUES IN SPEECH SYNTHESIS AND RECOGNITION IN INDIAN LANGUAGES NIXON PATEL Bhrigus Inc Multilingual & International Speech.
Bioinformatics Finding signals and motifs in DNA and proteins Expectation Maximization Algorithm MEME The Gibbs sampler Lecture 10.
CSE 830: Design and Theory of Algorithms
Unsupervised Learning of Natural Language Morphology using MDL John Goldsmith November 9, 2001.
1 Data Structures A program solves a problem. A program solves a problem. A solution consists of: A solution consists of:  a way to organize the data.
Chapter 2: Design of Algorithms
Text-To-Speech Synthesis An Overview. What is a TTS System  Goal A system that can read any text Automatic production of new sentences Not just audio.
Huffman Codes Message consisting of five characters: a, b, c, d,e
1 SSML Extensions for TTS in Indian Languages II workshop on Internationalizing SSML May 2006, Greece Nixon Patel and Kishore Prahallad Bhrigus.
Semantic and phonetic automatic reconstruction of medical dictations STEFAN PETRIK, CHRISTINA DREXEL, LEO FESSLER, JEREMY JANCSARY, ALEXANDRA KLEIN,GERNOT.
CS223 Algorithms D-Term 2013 Instructor: Mohamed Eltabakh WPI, CS Introduction Slide 1.
Introduction CSE 1310 – Introduction to Computers and Programming Vassilis Athitsos University of Texas at Arlington 1.
Unit III : Introduction To Data Structures and Analysis Of Algorithm 10/8/ Objective : 1.To understand primitive storage structures and types 2.To.
Copyright 2007, Toshiba Corporation. How (not) to Select Your Voice Corpus: Random Selection vs. Phonologically Balanced Tanya Lambert, Norbert Braunschweiler,
Reading Aid for Visually Impaired Veera Raghavendra, Anand Arokia Raj, Alan W Black, Kishore Prahallad, Rajeev Sangal Language Technologies Research Center,
Korea Maritime and Ocean University NLP Jung Tae LEE
26 Sep 2014Lecture 3 1. Last lecture: Experimental observation & prediction Cost models: Counting the number of executions of Every single kind of command.
Kishore Prahallad IIIT-Hyderabad 1 Unit Selection Synthesis in Indian Languages (Workshop Talk at IIT Kharagpur, Mar 4-5, 2009)
Presented by: Fang-Hui Chu Boosting HMM acoustic models in large vocabulary speech recognition Carsten Meyer, Hauke Schramm Philips Research Laboratories,
Modeling Speech using POMDPs In this work we apply a new model, POMPD, in place of the traditional HMM to acoustically model the speech signal. We use.
Major objective of this course is: Design and analysis of modern algorithms Different variants Accuracy Efficiency Comparing efficiencies Motivation thinking.
NLP Research Group Meeting ( 27. March ) Text Processing Front End for Indian Language TTS System Text Processing Front End Speech Synthesizer Phonetic.
Improving out of vocabulary name resolution The Hanks David Palmer and Mari Ostendorf Computer Speech and Language 19 (2005) Presented by Aasish Pappu,
An Iterative Approach to Extract Dictionaries from Wikipedia for Under-resourced Languages G. Rohit Bharadwaj Niket Tandon Vasudeva Varma Search and Information.
ALGORITHMS.
1 Centroid Based multi-document summarization: Efficient sentence extraction method Presenter: Chen Yi-Ting.
INVITATION TO Computer Science 1 11 Chapter 2 The Algorithmic Foundations of Computer Science.
HANGMAN OPTIMIZATION Kyle Anderson, Sean Barton and Brandyn Deffinbaugh.
EXTRA HELP WITH SYSTEMS OF EQUATIONS. SOLVING SYSTEMS OF EQUATIONS USING ELIMINATION Steps: 1. Place both equations in Standard Form, Ax + By = C. 2.
Claritas 3D REFSTAT: Refraction statics calculations.
Sorting & Searching Geletaw S (MSC, MCITP). Objectives At the end of this session the students should be able to: – Design and implement the following.
NATURAL LANGUAGE PROCESSING
 Problem Analysis  Coding  Debugging  Testing.
Advanced Data Structures Lecture 8 Mingmin Xie. Agenda Overview Trie Suffix Tree Suffix Array, LCP Construction Applications.
Algorithm Complexity is concerned about how fast or slow particular algorithm performs.
G. Anushiya Rachel Project Officer
Victorian Curriculum Mathematics F - 6 Algorithms unplugged
Advanced Algorithms Analysis and Design
Advanced Computer Systems
Auburn University
Information Retrieval in Practice
Applied Discrete Mathematics Week 2: Functions and Sequences
Text-To-Speech System for English
MARKETING PROCESS.
3-2: Solving Systems of Equations using Elimination
Algorithms Chapter 3 With Question/Answer Animations
Chapter 7: Strings and Characters
EXPERIMENTS WITH UNIT SELECTION SPEECH DATABASES FOR INDIAN LANGUAGES
Algorithm An algorithm is a finite set of steps required to solve a problem. An algorithm must have following properties: Input: An algorithm must have.
3.3: Solving Systems of Equations using Elimination
Objective of This Course
GC 211:Data Structures Algorithm Analysis Tools
Algorithm Discovery and Design
3-2: Solving Systems of Equations using Elimination
3-2: Solving Systems of Equations using Elimination
Rohit Kumar *, Amit Kataria, Sanjeev Sofat
Fourier Transform of Boundaries
3-2: Solving Systems of Equations using Elimination
Solving Systems of Equations using Elimination
Algorithms Tutorial 27th Sept, 2019.
Presentation transcript:

OPTIMAL TEXT SELECTION ALGORITHM ASR Project Meetings Dt: 08 June Rohit Kumar - LTRC, IIIT Hyderabad

OPTIMAL TEXT SELECTION ALGORITHM Basic Greedy Algorithm 1.Get the Frequency Distribution of basic units in a language by analyzing a large corpus 2.Iterate for as many sentence as you want to select 1.For Each Sentence on the Corpus Score the sentence for its desirability in Optimal Text 2.Choose the sentence with best score into Optimal Text 3.Delete Selected Sentence from Corpus 4.Update the frequency distribution based on the sentence selected

OPTIMAL TEXT SELECTION ALGORITHM Text Corpus Phonetizer Syllabifier This Gives you a sequence of phonemes Basically a set of sentences This Gives you a sequence of basic units Diphone, Triphones, Syllables … Unit Distribution Analysis Counts the number of occurrences of each basic unit Corpus Frequency Distribution Analysis Step Unit Frequency ka10000 ek8756 ne6593 …

OPTIMAL TEXT SELECTION ALGORITHM Sentence Phonetizer Syllabifier This Gives you a sequence of phonemes This Gives you a sequence of basic units Diphone, Triphones, Syllables … Corpus Frequency Distribution How to Score each Sentence 1. Each units is scored on the basis of its desirability. 2. Desirability is proportional to Frequency of the unit in large corpus 3. Sentence Score = Sum of Fn(Score of all units in the sentence) / Number of Units Ranking Algorithm Scoring Function could either be linear or Inverse function

OPTIMAL TEXT SELECTION ALGORITHM Sentence Phonetizer Syllabifier This Gives you a sequence of phonemes This Gives you a sequence of basic units Diphone, Triphones, Syllables … Sentence Level Unit Frequency Distribution How to Update the Frequency Distribution Unit Distribution Analysis Counts the number of occurrences of each basic unit Corpus Frequency Distribution For Each Unit in Sentence Frequency Distribution, Subtract Modify its corpus frequency by K x (Frequency of Unit in Sentence) Modified Corpus Frequency Distribution

OPTIMAL TEXT SELECTION ALGORITHM Issues 1.Complete Desirable Coverage will not be possible with one step simple selection as it will bring a large number of sentences into the optimal text. “Optimal Text means Maximum Coverage and Minimum Size” How to Solve Follow multiple small steps as described ahead

OPTIMAL TEXT SELECTION ALGORITHM Our Strategy for Optimal Text Selection 1. From the large database, filter out sentences that are not of length between 5 to 15 words 2. From the frequency analysis of the unit, choose a set of N units (out of total M units), whose frequency is higher than a threshold (say around above 100). 3. Select the sentences (say X) which cover these N units 4. Repeat the process again with P (P = M - N) units – but restrict the number of sentences to be not more than 2 * X 5. For all the remaining units, select words which cover these units

OPTIMAL TEXT SELECTION ALGORITHM Phonetizer: A class that takes as input a text and gives as output a sequence of phonemes. What Phonemes ?? We will be following ITrans-3 as the notation across all our work. WordItrans – 3Phonemes namastenamasten, a, m, a, s, t, e dhanywaddhanywaaddh, a, n, y, w, aa, d textilet’ekstaailt’, e, k, s, t, aa, i, l khabreinqhabren’qh, a, b, r, e, n’ krishnakrxshhnaak, rx, shh, n, aa

OPTIMAL TEXT SELECTION ALGORITHM Class Details 1.Class constructor 2.AddText (inputs string, no output) 3.GetPhoneme (no input, outputs one phoneme) 4.IsEmpty (no input, outputs flag if no text to work on left) 3 is the phonetizing function which breaks a text into phonemes and will broadly be the same of all languages. The list of phonemes is shown in the next slide

OPTIMAL TEXT SELECTION ALGORITHM Phoneme list (for hindi, minor modifications for other languages) a a1 aa aa* aa1 i ii u uu e e* e1 ai o au n' : *' * rx lx rxx lxx k kh g gh ng- ch chh j jh nj- t' t'h d ' d'h nd- t th d dh n n~ p ph b bh m y r r~ l l' l'~ v sh shh s s- h q qh gx z dr~ dd~ f y~

OPTIMAL TEXT SELECTION ALGORITHM Handling English Words 1.Dictionary Lookup 2.Letter to Sounds Module

OPTIMAL TEXT SELECTION ALGORITHM Implementation: Syllabifier Basic Units: Diphones (2 phones), Triphones (3 phones), Syllables Basically takes the phonemes from Phonetizer and gives units. So if we are working with triphones t’, e, k, s, t, aa, i, l >> t’-e-k, e-k-s, k-s-t, s-t-aa, t-aa-i, aa-i-l Class Details 1. Class constructor 2. AddPhoneme (inputs a string, no output) 3. GetUnit (no input, outputs one string) 4. IsEmpty (no input, outputs flag if no phonemes left)

OPTIMAL TEXT SELECTION ALGORITHM Effort for each Language 1. Collect the Corpus (most of Hindi, Telugu, Tamil, Marathi already available) 2. Automatic Cleaning and Conversions on the Corpus * English Word to ITrans Conversion by dictionary lookup 3. Modifying the Phonetizer (and Syllabifier) for the language 4. Running the OTS strategy 5. Manually Checking Selected Corpus and Corrections 6. Optional: Reiterating 1 or more steps in OTS Strategy if need be