Automatic identification of vocalic intervals in speech signal Jesus Garcia Antonio Galves Flaviane Fernandes Janaisa Viscardi Ulrike Gut Phonological.

Slides:



Advertisements
Similar presentations
An Adaptive, Dynamical Model of Linguistic Rhythm Sean McLennan Proposal Defense
Advertisements

Tom Lentz (slides Ivana Brasileiro)
Chapter 12: Speech and Music Perception
CS 551/651: Structure of Spoken Language Spectrogram Reading: Approximants John-Paul Hosom Fall 2010.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Basic Spectrogram & Clinical Application: Consonants
Vowel Formants in a Spectogram Nural Akbayir, Kim Brodziak, Sabuha Erdogan.
1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et.
Vowels (again) February 23, 2010 The News For Thursday: Give me a (one paragraph or so) description of what you’re thinking of doing for a term project.
SPEECH RECOGNITION 2 DAY 15 – SEPT 30, 2013 Brain & Language LING NSCI Harry Howard Tulane University.
Vineel Pratap Girish Govind Abhilash Veeragouni. Human listeners are capable of extracting information from the acoustic signal beyond just the linguistic.
Speech and speaker normalization (in vowel normalization)
AN ACOUSTIC PROFILE OF SPEECH EFFICIENCY R.J.J.H. van Son, Barbertje M. Streefkerk, and Louis C.W. Pols Institute of Phonetic Sciences / ACLC University.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
Structure of Human Speech Chris Darwin Vocal Tract.
1 WHISTLED TURKISH: STATISTICAL ANALYSIS OF VOWEL DISTRIBUTION AND CONSONANT MODULATIONS XVI ICPhS Universitat Politecnica.
The statistical analysis of acoustic correlates of speech rhythm.
Languages’ rhythm and language acquisition Franck Ramus Laboratoire de Sciences Cognitives et Psycholinguistique, Paris Jacques Mehler, Marina Nespor,
Sonority as a Basis for Rhythmic Class Discrimination Antonio Galves, USP. Jesus Garcia, USP. Denise Duarte, USP and UFGo. Charlotte Galves, UNICAMP.
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
Whispered Speech A Presentation by Susanne Filges, Agata Mroczkowska and Annette Radon.
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Representing Acoustic Information
Physics 1251 The Science and Technology of Musical Sound Unit 3 Session 31 MWF The Fundamentals of the Human Voice Unit 3 Session 31 MWF The Fundamentals.
Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
9 th Conference on Telecommunications – Conftele 2013 Castelo Branco, Portugal, May 8-10, 2013 Sara Candeias 1 Dirce Celorico 1 Jorge Proença 1 Arlindo.
Audio Scene Analysis and Music Cognitive Elements of Music Listening
Acoustic Phonetics 3/9/00. Acoustic Theory of Speech Production Modeling the vocal tract –Modeling= the construction of some replica of the actual physical.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
SEPARATION OF CO-OCCURRING SYLLABLES: SEQUENTIAL AND SIMULTANEOUS GROUPING or CAN SCHEMATA OVERRULE PRIMITIVE GROUPING CUES IN SPEECH PERCEPTION? William.
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Do Now Find the mean, median, mode, and range of each data set and then state which measure of central tendency best represents the data. 1)2, 3, 3, 3,
SIL Speech Analyzer: Tutorial Part 2 Dr. Barbara Brindle CD 508 – Voice Disorders Dr. Dudley Bryant PHYS Acoustics.
New Acoustic-Phonetic Correlates Sorin Dusan and Larry Rabiner Center for Advanced Information Processing Rutgers University Piscataway,
Introduction to Digital Speech Processing Presented by Dr. Allam Mousa 1 An Najah National University SP_1_intro.
Unit 5 Phonetics and Phonology. Phonetics Sounds produced by the human speech organs are called the “phonic/auditory medium” Phonetics is the study of.
Speech recognition Home Work 1. Problem 1 Problem 2 Here in this problem, all the phonemes are detected by using phoncode.doc There are several phonetics.
Detection of Vowel Onset Point in Speech S.R. Mahadeva Prasanna & Jinu Mariam Zachariah Department of Computer Science & Engineering Indian Institute.
IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
1 Acoustic Phonetics 3/28/00. 2 Nasal Consonants Produced with nasal radiation of acoustic energy Sound energy is transmitted through the nasal cavity.
Acoustic Phonetics 3/14/00.
$200 $400 $600 $800 $1000 $200 $400 $600 $800 $1000 $200 $400 $600 $800 $1000 $200 $400 $600 $800 $1000 $200 $400 $600 $800 $1000 $200.
ACOUSTIC CAMERA Copyright © PechoM Tüm hakları saklıdır.
Phonetics Unit 1.
Copyright © American Speech-Language-Hearing Association
Copyright © American Speech-Language-Hearing Association
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
Copyright © American Speech-Language-Hearing Association
Copyright © American Speech-Language-Hearing Association
Copyright © American Speech-Language-Hearing Association
English Phonetics and Phonology
Speech Database/Tool System And Preliminary Accent study.
Speech Perception.
Mark Sayles, Ian M. Winter  Neuron 
Phonology Project Waisman Center University of Wisconsin-Madison
Measured Period VOICE SIGNAL
The Production of Speech
Towards Automatic Fluency Assessment
“Speech: It’s Not as Acoustic as You Think”
Speech Perception (acoustic cues)
Speech Processing Dec. 11, 2006 YOUNG-CHAN LEE
Volume 54, Issue 6, Pages (June 2007)
Eugeniusz Cyran KUL, Lublin
Presentation transcript:

Automatic identification of vocalic intervals in speech signal Jesus Garcia Antonio Galves Flaviane Fernandes Janaisa Viscardi Ulrike Gut Phonological Support: Many thanks to: Dafydd Gibbon Emmanuel Dupoux Franck Ramus

Problem We have: and we want: v c v c

Identification Cues to the segmentation: " acoustic signal plot " spectrogram plot " listening

Acoustic Signal Spectrogram Noise Vowel Cons. Vowel. “A autoridade do governador diminuiu” Example:

Marca Noise Vowel Cons. Vowel.

vowels are recurrent

“t” signal:

c(i,t) = energy for the frequency i in time t

Relative entropy of the column t respect to the column s.

we use h(t-1,t), h(t-2,t) e h(t-3,t). plot for h(t): h(t)=h(t-1,t)+h(t-2,t) +h(t-3,t)

Distance between consecutives columns

Spectrogram for the sentence: ' A hurricane was announced this afternoon on the TV.'

Signal Relative Entropy hz Relative Entropy hz

Spectrogram and segmentation for ' A hurricane was announced this afternoon on the TV.'

Box Plots for the integral of the relative entropy for each language

Spectrogram and R. entropy for the sentence with the lower integral

Ramus classification (1999). %V = vowel percent DeltaC = standard deviation of consonantal intervals