Structure of Spoken Language


CS 551/651: Structure of Spoken Language
Lecture 6: Phonological Processes
John-Paul Hosom, Fall 2010

Phonological Processes
Phonemes undergo systematic variation depending on their context. For example, in forming the past tense:
  cause /k aa z/ → caused /k aa z d/
  talk /t aa k/ → talked /t aa k t/
The choice of /d/ vs. /t/ is predictable from the voicing of the word-final phoneme.
Allophones can be viewed as systematic variations of phonemes that result from cultural and/or physiological processes but do not distinguish the meaning of an utterance. For example, the choice between [p] and [ph] in English is predictable: word- or syllable-initial voiceless stops are aspirated.
  pit → [ph ih t[h]]
  tip → [th ih p[h]]
  kin → [kh ih n]
  spit → [s p ih t[h]]
  stick → [s t ih k[h]]
  skin → [s k ih n]
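The /d/ vs. /t/ choice described on this slide can be sketched in a few lines of Python. This is a minimal illustration rather than anything from the lecture itself: the `VOICED` set and the function name `past_tense` are assumptions, and the sketch deliberately ignores stems that already end in /t/ or /d/, which take a syllabic suffix in English.

```python
# Voiced phonemes in the slides' ASCII notation (an assumed, partial inventory).
VOICED = {
    "b", "d", "g", "v", "dh", "z", "zh", "jh", "m", "n", "ng", "l", "r", "w", "y",
    "aa", "ae", "ah", "ax", "eh", "er", "ih", "iy", "uh", "uw",
    "ey", "ay", "oy", "aw", "ow",
}

def past_tense(phonemes):
    """Append /d/ after a voiced final phoneme and /t/ after a voiceless one."""
    suffix = "d" if phonemes[-1] in VOICED else "t"
    return phonemes + [suffix]

print(past_tense(["k", "aa", "z"]))  # cause -> ['k', 'aa', 'z', 'd']
print(past_tense(["t", "aa", "k"]))  # talk  -> ['t', 'aa', 'k', 't']
```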

Phonological Processes
pit, tip, kin: /ph ih th/ /th ih ph/ /kh ih n/
spit, stick, skin: /s p ih th/ /s t ih kh/ /s k ih n/

Phonological Processes
Other types of phonetic processes: assimilation, deletion, reduction, insertion, substitution, and metathesis (switching the order of two phonemes).
Assimilation: “A feature of one segment is shared by a neighboring segment”
Examples of assimilation:
- Nasalization of vowels before nasal consonants
- in- (negative prefix) becomes im- in words beginning with a bilabial consonant (imbalance, imperfect; compare indifferent, intolerance)
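As a rough illustration of the in-/im- alternation, the following Python sketch picks the prefix from the first letter of the stem. It works on spelling rather than on phonemes, which is a simplifying assumption, and the function name `negative_prefix` is hypothetical. It does not attempt the il-/ir- variants of the same prefix.

```python
# Letters that normally spell a bilabial consonant at the start of a stem
# (an orthographic approximation of the phonological condition).
BILABIAL_LETTERS = {"b", "p", "m"}

def negative_prefix(stem):
    """Attach im- before a bilabial-initial stem and in- otherwise."""
    prefix = "im" if stem[0].lower() in BILABIAL_LETTERS else "in"
    return prefix + stem

for stem in ["balance", "perfect", "different", "tolerance"]:
    print(negative_prefix(stem))
# imbalance, imperfect, indifferent, intolerance
```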

Phonological Processes
Assimilation may be due to coarticulation, or it may be language-specific and “arbitrary”:
“word-final alveolar obstruent may take on place of articulation of following word-initial segment if word-initial segment is palato-alveolar”
  this /dh ih s/ + shop /sh aa ph/ → this shop /dh ih sh sh aa ph/
  this /dh ih s/ + fish /f ih sh/ → this fish /dh ih s f ih sh/
  this /dh ih s/ + thing /th ih ng/ → this thing /dh ih s th ih ng/
Also, depending on dialect, the assimilation does not apply within a word: misshapen /m ih s sh ei p en/
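The cross-word rule can be sketched as a function over two phoneme lists. This is a simplified illustration under stated assumptions: only /s/ and /z/ are treated as the word-final alveolar obstruents, the `PALATO_ALVEOLAR` set and the name `join_words` are hypothetical, and the rule is applied unconditionally even though the slide says it only "may" apply.

```python
# Word-initial segments that trigger the assimilation (assumed inventory).
PALATO_ALVEOLAR = {"sh", "zh", "ch", "jh"}
# Alveolar fricatives and their palato-alveolar counterparts.
SHIFT = {"s": "sh", "z": "zh"}

def join_words(first, second):
    """Concatenate two words, assimilating a final /s/ or /z/ before a palato-alveolar."""
    if first and second and first[-1] in SHIFT and second[0] in PALATO_ALVEOLAR:
        return first[:-1] + [SHIFT[first[-1]]] + second
    return first + second

this = ["dh", "ih", "s"]
print(join_words(this, ["sh", "aa", "ph"]))  # ['dh', 'ih', 'sh', 'sh', 'aa', 'ph']
print(join_words(this, ["f", "ih", "sh"]))   # ['dh', 'ih', 's', 'f', 'ih', 'sh']
```

Because the function only looks at the boundary between two separate lists, it never fires inside a single word such as "misshapen", which matches the dialect-dependent restriction noted on the slide.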

Phonological Processes
Example of assimilation of /s/ with /sh/ but not /f/:
  this shop /dh ih sh sh aa pcl ph/
  this fish /dh ih s f ih sh/

Phonological Processes
Substitution: common in foreign accents or speaking impairments:
  welcome /v eh l k ah m/
  McDonald /m a k uw d ow n aa r uw d ow/
  Roger /w aa jh er/
Metathesis: changing the order of two phonemes within a word (dialect variation):
  pretty /p er dx iy/
  ask /ae k s/
For the history of ask/aks, Google “axe ask england”: http://www.randomhouse.com/wotd/index.pperl?date=19991216

Phonological Processes
Deletion:
  Barbara /b aa r b ax r ah/ → /b aa r b r ah/
  memory /m eh m ax r iy/ → /m eh m r iy/
Reduction: unstressed vowels become /ax/
  conduct (verb) /k ax n d ah k t/
  conduct (noun) /k aa n d ax k t/
Insertion:
- A voiceless stop is inserted between a nasal and a voiceless consonant; the inserted stop always has the same place of articulation as the nasal
  fancy /f ae n t s iy/
  Chomsky /ch aa m p s k iy/
- Schwa is inserted after a word-final nasal
  nine /n ay n ax/
dictionary pronunciation=
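The stop-insertion rule lends itself to a short sketch. The mapping from nasal to inserted stop follows the "same place of articulation" condition on the slide; everything else is an assumption made for illustration: the function name `insert_epenthetic_stop` is hypothetical, the trigger is narrowed to voiceless fricatives, and stress is ignored.

```python
# Each nasal paired with the voiceless stop at the same place of articulation.
NASAL_TO_STOP = {"m": "p", "n": "t", "ng": "k"}
# Voiceless fricatives assumed to trigger the insertion.
VOICELESS_FRICATIVES = {"f", "th", "s", "sh"}

def insert_epenthetic_stop(phonemes):
    """Insert a homorganic voiceless stop between a nasal and a voiceless fricative."""
    out = []
    for i, ph in enumerate(phonemes):
        out.append(ph)
        nxt = phonemes[i + 1] if i + 1 < len(phonemes) else None
        if ph in NASAL_TO_STOP and nxt in VOICELESS_FRICATIVES:
            out.append(NASAL_TO_STOP[ph])
    return out

print(insert_epenthetic_stop(["f", "ae", "n", "s", "iy"]))        # fancy
# ['f', 'ae', 'n', 't', 's', 'iy']
print(insert_epenthetic_stop(["ch", "aa", "m", "s", "k", "iy"]))  # Chomsky
# ['ch', 'aa', 'm', 'p', 's', 'k', 'iy']
```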

Phonological Processes Deletion: /m eh m r iy/

Phonological Processes Insertion: /f ae n t s iy ch aa m p s k iy/

Phonological Processes: Ladefoged Rules
[–voiced, +stop] → [+aspirated] when syllable-initial: pit vs. spit
[ax] → [–voiced] after syllable-initial [–voiced, +stop] and before [–voiced, +stop]: potato
[+consonantal] → longer at end of phrase: bib, did, don, nod
[–voiced, +stop] → [–aspirated] after syllable-initial /s/: spew, stew, skew
[+vowel] → shorter before unvoiced phonemes in same syllable: cap vs. cab, back vs. bag
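The first and fourth rules (aspiration of syllable-initial voiceless stops, and no aspiration after syllable-initial /s/) can be sketched together for a single syllable. This is a minimal illustration that assumes the input is already one syllable; the name `aspirate_initial` and the convention of marking aspiration by appending "h" to the stop are assumptions taken from the transcriptions used elsewhere in these slides.

```python
VOICELESS_STOPS = {"p", "t", "k"}

def aspirate_initial(syllable):
    """Mark a syllable-initial voiceless stop as aspirated (p -> ph, t -> th, k -> kh).

    A stop that follows syllable-initial /s/ is not in initial position,
    so it is left unaspirated.
    """
    if syllable and syllable[0] in VOICELESS_STOPS:
        return [syllable[0] + "h"] + syllable[1:]
    return list(syllable)

print(aspirate_initial(["p", "ih", "t"]))       # pit  -> ['ph', 'ih', 't']
print(aspirate_initial(["s", "p", "ih", "t"]))  # spit -> ['s', 'p', 'ih', 't']
```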

Phonological Processes: Ladefoged Rules Devoicing, End-of-Phrase Length: /ph ax tcl th ey dx ow/ /d aa n n aa dcl d/

Phonological Processes: Ladefoged Rules
Length before Voiceless: /kh ae pc ph kh ae bc b b ae kc kh b ae gc g/

Phonological Processes: Ladefoged Rules
[–voiced] → longer when at end of syllable: sass, shook vs. push
[+stop] → unreleased before [+stop]: apt, act (often see some mark in spectrogram)
[–voiced, +alveolar, +stop] → [+glottal stop] when before an alveolar nasal in same word: beaten → /b iy q en/
[+nasal] → [+syllabic] at word end when following [+obstruent]: chasm → /k ae z em/, NOT film (obstruents are stops, fricatives, and affricates, which substantially obstruct the airway; /l/ is not an obstruent)
[+liquid] → [+syllabic] at word end and following [+consonant]: paddle, whistle, kennel, razor, hammer, tailor, NOT snarl; change to “following [+obstruent]”?
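The glottal-stop rule is the simplest of these to express as a rewrite over a phoneme list. The sketch below is an illustration only: the function name `glottalize` is hypothetical, and treating both /n/ and syllabic /en/ as "alveolar nasal" is an assumption about the notation.

```python
# Plain and syllabic alveolar nasals in the slides' notation (assumed).
ALVEOLAR_NASALS = {"n", "en"}

def glottalize(word):
    """Replace /t/ with the glottal stop /q/ when an alveolar nasal follows in the same word."""
    out = list(word)
    for i in range(len(word) - 1):
        if word[i] == "t" and word[i + 1] in ALVEOLAR_NASALS:
            out[i] = "q"
    return out

print(glottalize(["b", "iy", "t", "en"]))  # beaten -> ['b', 'iy', 'q', 'en']
```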

Phonological Processes: Ladefoged Rules /ae pcl tcl th ae kcl tcl th/ /bcl b iy q tcl en ax_h/

Phonological Processes: Ladefoged Rules
[+alveolar, +stop] → [+voiced, +flap] when between two vowels, the second of which is unstressed (this rule has speaker-dependent variations)
[+alveolar, +stop] → omitted between two consonants: most people, sandpaper, grand master
[+consonant] → shortened before identical [+consonant]
∅ → [–voice, +stop] between [+nasal] and [–voice, +fricative] when following vowel absent or unstressed: prince vs. prints (epenthesis)
∅ → [&] (schwa) following word-final [+nasal, +consonantal]: nine, come, sang (epenthesis)
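The flapping rule needs stress information, so the sketch below assumes vowels carry a trailing stress digit ("1" stressed, "0" unstressed), in the style of CMU dictionary entries. That encoding, the `VOWELS` inventory, and the function name `flap` are all assumptions for illustration; the speaker-dependent variation noted on the slide is not modeled.

```python
VOWELS = {"aa", "ae", "ah", "ax", "eh", "er", "ih", "iy", "uh", "uw",
          "ey", "ay", "oy", "aw", "ow"}

def is_vowel(ph):
    """True for a vowel symbol, with or without a trailing stress digit."""
    return ph.rstrip("01") in VOWELS

def is_unstressed_vowel(ph):
    return is_vowel(ph) and ph.endswith("0")

def flap(phonemes):
    """Rewrite /t/ or /d/ as the flap /dx/ between two vowels when the second is unstressed."""
    out = list(phonemes)
    for i in range(1, len(phonemes) - 1):
        if (phonemes[i] in {"t", "d"}
                and is_vowel(phonemes[i - 1])
                and is_unstressed_vowel(phonemes[i + 1])):
            out[i] = "dx"
    return out

# potato: the first /t/ precedes a stressed vowel and is kept; the second is flapped.
print(flap(["p", "ax0", "t", "ey1", "t", "ow0"]))
# ['p', 'ax0', 't', 'ey1', 'dx', 'ow0']
```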

Phonological Processes: Ladefoged Rules “most people and grand masters use sandpaper” /m ow s pc ph iy pc ph el n gc g r ae n m ae s tc th er z yu z s ae n pc ph ey pc ph er/

Phonological Processes: Ladefoged Rules “nine come sang” /n ay n ax kcl kh ah m ax s ae ng ax/

Phonological Processes: Ladefoged Rules
[+vowel] → longer in open syllables: sea vs. seed vs. seat; sigh vs. side vs. sight (equalizes length of syllables with differing numbers of segments)
[+vowel] → longer in stressed syllable: below vs. billow (stressed syllables are longer in duration than unstressed)
[+vowel] → [+nasal] before [+nasal] consonant
[+vowel, –stressed] → schwa (vowel reduction): able vs. ability, Canada vs. Canadian, photograph vs. photography

Phonological Processes: Ladefoged Rules
“sigh side sight” /s ay s ay dcl d s ay tcl th/

Phonological Processes: Ladefoged Rules “below billow” /b ax l ow b ih l ow/

Phonological Processes
Why is this useful?
(a) Providing models of known phenomena is better than having a classifier learn the phenomena from data
(b) Provides humans with appropriate cues for understanding, naturalness
(c) Accurate phonetic modeling improves the ability of a classifier to discriminate between classes
Example for Text-to-Speech (case (b)):
- Create a TTS system
- Don’t shorten vowels before voiceless plosives
- Creates, by default, an acoustic cue for voiced plosives
- Decreases intelligibility or at least naturalness of system

Phonological Processes
Example for Automatic Speech Recognition (case (c)):
- Train a speech recognizer using “dictionary” pronunciations
- Then, in all cases where a [–voice, +stop] is inserted between [+nasal] and [–voice, +fricative], such as “fancy” (in CMU dictionary as /f ae n s iy/), the acoustics show an alveolar stop, but that segment is trained as either nasal /n/ or fricative /s/
- Decreases ability of model to discriminate classes
- Decreases performance of system
The difficulty is in providing comprehensive, accurate rules that are not inappropriately “forced” on a system.