Three examples of sound-system research using web-available materials Andy Wedel LSA Summer Institute: The Data Goldmine July 9, 2015.

Slides:



Advertisements
Similar presentations
Tom Lentz (slides Ivana Brasileiro)
Advertisements

Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:
Lexical functional load predicts the direction of phoneme system change SCIHS Berkeley 2014 Andrew Wedel University of Arizona Scott Jackson University.
Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
{ “Age” Effects on Second Language Acquisition Examination of 4 hypotheses related to age and language learning
Speech Perception Dynamics of Speech
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
JPN494: Japanese Language and Linguistics JPN543: Advanced Japanese Language and Linguistics Phonology & Phonetics (2)
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
NOVA Comprehensive Perspectives on Child Speech Development and Disorders Chapter 10 Acquiring French Andrea MacLeod 1.
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
Results ISI Variance in STP Corpus ISI Variance in BU Corpus * p
Phonology Phonology is essentially the description of the systems and patterns of speech sounds in a language. It is, in effect, based on a theory of.
Development of Automatic Speech Recognition and Synthesis Technologies to Support Chinese Learners of English: The CUHK Experience Helen Meng, Wai-Kit.
Development of coarticulatory patterns in spontaneous speech Melinda Fricke Keith Johnson University of California, Berkeley.
Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.
Yao LSA Separating speaker- and listener- oriented forces in speech – Evidence from phonological neighborhood density.
The Phonetic Space of Phonological Categories in Heritage Speakers of Mandarin The 44 th Annual Meeting of the Chicago Linguistics Society 24 April 2008.
Linguisitics Levels of description. Speech and language Language as communication Speech vs. text –Speech primary –Text is derived –Text is not “written.
Research on teaching and learning pronunciation
Sound and Speech. The vocal tract Figures from Graddol et al.
Chapter three Phonology
Morphological information and acoustic salience in Dutch compounds Victor Kuperman, IWTS Radboud University Nijmegen.
Usage vs Acquisition in Language Change Andrew Wedel and Clay Beckner Language as a Complex System Workshop University of Arizona, 2008.
Redundancy Ratio: An Invariant Property of the Consonant Inventories of the World’s Languages Animesh Mukherjee, Monojit Choudhury, Anupam Basu and Niloy.
-- A corpus study using logistic regression Yao 1 Vowel alternation in the pronunciation of THE in American English.
Phonology, phonotactics, and suprasegmentals
Acoustic and Linguistic Characterization of Spontaneous Speech Masanobu Nakamura, Koji Iwano, and Sadaoki Furui Department of Computer Science Tokyo Institute.
A Complex Systems Approach to Language Patterning Andrew Wedel University of Arizona April 10, 2008.
Sebastián-Gallés, N. & Bosch, L. (2009) Developmental shift in the discrimination of vowel contrasts in bilingual infants: is the distributional account.
Diphthongs Five most frequent diphthongs in Māori are /ai ae au ou ao/. Mergers between /ai~ae/ and /au~ou/ [3] (Figure 2). Only one of these mergers is.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
Nasal endings of Taiwan Mandarin: Production, perception, and linguistic change Student : Shu-Ping Huang ID No. : NA3C0004 Professor : Dr. Chung Chienjer.
Some thoughts on modelling phonetic effects in corpora.
TagHelper: Basics Part 1 Carolyn Penstein Rosé Carnegie Mellon University Funded through the Pittsburgh Science of Learning Center and The Office of Naval.
The Linguistics of Second Language Acquisition
Statistical learning, cross- constraints, and the acquisition of speech categories: a computational approach. Joseph Toscano & Bob McMurray Psychology.
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
1 The Ferret Copy Detector Finding short passages of similar texts in large document collections Relevance to natural computing: System is based on processing.
Copyright 2007, Toshiba Corporation. How (not) to Select Your Voice Corpus: Random Selection vs. Phonologically Balanced Tanya Lambert, Norbert Braunschweiler,
PHONETICS & PHONOLOGY 3/24/2014. AGENDA GO OVER CORRECTED HOMEWORK IN PAIRS/SMALL GROUPS (5 MIN) MAKE ANY CORRECTIONS TO HWK DUE TODAY, THEN TURN IN (5.
Ch 7 Slide 1  Rule ordering – when there are multiple rules in the data, we have to decide if these rules interact with each other and how to order those.
A prosodically sensitive diphone synthesis system for Korean Kyuchul Yoon Linguistics Department The Ohio State University.
Linguistic Items Linguistic Items: — vocabulary ( “ lexical items, or “ lexemes) — sound-pattern ( “ sound ” ) — larger syntactic patterns ( “ constructions)
Ch 3 Slide 1 Is there a connection between phonemes and speakers’ perception of phonetic differences? (audibility of fine distinctions) Due to phonology,
Lecture 2 Phonology Sounds: Basic Principles. Definition Phonology is the component of linguistic knowledge concerned with rules, representations, and.
Introduction to Speech Neal Snider, For LIN110, April 12 th, 2005 (adapted from slides by Florian Jaeger)
LING 580: Today Goals: 1. What constitute possible changes for the vowel systems of natural languages? 2. Schools of thought (McMahon 2) Neogrammarian.
Article Summary of The Structural Complexity of Software: An Experimental Test By Darcy, Kemerer, Slaughter and Tomayko In IEEE Transactions of Software.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Combining Speech Attributes for Speech Recognition Jeremy Morris November 9, 2006.
Today A. Sound Change, continued: Rotations, Mergers and Splits Understanding the Great Vowel Shift Bringing synchronic data to bear on past data (Milroy.
Language Perception.
A Maximum Entropy Language Model Integrating N-grams and Topic Dependencies for Conversational Speech Recognition Sanjeev Khudanpur and Jun Wu Johns Hopkins.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Against formal phonology (Port and Leary).  Generative phonology assumes:  Units (phones) are discrete (not continuous, not variable)  Phonetic space.
Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
Welcome to All S. Course Code: EL 120 Course Name English Phonetics and Linguistics Lecture 1 Introducing the Course (p.2-8) Unit 1: Introducing Phonetics.
Consumer guide to phonological evidence (Oostendorp)  What is used as evidence in phonology?
English Vowels and diphthongs
VISUAL WORD RECOGNITION. What is Word Recognition? Features, letters & word interactions Interactive Activation Model Lexical and Sublexical Approach.
Introduction to Linguistics
CHAPTER 5 This chapter introduces students to the study of linguistics. It discusses the basic categories and definitions used to study language, and the.
Consonant variegations in first words: Infants’ actual productions of
Job Google Job Title: Linguistic Project Manager
Understanding Variation of VOT in spontaneous speech
Detecting evolutionary forces in language change (2017)
Review for Test 2.
Presentation transcript:

Three examples of sound-system research using web-available materials Andy Wedel LSA Summer Institute: The Data Goldmine July 9, 2015

1.Functional load and diachronic phoneme inventory change –Published literature on sound-change in combination with phonemically-coded corpora 2.Lexical competition and hyperarticulation in natural speech –Phonetic measures in the Buckeye Corpus in combination with lexical data on English 3.Correlation between crosslinguistic and language-internal phoneme frequencies –A database of phoneme inventories combined with available phonemically-coded corpora

Organizational steps in research What is the question? –Identify your general hypothesis What is the approach? –Operationalize your hypothesis –Develop a method/experiment Find data/create materials Analysis/Results Dissemination

1. Functional load and diachronic phoneme inventory change With: Abby Kaplan Department of Linguistics University of Utah Scott Jackson Center for the Advanced Study of Language (CASL) University of Maryland

Functional load and diachronic phoneme inventory change 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Phoneme inventories change over time 6

–Gilliéron (1918), Jakobson (1931), Mathesius (1931), Trubetzkoy (1939) –Martinet (1952), King (1967), Hockett (1967) –Surendran & Niyogi (2006), Silverman (2011), Kaplan (2011) Does the functional load of a phoneme contrast influence its trajectory of change?

Functional load “The notion of functional load is that a phonemic system … has a (quantifiable) job to do, and that the contrast between any two phonemes, say /a/ and /b/, carries its share.” Charles Hockett

Functional load Specific Hypothesis:  Neutralization is less likely for contrasts that have a higher functional load. (Martinet 1955, Hockett 1967) 9

Phoneme Mergers / ɑ ~ ɔ / merger in western American English cot ɑ ɔ caught

How has functional load been operationalized? In terms of the lexicon: –Number of minimal pairs (Martinet 1955) Various ways of counting number of homophones (Silverman 2009, Kaplan in press) –Lexical level entropy (Surendran and Niyogi 2006) In terms of the sound system –Type or token phoneme frequency (Currie-Hall 2010) –Phoneme level entropy (Hockett 1967, King 1967, Surendran and Niyogi 2006)

Why hadn’t this been successfully tested before? Previous approaches involve case-studies: 1.Find a contrast merger or set of mergers 2.Assess the change in the system given your favorite measure of functional load 3.Compare to a set of similar contrasts that did not merge. 4.Is the change in the system smaller for the actual mergers than for the non-mergers? Problem: if we assume that functional load is just one of many factors influencing sound change, we expect many ‘exceptions’ to the hypothesis.  We need to assess outcomes statistically. 12

Functional load and diachronic phoneme inventory change 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Strategy for dealing with data sparseness, diversity of data source 1.Pool data on mergers from multiple languages. 2.Use linear mixed effects modeling. –Random effects structure helps control for structure inherent in different data-sources.

What’s the balance between hypothesis generation and testing? Broad general hypothesis to be tested: –Functional load predicts merger Narrower hypotheses to be explored: –what specific measure(s) of functional load are predictive?

Functional load and diachronic phoneme inventory change 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Building a database Hockett 1955 “... unfortunately the determination [of functional load] has not been made yet [because] the amount of counting and computation is formidable, so we can give no example...”  Use existing frequency corpora to build a large database of reasonably recent mergers and associated comparison sets. 17

Find word lists from a variety of languages We don’t know what measure of functional load is appropriate: want to be able to test a variety of measures –Minimal pair count –Average neighborhood density –System entropy Requirements for each word list: –Phonemically coded –Lemmatized –Frequency

German Dutch, RP English American English Spanish French Turkish Korean HK Cantonese Find word lists from a variety of languages

Material won’t perfectly match your question Key! –Always keep your eyes open for new data sources. –Be ready to do some work to transform information into a form appropriate for your question. –You’ll often have to make semi-arbitrary decisions Keep notes, and be ready to describe/defend your choices. Examples differing in ease: –Turkish > American English > Spanish

Turkish: easy to work with Obtained by ing authors Easy to work with: –Orthographic coding already near-phonemic coding is pre-merger –Morphologically parsed into stem + affixes –Syntactic category given –ArisoyTurkishDataArisoyTurkishData –LemmaFormsLemmaForms

American English: moderately easy to work with Get standard US pronunciation from Carnegie-Mellon Pronouncing Dictionary (CMUDict) (CMUDict) Frequency databases freely available –CELEX, SubtlexUSCELEX –How to deal with homographs? Example output files with ND calculated –LemmaFormsLemmaForms

Spanish: More complex Spanish Gigaword corpus (Linguistics Data Consortium) –Text files from newswires –ExampleExample Use TreeTagger to morphologically parse and add categoriesTreeTagger Example of output Map to phonemic representation and count Show code and outputcodeoutput

Looking for changes of interest Look through the literature for diachronically recent phoneme mergers in varieties of these languages that share the same phonemic inventory as the dialect on which the word list is based. –For example: American and RP English have distinct vowel inventories; RP and Australian English share phoneme inventories, even though they are phonetically different.

Looking for changes of interest Identify a set of comparison phonemes of the same major class (consonant, vowel) as the merged phoneme pair that are phonologically similar. –1 basic feature distant, e.g., t ~ d, t ~ k, u ~ o

56 mergers 524 non-mergers 8 languages phoneme-pair systems: Each contains at least one merger, and as comparisons, all other phoneme pairs in the same major class (vowel or consonant) that are one phonological feature apart. Wedel, A., A. Kaplan & S. Jackson (2013). Language and Speech. Wedel, A., S. Jackson A. Kaplan (2013). Cognition.

Independent measures Lexical measures: –Number of minimal pairs distinguished by each phoneme pair Write a script that goes through each phonemic form, merges the contrast using a regular expression, and counts how many other phonemic forms it becomes identical to. –Lemma vs word-form counts –Within/across word category

Independent measures Lexical measures: –Number of lexical ‘prefixes’ distinguished by each phoneme pair (Cohen-Priva, in press) –Average neighborhood density for words containing each phoneme –Lexical entropy change on merger (Surendran & Niyogi 2006)

Calculating functional load in terms of informational entropy (Shannon 1951) General form (Hockett 1967, Surendran and Niyogi 2006) : FL(a ↔ b) = H(L) − H(La↔b) H(L) where 29

Independent measures Sublexical measures: –Phoneme type/token frequencies uniphone, biphone, triphone –Sublexical entropy change upon merger –Dataset exampleDataset example

Functional load and diachronic phoneme inventory change 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Number of minimal pairs is inversely correlated with merger 32 Wedel, A., A. Kaplan & S. Jackson (2013). Language and Speech.

What kind of minimal pairs? Lemma vs word form? Within vs Between Category? Frequency?

What does not seem to substitute for minimal pairs in this effect? Lexical measures –Neighborhood measures –Lexical entropy change Sublexical measures –sublexical entropy changes –uniphone, biphone, triphone probabilities

Intriguing: Higher phoneme frequency is positively correlated with merger …but only for phoneme pairs that don’t distinguish minimal pairs.

Example model predictions 36 American English

What about changes that might index avoidance of merger? Phoneme Shift: concerted shift of a phoneme pair in the same dimensional space. Phoneme Split: merger of a contrast associated with enhancement of an associated contrast in a different dimension.

Phoneme Shifts - California Vowel shift fat u ɑ æ o ɛ ɪ dude dress

Phoneme Splits –Vowel length split in Pittsburgh English town ~ ton ta ʊ n ~ t ʌ n  t ʌ :n ~ t ʌ n a ʌ town ton ʊ

What’s the balance between hypothesis generation and testing? We already have a strong prediction that a small number of within-category minimal lemma pairs predicts merger. Narrower hypothesis to be explored: –Shifts and splits… which are phoneme inventory changes that preserve lexical distinctions… – are correlated with a significantly larger number within-category minimal lemma pairs.

Get examples of shifts/splits in our set of languages Shifts –Spanish voiced/voiceless stop pairs Lewis 2000 –American English vowel shifts: Northern cities, Southern Shift Labov et al –NZ English front vowel shifts Hay, Macglagan, & Gordon 2008 –Polder Dutch diphthongs Jacobi 2009 –Canadian French vowel shift Walker 1983

Database of Shifts/Splits Splits –Pittsburgh / ɑʊ ~ ʌ /, Inland North /e ~ ɑ /  vowel length Labov et al, 2006 –Turkish ɣ deletion  vowel length Lewis 1967 –NZE /dress ~ fleece/  diphthongization Maclagan and Hay, 2005 –Korean onsets /lax ~ aspirated/  tone Silva 2006

Mergers versus Shifts and Splits phoneme mergers phoneme splits/shifts

Can we predict the direction of change? Given a phoneme-inventory change, was it –a change that reduces lexical distinctions?  a merger –a change that preserves lexical distinctions?  a shift or a split

Given a change, predicting its type log minimal lemma pair count Merger Shift/Split

Individual datasets

New insights The distribution of a phonological contrast across the lexicon influences the trajectory of change in that phonological contrast. Results in maintenance of a compact phoneme inventory. –Contrasts that support few lexical contrasts tend to be lost. –Contrasts that support more lexical contrasts are preserved, or provide seed variation for new contrasts.

Take home message with respect to big data and computation… New data sources, models and technologies allow us to better test hypotheses concerning the relationship of the form of sound systems to their function in communication.

2. Lexical competition and hyperarticulation in natural speech With: Becky Sharp Department of Linguistics University of Arizona

Lexical competition and hyperarticulation in natural speech 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Big question If the existence of minimal pairs influences change in a phoneme contrast, what are the mechanisms, at various levels? Theoretical Prediction: (e.g., Lindblom 1990, Wright 2004, Wedel 2012…)  Phonetic cues that support communication are hyperarticulated in usage.  Consistent usage biases drive phonological change and pattern formation.

Change in distribution of word variants Change in distribution of sublexical variants across the lexicon e.g., Baudouin de Courtenay 1895, Ohala 1989, Lindblom 1990, Bybee 2001, Blevins 2004, Baese-Berk & Goldrick 2009, Ernestus 2011, Wedel et al. in press Wang 1969, Bybee 2002, Phillips 2006, Kraljic & Samuel 2009, Hay and Maclagan 2012 Theoretical/Linguistic/Experimental evidence: Articulation Perception Cognitive biases Social factors System-internal patterns Acquisition biases … 52 Bias toward accurate transmission of lexical information Bias toward accurate transmission of lexical information Biases on phonetic form of word tokens

[Within-phoneme category variants] Selection for word-level contrast /Phoneme category evolution/

Lexical competition and hyperarticulation in natural speech 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Background/Previous Work Previous work done using lab speech Small effects, fragile results –VOT is slightly hyperarticulated for initial stops given minimal pairs in list reading (Baese & Goldrick 2009, Peramunage et al. 2011). –VOT hyperarticulated on first production of words with a visual stop-competitor in the context (Kirov & Wilson 2012) –In a lab-speech paradigm designed to elicit hyperarticulation away from a vowel competitor, tense/lax vowel duration differences increased, but not formant differences (Schertz 2015)

Work with vowels has focused on ND as the trigger for hyperarticulation, and dispersion as the outcome Dispersion = distance of a vowel in F1-F2 space from the center of the vowel space But: vowel change patterns suggest that competition-driven hyperarticulation should be more phonetically specific. –Correlation of minimal pair count with vowel shift patterns link competition to shifts –Vowel chain shifts often involve moves toward the center of the vowel space.

Dispersion as the outcome of competition makes the wrong prediction for vowel system change: Vowels can centralize in chain-shifts. American Northern Cities Shift

Ok, so how to approach this? 1.Use natural speech instead of lab speech 2.Compare minimal pair existence to neighborhood density as a predictor for hyperarticulation 3.Look at both VOT for stops and F1-F2 Euclidean distance for vowels –For vowels, compare F1-F2 distance to dispersion

Lexical competition and hyperarticulation in natural speech 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Use the Buckeye Corpus of Conversational SpeechBuckeye Corpus 40 one-hour sociolinguistic interviews gender and age balanced obtained in Columbus, Ohio in 2000 Densely annotated: –Phonemic transcription –Phonetic transcription –Syntactic category –Textfiles, phonefilesTextfilesphonefiles

VOT in word-initial stops Use a perlscript to identify appropriate material in the Buckeye Corpus –words starting with [ptkbdg] –content words –1, 2 syllables –no preceding/following utterance or disfluency boundary –no preceding word-final stop Measure closure and burst lengths

Stop length, burst and offset 62 A peaA bee burst length [p] [b]

VOT data creation Annotate stop beginning, burst and offset using Praat. –Get lots of undergraduate helpers for this…

Praat example

Get dependent measure Script that processes Praat textgrid textfile to obtain:Praat textgrid textfile –Stop length, burst length –Use burst/length ratio as a rate-normalized measure of VOT (Yao 2007)

Get independent factors of interest Minimal pair existence –Carnegie Mellon Pronouncing Dictionary Neighborhood density –Calculate independently –IPhODIPhOD

Lexical competition and hyperarticulation in natural speech 1.What is the question? 2.What is the approach? 3.Find data/create materials 4.Analysis/Results

Voiced Stops Voiceless Stops No MinPairs MinPairs No MinPairs MinPairs Burst/length ratio by minimal pair existence: Initial stops distinct in voice bat pat badge pant

Relationship of Neighborhood Density to Burst/length Ratio Lexical Neighbors Burst/length ratio

Factors in Linear Mixed Effects modeling Stop-voicing minimal pair competitor existence Neighborhood density Control factors: –local speech rate –word category –forward/backward bigram probabilities –word frequency –previous mention –syllable number –stop identity –following high (liquids, rhotics, high vowels)

Get control factors From Buckeye word files:word files –Word identity previous, target, and following word –Word category –Previous Mention –Speech rate syllables per second in local utterance

Get control factors From corpora, get forward/backward bigram probability –Google n-gram –Fisher English Training setFisher English Training set see Seyfarth 2014 for example

Get control factors From IPhOD:IPhOD –SubtlexUS-based word frequency –Neighborhood density –Positional two-segment probability averaged over the word (Vitevitch & Luce 2004)

Voiceless Stop Model

Voiced Stop Model

Can we find this effect in vowels? Measuring vowel-vowel distances

Vowel distances in initial syllables Identify material in the Buckeye corpus of Conversational English –words with an initial syllable non-back monophthong –content words –1 syllable –no preceding/following utterance or disfluency boundary –no words with ablaut in their paradigm e.g., no ‘sit’, because of ‘sat’.

Dataset construction For each word token, measure vowel distance to three neighboring vowels. Starting dataset has three measures per word token: Split randomly into three datasets with one measure per token. Randomly choose one dataset for statistical analysis. Minimal Pair existence

Measuring from [i] i ɪ e ɛ æ ʌ

Measuring from [ ɛ ] i ɪ e ɛ æ ʌ

Minimal pair exists Minimal pair does not exist  more distinctive 

Factors Vowel-vowel minimal pair competitor existence Neighborhood density Vowel-vowel minimal pair competitor existence in one of the other two neighboring vowels Control factors: –local speech rate –forward/backward bigram probabilities –word frequency –previous mention –vowel length –vowel-vowel pair identity

Measuring from [ ɛ ] i ɪ e ɛ æ ʌ

LME model model = lmer (EuclideanDistance ~ MinimalPair+ Neighborhood+ Alternative + VowelLength + Vowel_CompetitorVowel + (1+ MinimalPair+Neighborhood+Alternative+VowelLen gth|Speaker) + (1|Lemma), data = k, REML = F)

Model output

What about dispersion? Run the same kind of analysis using vowel-center distance. Factors that significantly predict dispersion: –Word Frequency –Vowel Length Neighborhood density and minimal pair competitor existence are not predictive.

Summary Phonetic cues that contribute strongly to distinguishing words tend to be hyperarticulated in natural speech –VOT in initial stops –F1-F2 distance in vowels Consistent with idea that phoneme contrast is maintained in part by a bias toward lexical contrast.  maintains an efficient set of phoneme contrasts over language change: phonemes that do not distinguish many words are vulnerable to loss.