Getting at variation with ultrasound: Scottish and Dutch /r/ Ultrafest 3 University of Arizona at Tucson 14-16 April 2005 James M Scobbie (QMUC) Koen Sebregts.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Non-normative preaspiration of voiceless fricatives in Scottish English a comparison with Swedish preaspiration Olga Gordeeva and James M.Scobbie, Queen.
Tom Lentz (slides Ivana Brasileiro)
James M Scobbie CASL Research Centre LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners! With special thanks to collaborators.
James M Scobbie CASL Research Centre LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners! With special thanks to collaborators.
Evaluating health informatics projects Reasons for and problems of evaluation Objective model Subjective model.
James M Scobbie CASL Research Centre LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners! With special thanks to collaborators.
Sounds that “move” Diphthongs, glides and liquids.
SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.
Covert articulation of Scottish English /r/ now you see and hear it… now you don’t MFM Manchester James M Scobbie Speech Science Research Centre,
Ultrasound-based tongue root imaging and measurement James M Scobbie QMU With thanks to collaborators Jane Stuart-Smith, Marianne Pouplier, Alan Wrench,
Plasticity, exemplars, and the perceptual equivalence of ‘defective’ and non-defective /r/ realisations Rachael-Anne Knight & Mark J. Jones.
Philip Harrison J P French Associates & Department of Language & Linguistic Science, York University IAFPA 2006 Annual Conference Göteborg, Sweden Variability.
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction Greater than vowels – P oral slightly greater than P atmos Less than fricatives – P oral.
Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)
Identification of prosodic near- minimal Pairs in Spontaneous Speech Keesha Joseph Howard University Center for Spoken Language Understanding (CSLU) Oregon.
Final Year Project Progress January 2007 By Daire O’Neill 4EE.
EE2F2 - Music Technology 9. Additive Synthesis & Digital Techniques.
Collection of speech production ultrasound data Donald Derrick 12, Romain Fiasson 2 and Catherine T. Best 1 1 University of Western Sydney (MARCS Institute)
Acoustic Analysis and Electropalatography (EPG) in the Assessment and Treatment of Children with Intractable Speech Disorders Sara E. Wood James M. Scobbie.
Materials The corpus of 460 sentences provides examples of /r/ in many prosodic, sentential, lexical and segmental contexts, but there are no repetitions.
Evaluation Adam Bodnar CPSC 533C Monday, April 5, 2004.
Ultrafest 3 U. Arizona, April 2005 Measuring the tongue root: Image dropoff, rotation issues, and the siren call of intrinsic F0 D. H. Whalen Haskins Laboratories.
Midterm Review Evaluation & Research Concepts Proposals & Research Design Measurement Sampling Survey methods.
1 The phonetics of speech errors Frisch, S. A. University of South Florida This work supported by NIH-NIDCD R
1 COE Rubrics Assessment Process Dr. Marwan Abu-Amara April 5 th, 2008.
The trough effect: Can we predict tongue lowering from acoustic data alone? Yolanda Vazquez Alvarez.
Phonetics, day 2 Oct 3, 2008 Phonetics 1.Experimental a. production b. perception 2. Surveys/Interviews.
UltraFest III, University of Arizona 4/16/05 A study of pre-liquid excrescent schwa in English Adam Baker, Diana Archangeli, Jeff Mielke University of.
PHONOLOGICAL ANALYSIS ABSTRACT Substitution is a common phenomenon when a non-English speaker speaks English with foreign accent. By using spectrographic.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. A process for teaching the receptive and productive sides of pronunciation.
Conclusions  Constriction Type does influence AV speech perception when it is visibly distinct Constriction is more effective than Articulator in this.
Phonetics HSSP Week 5.
Abstract Research Questions The present study compared articulatory patterns in production of dental stop [t] with conventional dentures to productions.
Reviewing EAP instruction research: What learning transfers, and how far? Mark Andrew James Arizona State University
Results of the WMO Laboratory Intercomparison of rainfall intensity gauges Luca G. Lanza University of Genoa WMO (Project Leader) DIAM UNIGE September.
Nasal endings of Taiwan Mandarin: Production, perception, and linguistic change Student : Shu-Ping Huang ID No. : NA3C0004 Professor : Dr. Chung Chienjer.
A review of research on English-for- academic-purposes instruction: What learning transfers, and how far? Mark Andrew James Arizona State University
An investigation of postvocalic /r/ in Glaswegian adolescents Jane Stuart-Smith and Robert Lawson Department of English Language, University of Glasgow.
1 Beetle xtalk measurements with test pulse at NIKHEF, B1.4 and B1.5 Aras Papadelis.
Working memory research Group member : 唐牧辰 武天翊 侯晓林 赵诣 赵阳.
Clinical Assessment of Articulation and Phonology
Ch 3 Slide 1 Is there a connection between phonemes and speakers’ perception of phonetic differences? (audibility of fine distinctions) Due to phonology,
By Luisa and Noriko. Description of class  Language Proficiency level: low intermediate  Class size: 12 students  Age: 15 to 25  Native Language background:
Speech Science IX How is articulation organized? Version WS
Introduction ESDS Qualidata John Southall ESDS Creating and delivering re-usable qualitative data 24 June 2004.
From subtle to gross variation: an Ultrasound Tongue Imaging study of Dutch and Scottish English /r/ James M Scobbie Koen Sebregts Jane Stuart-Smith.
#3191, 14 Oct 2012 Cabling installed to allow fast BPM electronics on injector BPMs System is flexible enough to allow different INJ-BPMs to be used (not.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Conditional Random Fields for ASR Jeremy Morris July 25, 2006.
Combining Speech Attributes for Speech Recognition Jeremy Morris November 9, 2006.
Comparing the effectiveness of orthographic and phonological cues in the treatment of anomia. Lyndsey Nickels 1, Antje Lorenz 1,2, 1 Macquarie Centre for.
Katherine Morrow, Sarah Williams, and Chang Liu Department of Communication Sciences and Disorders The University of Texas at Austin, Austin, TX
School of Computer Science 1 Information Extraction with HMM Structures Learned by Stochastic Optimization Dayne Freitag and Andrew McCallum Presented.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Stop + Approximant Acoustics
[  ] from [  ] James M Scobbie 2 nd Ultrasound Workshop UBC Vancouver April 2004 lip or lingual vs. lip & lingual.
Rhotic Vowels 5. The Special Case of Vocalic R This is the vowel in words like “bird,” “learn,” “nerd,” “sir” Symbol: [ ɚ ] (schwar) or [ ɝ ] MacKay.
Bridging the gap between L2 speech perception research and phonological theory Paola Escudero & Paul Boersma (March 2002) Presented by Paola Escudero.
Helpful hints for planning your Wednesday investigation.
Machine Learning in Practice Lecture 9 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Introduction Method Experiment 2 In spoken word recognition, phonological and indexical properties (i.e., characteristics of the speaker’s voice) of a.
1 Probing the Big Bang with ultrasound: Retraction of /s/ in English Adam Baker, Jeff Mielke, Diana Archangeli University of Arizona Supported by James.
an Introduction to English
Midterm Review (closed book)
English Phonetics and Phonology
Speech Perception (acoustic cues)
/r/ Place: palatal Articulatory phonetics Acoustics
Presentation transcript:

Getting at variation with ultrasound: Scottish and Dutch /r/ Ultrafest 3 University of Arizona at Tucson April 2005 James M Scobbie (QMUC) Koen Sebregts (Utrecht) and thanks also esp. to Alan Wrench and Yolanda Vasquez Alvarez

Why /r/? Why ultrasound? Previous articulatory studies of /r/ –There is a lot of variation Variation, change, acquisition, phonetics –“labiodentalisation” of /r/ in Anglo English –“vocalisation” of final /r/ in Scottish English –“retroflexion” of final /r/ in Dutch Phonology – what *is* an /r/ anyway? Ultrasound –Relatively informal –Can provide dynamic images

Can we add to MRI? Tiede’s beautiful images (from ICPhS 03)

Can we add to MRI? And perhaps what we can expect…

Can we add to MRI? And perhaps what we can expect…

Can we add to MRI? Tiede’s beautiful images (from ICPhS 03)

Problems with ultrasound The usual –Incomplete images, no passive articulator –Head-probe correction or control –Synchronisation and low frame rate –Splines and edges –Stats plus… –European video output (PAL) is at 25Hz (albeit with more pixels, esp in raw AVI) –Fieldwork taped data is slower to prepare

Data collection – the laboratory Good –Helmet (or other head games) –Less environmental noise –The experimenter is in control –Choice of equipment and software - demodemo but… –Willing and normal subjects have to be found –Experiment costs –Intimidation of subjects

Data collection – the “field” In this case – Glasgow Science Centre Good –Lots of varied and willing subjects –Outreach… and publicity (!) –Vernacular speech more elicitable –Qualitative articulatory transcription - demodemo but… –Things are a little out of control (cf feedback) –Small amount of time for each subject –Non-ideal equipment and methodology

What do we want for /r/? Varied vowel environments Varied word and syllabic positions Acoustic analysis (and other channels?) Info on multiple articulators Stratified pool of subjects Normal vernacular speech Dynamics Synchronised data

Backstep: methodological issues Smearing of raw data due to scan rate Creating of video output - downsampling and synchronisation problems

Matrix of raw data generated at the scan rate of the ultrasound machine e.g 72 complete scans per second Matrix of raw data Image processing Buffer of image data Video scanning Buffer of image data Smooth greyscale image buffer i.e. video output What’s the delay?

Methodological issues There is always a delay –Our range was 20ms to 100ms (mean 40ms) –Does our 25Hz rate make it clearer or worse? Practicalities –Multichannel synchonisation, even video, even acoustics, is based on unpredicatable delay whether via camcorder or direct –Individual video frames can be arbitrarily split in addition to overlay and interlacing –High ultrasound sample rate alone is not enough

Scottish background Field work in Glasgow Science Centre Stuart-Smith et al discover heavily retracted coda /r/ in young (teenage) vernacular Scots –Not “vocalisation” like middle class Anglification –Strong breaking (transitioning) with limited time at target? –Different target? –Limited evidence of mergers (yet)

Scottish pilot Methodological –Probability of numerous subjects (hundreds) –All age groups, wide spectrum of social mix –Handheld probe plus mike mix to tape –Eyeball qualitative analysis is highly feasible –Need lab-based follow-up for quantitative research Descriptive so far (very sketchy!) –Clear and obvious cases of pharyngeal /r/ –No meta-linguistic awareness of change

Scottish hypotheses & implications Strong breaking in coda is a strong pharyngeal or tongue dorsal gesture Some speakers have lost any obvious anterior gesture in coda Anterior gestures, if present, include retroflex and bunched types of /r/ This is socially stratified –There should be intra-speaker variation too –How categorical are these variations? We need representative dynamic data

Dutch background Large sociolinguistic, phonetic and phonological survey of Dutch (van Hout, Zonneveld and Van der Velde) –400 subjects in multiple locations in Netherlands Some speakers have uvular trill/fricative /r/ in onset, and an anterior approximant in codas –What is going on articulatorily? –What is the inter-speaker sub-variation like?

Dutch study Subjects –10 – all in Edinburgh –Post-screened down to 4 anterior /r/ users Materials and protocol –Picture naming (n < 30) –Real words, near minimal pairs: /ir/ /ur/ /ar/ –Singleton /r/ and cluster /r/ and /r/-less –3 reps Feedback –Unaware of focus on /r/

Dutch study Target – it’s a multi gestural thing –How do we choose the right frame for /r/? –Using acoustics needs good synchronisation –Using the images themselves is circular –Intergestural timing at 25Hz? Dynamics –How to characterise? Acoustics –Same sort of questions – target & dynamics –Final devoicing…

Boer MS – point is red at end of voice – tip down

Boer 1 RB

Boer 2 RB

Boer 3 RB

Mier 1 RB

Mier 2 RB

Mier 3 RB

Schaar 1 RB

Schaar 2 RB

Schaar 3 RB

Boer 2 RB

boer 2 vdl

boer 2 VDB

boer 2 MS

So far Visual inspection of raw images or dynamic spline diagrams –Two retroflexers (RB VDL) –Two bunchers (MS VDB)

Next Acoustic identification of an r phase –Midpoint spline can be extracted –No discrimination of voiced or voiceless –Identification of single “max rhotic” is similar Acoustic analysis –Steady-state V + transition + r-phase –Location of end of voicing –F2 & F3 of voiced (usually) r target if obvious

MS bunched mainly voiced

VDB bunched mainly voiced

RB retroflex mainly voiceless

VDL… retroflex-ish but nearly vocalised?

VDL… real non-rhotic vowels for comparison boer vs. koe mier vs. riem schaar vs. sla

Dutch summary 4 speakers with anterior coda /r/ –Two retroflex –Two bunch One of the retroflex speakers is gradiently vocalising –Timing the /r/ late before pause –But some long domain cues in vowel quality and consonant variation –RB has strong transitions too rather than steady state but ?? sounds more rhotic Interesting to look at following C…

Overall summary Dutch anterior /r/ has various flavours –Retroflex –Non-retroflex –Weakened and late pre-pausal gesture Glasgow /r/ comes in various flavours –Pharyngeal approximant observed –Breaking taking over from “rhoticity” Strong phonetic effects –Vulnerable non-standard speech –Socially-stratified qualitative UTI is T.o.C.

Conclusions, implications Weakening of final /r/ is flexible –Multiple articulators provide lots of options –Strength and timing are affected –Effects can be gradient and/or categorical –Acoustic effects appear complex –Phonological contrasts need not be affected Small (?) sub-phonemic effects, change, and variation go hand-in-hand Fine detail and structured variation are in the grammar

Methodology – field solution Splines and edges lose a lot of info –Relative to MRI X-ray etc –Relative to ultrasound images Qualitative analysis of images usefully augments impressionistic transcription This can be done live and/or from tapes made in the field Small amounts of data from large numbers of subjects is standard practice

Methodology: lab solution #1 Dedicated hardware + software in the lab Matrix of raw data time stamped at headline sample rate (100Hz) Image processing Audio etc Time stamped PC buffer & files High speed, synchronised, clean images Integrated analysis (spline fit, export etc) Start

Methodology: lab solution #2 Multichannel backbone (incl 200Hz EPG) Funding application submitted –100Hz synchronised machine (+ EPG) –Multichannel developments (EMA, VICON) Helmet improvements Also, funding application submitted for ultrafest 5 (2007)

Holiday report Excellent!

Demo of lab data collection Slide 88 AA