Voice source characteristics in speaker segregation Patti Adank.


Some speaker-related characteristics have been found to be helpful:
- Darwin et al. (2003): F0 (pitch) and vocal tract length (VTL) differences between concurrent speakers help listeners attend to the target speaker
Aim of the project: to establish whether voice source characteristics of speakers can be useful to listeners when attending to a target speaker in a multi-speaker situation

Speaker-related differences that might aid listeners:
- style of speech
- voice quality: creaky voice, roughness, breathiness
My experiments:
- establish the possible relevance of an acoustic aspect of creaky voice: jitter
Speaker-related differences that aid listeners (Darwin et al., 2003):
- F0 difference (if > 2 semitones)
- vocal tract length (VTL) difference (if ratio > 1.08)
- effects of F0 and VTL are superadditive

Pitch: periodicity of the voice source

Jitter: aperiodicity of the voice source
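To make the idea of aperiodicity concrete, here is a minimal sketch of one common way to quantify jitter: the mean absolute difference between consecutive glottal periods, divided by the mean period (often called local jitter). The slides do not say which jitter measure was used, so the measure, the function name `local_jitter`, and the example period values are all assumptions made for illustration.

```python
def local_jitter(periods):
    """Relative cycle-to-cycle period variation.

    Assumed measure (not stated in the slides): mean absolute difference
    between consecutive periods, divided by the mean period.
    """
    if len(periods) < 2:
        raise ValueError("need at least two periods")
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return mean_diff / mean_period


# Hypothetical period sequences (seconds) around a 100 Hz voice source.
perfectly_periodic = [0.010] * 20        # every cycle lasts exactly 10 ms
alternating = [0.0099, 0.0101] * 10      # cycles alternate by +/-1%

jitter_periodic = local_jitter(perfectly_periodic)
jitter_alternating = local_jitter(alternating)
```

A perfectly periodic source gives a value of zero under this measure, and the value grows as consecutive cycles differ more in length.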

Literature:
- McAdams (1989): natural jitter present in a speaker's voice may be helpful for listeners
- Ellis (1993): a computational model can segregate simultaneously presented vowels using jitter differences alone

How could jitter help listeners?
Auditory Scene Analysis (Bregman, 1990):
- primitive segregation cues: bottom-up, involuntary listening
- schema-driven segregation cues: top-down, voluntary/effortful listening

Pitch = primitive segregation cue (Scheffers, 1983; Assmann & Summerfield, 1990; etc.) + schema-driven segregation cue (Darwin et al., 2003)

Hypotheses:
0. jitter does not aid the auditory system
1. jitter is only a primitive segregation cue
2. jitter is a primitive cue AND a schema-driven cue
3. jitter is only a schema-driven segregation cue

Experiments:
1. one double-vowel experiment with pitch as the experimental factor -> to replicate earlier results for pitch as a primitive cue
2. one double-vowel experiment with jitter as the experimental factor -> to establish if jitter is a primitive cue
3. an experiment like Darwin et al., with pitch and jitter as factors -> to establish if jitter is a schema-driven cue

Experiment 1:
- Double-vowel experiment to test the pitch effect
- Synthetic vowels (Klatt 1990): AH, EE, ER, OO, OR, 200 milliseconds each
- Five versions of each vowel: 100 Hz, +1/4 semitone (st), +1/2 st, +1 st, +2 st
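The five F0 versions listed for Experiment 1 can be worked out from the 100 Hz base, as in the sketch below. It assumes the usual equal-tempered definition of a semitone (a frequency ratio of 2^(1/12)); the names `BASE_F0_HZ`, `SEMITONE_STEPS`, and `shift_by_semitones` are illustrative and do not come from the slides.

```python
BASE_F0_HZ = 100.0
# Steps above the base F0, in semitones, as listed on the slide.
SEMITONE_STEPS = [0.0, 0.25, 0.5, 1.0, 2.0]


def shift_by_semitones(f0_hz, semitones):
    """Return f0_hz raised by the given number of equal-tempered semitones."""
    return f0_hz * 2.0 ** (semitones / 12.0)


f0_versions = [shift_by_semitones(BASE_F0_HZ, st) for st in SEMITONE_STEPS]

for st, f0 in zip(SEMITONE_STEPS, f0_versions):
    print(f"+{st:g} st: {f0:.2f} Hz")
```

Under this assumption the largest step, +2 st, corresponds to an F0 of roughly 112 Hz, so all five versions lie within about 12 Hz of each other.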

Experiment 2:
- Double-vowel experiment to test the jitter effect
- Synthetic vowels (Klatt 1990, altered version): AH, EE, ER, OO, OR, 200 milliseconds each
- Five versions of each vowel: 100 Hz with no jitter, +/-1%, +/-2%, +/-4%, +/-8%
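The slides do not describe how jitter was imposed on the synthetic vowels. As a rough illustration only, the sketch below perturbs each glottal period of a 100 Hz, 200 ms source by a random amount drawn uniformly within +/-p% of the nominal period. The uniform distribution, the fixed seed, and the function name `jittered_periods` are assumptions, not the author's method.

```python
import random


def jittered_periods(f0_hz, duration_s, jitter_fraction, seed=0):
    """Return a list of period durations (seconds) with random jitter.

    Assumption: each period is the nominal period scaled by a factor drawn
    uniformly from [1 - jitter_fraction, 1 + jitter_fraction].
    """
    rng = random.Random(seed)            # seeded so the sketch is repeatable
    nominal = 1.0 / f0_hz
    n_periods = round(duration_s * f0_hz)
    return [
        nominal * (1.0 + rng.uniform(-jitter_fraction, jitter_fraction))
        for _ in range(n_periods)
    ]


# The five jitter levels from the slide, expressed as fractions.
JITTER_LEVELS = [0.0, 0.01, 0.02, 0.04, 0.08]
versions = {level: jittered_periods(100.0, 0.2, level) for level in JITTER_LEVELS}
```

With these settings each version contains 20 periods, and the zero-jitter version is strictly periodic.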

Procedure (1 & 2):
- 7 listeners (5 British-English, 2 bilingual)
- categorization pre-test (45 stimuli)
- experiment 1 (or 2): presentation of a double vowel (125 combinations); select one of 15 options
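The slide gives the counts (125 combinations, 15 options) without saying how they arise. One reading that matches both numbers, offered here purely as an assumption, is that a stimulus is a first vowel, a second vowel, and one of five pitch or jitter versions (5 x 5 x 5 = 125), and that a response is an unordered pair of vowels, identical pairs included (15).

```python
from itertools import combinations_with_replacement, product

VOWELS = ["AH", "EE", "ER", "OO", "OR"]
N_VERSIONS = 5  # five pitch (or jitter) versions per vowel

# Assumed stimulus design: (first vowel, second vowel, version index).
stimuli = list(product(VOWELS, VOWELS, range(N_VERSIONS)))

# Assumed response set: unordered vowel pairs, including identical pairs.
response_options = list(combinations_with_replacement(VOWELS, 2))

print(len(stimuli), "stimuli;", len(response_options), "response options")
```

Other designs could produce the same totals, so this enumeration should be read as a consistency check on the counts rather than a description of the actual stimulus set.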

Results pitch

Results jitter

Hypotheses:
0. jitter does not aid the auditory system
1. jitter is only a primitive segregation cue
2. jitter is a primitive cue AND a schema-driven cue
3. jitter is only a schema-driven segregation cue
4. jitter is a primitive segregation cue if there is also a pitch difference

Results jitter & pitch

Is there still hope for jitter?
Next experiment: test if jitter is a schema-driven cue
Setup as in Darwin et al.:
- 2 sentences from the same speaker presented simultaneously
- attend to the target sentence
- report on target words
- vary jitter and pitch of the sentences