Exemplar Theory April 10, 2014 Practicalities Project presentations/review session: Thursday, April 17 th, 1-2:30 pm Science A 147 Final exam: Thursday,

Slides:

Advertisements

Similar presentations

Module 14 Thought & Language.

Advertisements

09/01/10 Kuhl et al. (1992) Presentation Kuhl, P. K., Williams, K. A., Lacerda, F., Stevens, K. N., & Lindblom, B. (1992) Linguistic experience alters.

Plasticity, exemplars, and the perceptual equivalence of ‘defective’ and non-defective /r/ realisations Rachael-Anne Knight & Mark J. Jones.

Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)

The perception of dialect Julia Fischer-Weppler HS Speaker Characteristics Venice International University

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.

Language and Cognition Colombo, June 2011 Day 8 Aphasia: disorders of comprehension.

Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.

The Perception of Speech. Speech is for rapid communication Speech is composed of units of sound called phonemes –examples of phonemes: /ba/ in bat, /pa/

Ling 240: Language and Mind Acquisition of Phonology.

Speech perception 2 Perceptual organization of speech.

Development of Speech Perception. Issues in the development of speech perception Are the mechanisms peculiar to speech perception evident in young infants?

Psych 156A/ Ling 150: Acquisition of Language II Lecture 4 Sounds.

Chapter 7 Knowledge Terms: concept, categorization, prototype, typicality effect, object concepts, rule-governed, exemplars, hierarchical organization,

The Perception of Speech. Speech is for rapid communication Speech is composed of units of sound called phonemes –examples of phonemes: /ba/ in bat, /pa/

Module 14 Thought & Language.

Module 14 Thought & Language. INTRODUCTION Definitions –Cognitive approach method of studying how we process, store, and use information and how this.

CSD 2230 HUMAN COMMUNICATION DISORDERS Topic 2 Normal Communication Development and Communication Across the Lifespan.

Exam 1 Monday, Tuesday, Wednesday next week WebCT testing centre Covers everything up to and including hearing (i.e. this lecture)

Knowing Semantic memory.

PSY 369: Psycholinguistics

Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.

SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?

Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.

Sound and Speech. The vocal tract Figures from Graddol et al.

The Perception of Speech

CSD 2230 HUMAN COMMUNICATION DISORDERS

Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.

Psych 156A/ Ling 150: Psychology of Language Learning

Speech Perception. Phoneme - a basic unit of a speech sound that distinguishes one word from another Phonemes do not have meaning on their own but they.

Speech Synthesis April 14, 2009 Some Reminders Final Exam is next Monday: In this room (I am looking into changing the start time to 9 am.) I have a.

Speech Perception 4/6/00 Acoustic-Perceptual Invariance in Speech Perceptual Constancy or Perceptual Invariance: –Perpetual constancy is necessary, however,

The Motor Theory of Speech Perception April 1, 2013.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 5 Sounds III.

1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.

Jiwon Hwang Department of Linguistics, Stony Brook University Factors inducing cross-linguistic perception of illusory vowels BACKGROUND.

Chapter 8 Language & Thinking

Auditory Perception April 9, 2009 Auditory vs. Acoustic So far, we’ve seen two different auditory measures: 1.Mels (unit of perceived pitch) Auditory.

Speech Perception 4/4/00.

Words, Voices and Memories: the interaction of linguistic and indexical information in cross-language speech perception Steve Winters (in collaboration.

Acoustic Cues to Laryngeal Contrasts in Hindi Susan Jackson and Stephen Winters University of Calgary Acoustics Week in Canada October 14,

Grab Bag! (Palatography, Fricatives + Perception, part 2)

Sh s Children with CIs produce ‘s’ with a lower spectral peak than their peers with NH, but both groups of children produce ‘sh’ similarly [1]. This effect.

Speech Science IX How is articulation organized? Version WS

SPEECH PERCEPTION DAY 18 – OCT 9, 2013 Brain & Language LING NSCI Harry Howard Tulane University.

Epenthetic vowels in Japanese: a perceptual illusion? Emmanual Dupoux, et al (1999) By Carl O’Toole.

Pragmatically-guided perceptual learning Tanya Kraljic, Arty Samuel, Susan Brennan Adaptation Project mini-Conference, May 7, 2007.

Information Processing Assumptions Measuring the real-time stages General theory –structures –control processes Representation –definition –content vs.

Sensation & Perception

Exemplar Theory + Speech Perception March 30, 2010.

The long-term retention of fine- grained phonetic details: evidence from a second language voice identification training task Steve Winters CAA Presentation.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 3 Sounds II.

Sounds and speech perception Productivity of language Speech sounds Speech perception Integration of information.

1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.

Perception + Synthesis April 6, 2010 CP Results.

Grab Bag! (Hearing, Fricatives + Perception, part 2) April 12, 2011.

Human Abilities 2 How do people think? 1. Agenda Memory Cognitive Processes – Implications Recap 2.

Exemplar Theory, part 2 April 15, 2013.

Language Perception.

Motor Theory + Signal Detection Theory

Transitions + Perception March 25, 2010 Tidbits Mystery spectrogram #3 is now up and ready for review! Final project ideas.

Motor Theory of Perception March 29, 2012 Tidbits First: Guidelines for the final project report So far, I have two people who want to present their.

Signal Detection Theory March 25, 2010 Phonetics Fun, Ltd. Check it out:

Against formal phonology (Port and Leary).  Generative phonology assumes:  Units (phones) are discrete (not continuous, not variable)  Phonetic space.

Hearing + Perception, part 2 April 10, 2013 Hearing Aids et al. Generally speaking, a hearing aid is simply an amplifier. Old style: amplifies all frequencies.

Exemplar Dynamics VOT of stops varies in languages VOT of stops varies in languages So people learn language specific VOT So people learn language specific.

Chapter 9 Knowledge. Some Questions to Consider Why is it difficult to decide if a particular object belongs to a particular category, such as “chair,”

Module 14 Thought & Language.

Abstraction versus exemplars

Topic: Language perception

Presentation transcript:

Exemplar Theory April 10, 2014

Practicalities Project presentations/review session: Thursday, April 17 th, 1-2:30 pm Science A 147 Final exam: Thursday, April 24 th, 8-10 am Science A 147 Feel free to drop by Ling 341 on Monday, if you want to learn about hearing/audition: Science A 247, 1-2 pm There will be USRI evaluations at the end of class today!

And now for something completely different… Q: What’s a category? A classical answer: A category is defined by properties. All members of the category exhibit the same properties. No non-members of the category exhibit all of those properties.  The properties of any member of the category may be split into: Definitive properties Incidental properties

Classical Example A rectangle (in Euclidean geometry) may be defined as having the following properties: 1.Four-sided, two-dimensional figure (quadrilateral) 2.Four right angles This is a rectangle.

Classical Example Adding a third property gives the figure a different category classification: 1.Four-sided, two-dimensional figure (quadrilateral) 2.Four right angles 3. Four equally long sides This is a square.

Classical Example Altering other properties does not change the category classification: 1.Four-sided, two-dimensional figure (quadrilateral) 2.Four right angles 3. Four equally long sides This is still a square. A. Is red. definitive properties incidental property

Classical Linguistic Categories Formal phonology traditionally defined all possible speech sounds in terms of a limited number of properties, known as “distinctive features”. (Chomsky + Halle, 1968) [d] = [CORONAL, +voice, -continuant, -nasal, etc.] [n] = [CORONAL, +voice, -continuant, +nasal, etc.] … Similar approaches have been applied in syntactic analysis. (Chomsky, 1974) Adjectives = [+N, +V] Prepositions = [-N, -V]

Prototypes The psychological reality of classical categories was called into question by a series of studies conducted by Eleanor Rosch in the 1970s. Rosch claimed that categories were organized around privileged category members, known as prototypes. (instead of being defined by properties) Evidence for this theory initially came from linguistic tasks: 1.Semantic verification (Rosch, 1975) Is a robin a bird? Is a penguin a bird? 2.Category member naming.

Prototype Category Example: “Bird”

Exemplar Categories Cognitive psychologists in the late ‘70s (e.g., Medin & Schaffer, 1978) questioned the need for prototypes. Phenomena explained by prototype theory could be explained without recourse to a category prototype. The basic idea: Categories are defined by extension.  Neither prototypes nor properties are necessary. Categorization works by comparing new tokens to all exemplars in memory. Generalization happens on the fly.

A Category, Exemplar-style “square”

Back to Perception When people used to talk about categorical perception, they meant perception of classical categories. A stop is either a [b] or a [g] (no in between) Remember: in classical categories, there are: definitive properties incidental properties Q: What are the properties that define a stop category? The definitive properties must be invariant. (shared by all category members) So…what are the invariant properties of stop categories?

The Acoustic Hypothesis People have looked long and hard for invariant acoustic properties of stops, with little success. (and some people are still looking) Frequency values of compact (synthetic) bursts cueing different places of articulation, in various vowel contexts. (Liberman et al., 1952)

Theoretical Revision Since invariant acoustic properties could not be found (especially for velars)… It was assumed that listeners perceived (articulatory) gestures, not (acoustic) sounds. Q: What invariant articulatory properties define stop categories? A: If they exist, they’re hard to find. Motor Theory Revision #2: Listeners perceive “intended” gestures. Note: “intentions” are kind of impossible to observe. But they must be invariant…right?

Another Brick in the Wall Another problem for motor theory: Perception of speech sounds isn’t always categorical. In particular: vowels are perceived in a more gradient fashion than stops. However, vowel perception becomes more categorical when the vowels are extremely short.

It’s also hard to identify any invariant acoustic properties for vowels. Variation is rampant across: tokens speakers genders dialects age groups, etc. Variability = a huge problem for speech perception.

More Problems Also: infants exhibit categorical perception, too… Even though they don’t know category labels. Chinchillas can do it, too!

An Alternative It has been proposed that phoneme categories are defined by prototypes… which we use to identify vowels in speech. One relevant finding: the perceptual magnet effect. Part 1: play listeners a continuum of synthetic vowels in the neighborhood of [i]. Task: judge how much each one sounds like [i]. Some are better = prototypical Others are worse = non-prototypes

Perceptual Magnets Part 2: define either a prototype or a non- prototype as a category center. Task: determine whether other vowels on the continuum belong to those categories. Result: more same responses when the category center is a prototype. Prototype = a “perceptual magnet” Same? Different?

Prototypes, continued The perceptual magnet prototypes are usually located at a listener’s average F1 and F2 values for [i]. 4-month olds exhibit the perceptual magnet effect… but monkeys do not. Note: the prototype is the only thing that has to be “invariant” about the category. particular properties aren’t important. Testing a prototype model on the Peterson & Barney data yielded 51% correct classification. (Human listeners got 94% correct)  Variability is still hard to deal with.

Flipping the Script Another approach to speech perception is to preserve all variability that we hear… Rather than boiling it down to properties or prototypes. In this model, speech categories are defined by extension. = consist of exemplars So, your mental representaton of /b/ consists of every token of /b/ you’ve ever heard in your life. …rather than any particular acoustic or articulatory properties. Analogy: phonetics field project notes (your mind is a pack rat)

Exemplar Categorization 1.Stored memories of speech experiences are known as traces. Each trace is linked to a category label. 2.Incoming speech tokens are known as probes. 3.A probe activates the traces it is similar to. Note: amount of activation is proportional to similarity between trace and probe. Traces that closely match a probe are activated a lot; Traces that have no similarity to a probe are not activated much at all.

A (pretend) example: traces = vowels from the Peterson & Barney data set. * probe Activation of each trace is proportional to distance (in vowel space) from the probe. highly activated traces low activation

Echoes from the Past The combined average of activations from exemplars in memory is summed to create an echo of the perceptual system. This echo is more general features than either the traces or the probe. Inspiration: Francis Galton 

Exemplar Categorization II For each category label… The activations of the traces linked to it are summed up. The category with the most total activation wins. Note: we use all exemplars in memory to help us categorize new tokens. Also: any single trace can be linked to different kinds of category labels. Test: Peterson & Barney vowel data Exemplar model classified 81% of vowels correctly.

Exemplar Predictions Point: all properties of all exemplars play a role in categorization… Not just the “definitive” ones. Prediction: non-invariant properties of speech categories should have an effect on speech perception. E.g., the voice in which a [b] is spoken. Or even the room in which a [b] is spoken. Is this true? Let’s find out…

Another Experiment! Circle whether each word is a new or old word in the list

Another Experiment! Circle whether each word is a new or old word in the list

Continuous Word Recognition In a “continuous word recognition” task, listeners hear a long sequence of words… some of which are new words in the list, and some of which are repeats. Task: decide whether each word is new or a repeat. Twist: some repeats are presented in a new voice; others are presented in the old (same) voice. Finding: repetitions are identified more quickly and more accurately when they’re presented in the old voice. (Palmeri et al., 1993) Implication: we store voice + word info together in memory.

More Interactions Another task (Nygaard et al., 1994): train listeners to identify talkers by their voices. Then test the listeners’ ability to recognize words spoken in noise by: the talkers they learned to recognize talkers they don’t know Result: word recognition scores are much better for familiar talkers. Implication: voice properties influence word recognition. The opposite is also true: Talker identification is easier in a language you know.

Variability in Learning Caveat: it’s best not to let the relationship between words and voices get too close in learning. Ex: training Japanese listeners to discriminate between /r/ and /l/. Discrimination training on one voice: no improvement. (Strange and Dittman, 1984) Bradlow et al. (1997) tried: training on five different voices multiple phonological contexts (onset, coda, clusters) 4 weeks of training (with monetary rewards!) Result: improvement!

Variability in Learning General pattern: Lots of variability in training  better classification of novel tokens… Even though it slows down improvement in training itself. Variability in training also helps perception of synthetic speech. (Greenspan et al., 1988) Another interesting effect: dialect identification (Clopper, 2004) Bradlow et al. (1997) also found that perception training (passively) improved production skills…

Perception  Production Japanese listeners performed an /r/ - /l/ discrimination task. Important: listeners were told nothing about how to produce the /r/ - /l/ contrast …but, through perception training, their productions got better anyway.

Exemplars in Production Goldinger (1998): “shadowing” task. Task 1--Production: A: Listeners read a word (out loud) from a script. B: Listeners hear a word (X), then repeat it. Finding: formant values and durations of (B) productions match the original (X) more closely than (A) productions. Task 2--Perception: AXB task A different group of listeners judges whether X (the original) sounds more like A or B. Result: B productions are perceptually more similar to the originals.

Shadowing: Interpretation Some interesting complications: Repetition is more prominent for low frequency words… And also after shorter delays. Interpretation: The “probe” activates similar traces, which get combined into an echo. Shadowing imitation is a reflection of the echo. Probe-based activation decays quickly. And also has more of an influence over smaller exemplar sets.

Synthetic Speech Perception In the early days, speech scientists thought that synthetic speech would lead to a form of “super speech” = ideal speech, without any of the extraneous noise of natural productions. However, natural speech is always more intelligible than synthetic speech. And more natural sounding! But: perceptual learning is possible. Requires lots and lots of practice. And lots of variability. (words, phonemes, contexts) An extreme example: blind listeners.

More Perceptual Findings 1.Reducing the number of possible messages dramatically increases intelligibility.

More Perceptual Findings 2.Formant synthesis produces better vowels; Concatenative synthesis produces better consonants (and transitions) 3. Synthetic speech perception uses up more mental resources. memory and recall of number lists 4.Synthetic speech perception is a lot easier for native speakers of a language. And also adults. 5.Older listeners prefer slower rates of speech.

Audio-Visual Speech Synthesis The synthesis of audio-visual speech has primarily been spearheaded by Dominic Massaro, at UC-Santa Cruz. “Baldi” Basic findings: Synthetic visuals can induce the McGurk effect. Synthetic visuals improve perception of speech in noise …but not as well as natural visuals. Check out some samples.

Moral of the Story Remember--categorical perception was initially used to justify the claim that listeners converted a continuous signal into a discrete linguistic representation. In reality, listeners don’t just discard all the continuous information. (especially for sounds like vowels) Perceptual categories have to be more richly detailed than the classical categories found in phonology textbooks. (We need the details in order to deal with variability.)