4. RHYTHM, PROSODY, TONE, LANGUAGE MUSIC 318 MINI-COURSE ON SPEECH AND SINGING Science of Sound, Chapter 16 Springer Handbook of Acoustics, Chapter 16.

Slides:



Advertisements
Similar presentations
Fullerton College Skills Center Better Accent Tutor (BAT) How to Access and use BAT to improve your pronunciation.
Advertisements

Acoustic/Prosodic Features
Vowel Formants in a Spectogram Nural Akbayir, Kim Brodziak, Sabuha Erdogan.
Physical modeling of speech XV Pacific Voice Conference PVSF-PIXAR Brad Story Dept. of Speech, Language and Hearing Sciences University of Arizona.
American English Speech Patterns
Your Vocal Instrument.
ACOUSTICS OF SPEECH AND SINGING MUSICAL ACOUSTICS Science of Sound, Chapters 15, 17 P. Denes & E. Pinson, The Speech Chain (1963, 1993) J. Sundberg, The.
“Speech and the Hearing-Impaired Child: Theory and Practice” Ch. 13 Vowels and Diphthongs –Vowels are formed when sound produced at the glottal source.
PHONETICS AND PHONOLOGY
General Problems  Foreign language speakers of a target language cause a great difficulty to native speakers because the sounds they produce seems very.
Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.
The Human Voice. I. Speech production 1. The vocal organs
ACOUSTICAL THEORY OF SPEECH PRODUCTION
The Human Voice Chapters 15 and 17. Main Vocal Organs Lungs Reservoir and energy source Larynx Vocal folds Cavities: pharynx, nasal, oral Air exits through.
Chapter two speech sounds
PH 105 Dr. Cecilia Vogel Lecture 14. OUTLINE  consonants  vowels  vocal folds as sound source  formants  speech spectrograms  singing.
Eva Björkner Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing HUT, Helsinki, Finland KTH – Royal Institute of Technology.
L 17 The Human Voice. The Vocal Tract epiglottis.
Vowel Acoustics, part 2 November 14, 2012 The Master Plan Acoustics Homeworks are due! Today: Source/Filter Theory On Friday: Transcription of Quantity/More.
2. ARTICULATION AND FORMANTS
ACOUSTICS OF SINGING MUSICAL ACOUSTICS Science of Sound, Chapters 15, 17.
STUDY OF ENGLISH STRESS AND INTONATION
Phonetics HSSP Week 5.
Phonetics and Phonology
MUSIC 318 MINI-COURSE ON SPEECH AND SINGING
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Acoustic phonetics Jan. 27.
Speech Science Fall 2009 Oct 28, Outline Acoustical characteristics of Nasal Speech Sounds Stop Consonants Fricatives Affricates.
Eva Björkner Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing HUT, Helsinki, Finland KTH – Royal Institute of Technology.
Say “blink” For each segment (phoneme) write a script using terms of the basic articulators that will say “blink.” Consider breathing, voicing, and controlling.
SPEECH PRODUCTION,RECOGNITION, ANALYSIS, AND SYNTHESIS
Speech Science VI Resonances WS Resonances Reading: Borden, Harris & Raphael, p Kentp Pompino-Marschallp Reetzp
Physics 1251 The Science and Technology of Musical Sound
Tone, Accent and Quantity October 19, 2015 Thanks to Chilin Shih for making some of these lecture materials available.
Phonetics, part III: Suprasegmentals October 19, 2012.
AUDITORY TRANSDUCTION SEPT 4, 2015 – DAY 6 Brain & Language LING NSCI Fall 2015.
Stop Acoustics and Glides December 2, 2013 Where Do We Go From Here? The Final Exam has been scheduled! Wednesday, December 18 th 8-10 am (!) Kinesiology.
Speech Perception.
Stop + Approximant Acoustics
Intonation Lecture 11.
Vowels, part 4 November 16, 2015 Just So You Know Today: Vowel remnants + Source-Filter Theory For Wednesday: vowel transcription! Turkish and British.
P105 Lecture #27 visuals 20 March 2013.
Suprasegmental Properties of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Acoustic Phonetics 3/14/00.
EXPRESS YOURSELF. NEUTRAL ACCENT Neutral accent is a way of speaking a language without regionalism. Accent means variation in pronunciation and it should.
Phonetics, part III: Suprasegmentals October 18, 2010.
Speech in the DHH Classroom A new perspective. Speech in the DHH Bilingual Classroom Important to look beyond the traditional view of speech Think of.
Stringing words together.  Connected speech is spoken language that is used in a continuous sequence, as in normal conversations. Also called connected.
SPEECH PRODUCTION,RECOGNITION, ANALYSIS, AND SYNTHESIS
Suprasegmental features and Prosody Lect 6A&B LING1005/6105.
11 How we organize the sounds of speech 12 How we use tone of voice 2009 년 1 학기 담당교수 : 홍우평 언어커뮤니케이션의 기 초.
HOW WE TRANSMIT SOUNDS? Media and communication 김경은 김다솜 고우.
INTONATION And IT’S FUNCTIONS
Chapter 3: The Speech Process
L 17 The Human Voice.
Patterns of Stress and Pronunciation
Phonetics Phonetics is the study of sounds. To understand the mechanics of human languages one has to understand the physiology of the human body. Letters.
The Human Voice. 1. The vocal organs
Prosody and Non- Verbal Communication
(2) Suprasegmentals The features such as pitch, stress, and length, which are used simultaneously with units larger than segments, are called “suprasegmentals.”
The Human Voice. 1. The vocal organs
Kuiper and Allan Chapter 6.2
Vowel Formants 1.
What is Phonetics? Short answer: The study of speech sounds in all their aspects. Phonetics is about describing speech. (Note: phonetics ¹ phonics) Phonetic.
Voice Why is the voice of the actor important?
Speech Perception.
Kuiper and Allan Chapter 6.2
2. ARTICULATION AND FORMANTS
Evolution of human vocal production
Speech Perception (acoustic cues)
Presentation transcript:

4. RHYTHM, PROSODY, TONE, LANGUAGE MUSIC 318 MINI-COURSE ON SPEECH AND SINGING Science of Sound, Chapter 16 Springer Handbook of Acoustics, Chapter 16

RHYTHM A STRIKING CHARACTERISTIC OF A FOREIGN LANGUAGE IS ITS RHYTHM. ENGLISH, RUSSIAN, ARABIC AND THAI ARE STRESS-TIMED LANGUAGES. STRESSED SYLLABLES RECUR AT APPROXIMATELY EQUAL INTERVALS. SYLLABLES MOST OFTEN END WITH A CONSONANT. FRENCH, SPANISH, GREEK, ITALIAN, YORUBA AND TELEGU ARE SYLLABLE TIME LANGUAGES. SYLLABLES RECUR AT APPROXIMATELY EQUAL INTERVALS. SYLLABLES OFTEN END WITH A VOWEL. RHYTHMIC PATTERNS CAN BE USED TO SIGNAL DIFFERENCES IN SYNTACTIC STRUCTURE. COMPARE: 1.The 2000-year-old skeletons 2. The two 1000-year-old skeletons

PROSODY IN LINGUISTICS, PROSODY IS THE RHYTHM, STRESS, AND INTONATION OF SPEECH. PROSODY MAY REFLECT VARIOUS FEATURES OF THE SPEAKER OR THE UTTERANCE, THE EMOTIONAL STATE OF A SPEAKER, WHETHER THE UTTERANCE IS A STEMENT, A QUESTION, OR A COMMAND; WHETHER THE SPEAKER IS BEING IRONIC OR SARCASTIC; EMPHASIS, CONTRAST AND FOCUS. IN TERMS OF ACOUSTICS, THE PROSODICS OF ORAL LANGUAGES INVOLVE VARIATION IN SYLLABLE LENGTH, LOUDNESS, PITCH, AND THE FORMANT FREQUENCIES OF SPEECH SOUNDS. PROSODY IS OF GREAT INTEREST IN AUTOMATIC SPEECH RECOGNITION

DECLARATIVE, INTEROGATIVE, IMPERATIVE DECALARATIVE: “You are going home” INTEROGATIVE: “You are going home?” (voice is raised at end of sentence) IMPERATIVE: “You ARE going home!” (are is emphasized)

EMOTIONAL STATE OF THE SPEAKER PROSODIC FEATURES TEND TO INDICATE THE EMOTIONAL STATE OF THE SPEAKER. “RAISING ONE’S VOICE “ IN ANGER, FOR EXAMPLE, INCREASES BOTH LOUDNESS AND PITCH. A STATE OF EXCITEMENT FREQUENCY CAUSES AN INCREASE IN THE RATE OF SPEAKING. ATTEMPTS HAVE BEEN MADE TO ACCOMPLISH ACOUSTIC “LIE DETECTION” BY ANALYZING THE PROSODIC FEATURES OF RECORDED SPEECH FOR EVIDENCE OF STRESS

EFFECT OF EMOTION ON PHONATION FREQUENCY PHONATION FREQUENCY vs TIME FOR THREE ACTORS SPEAKING THE SAME SENTENCE (“For God’s sake!”) IN FOUR DIFFERENT MODES (Williams and Stevens 1972)

EFFECT OF EMOTION ON PHONATION FREQUENCY MEDIAN AND RANGE OF THE PHONATION FREQUENCY FOR THREE ACTORS SPEAKING THE SAME SENTENCE: S=SORROW; N=NEUTRAL; F=FEAR; A=ANGER

RADIO ANNOUNCER SPEAKING BEFORE (top) AND AFTER (bottom) THE CRASH OF THE HINDENBURG DIRIGIBLE (1937)

STRESS SPECTOGRAMS OF THE WORD “SQUEAL” SPOKEN WITH FOUR DEGREES OF STRESS IN RESPONSE TO A LIST OF QUESTIONS (Brownlee 1996)

TONE IN SOME LANGUAGES, SUCH AS CHINESE, A PHONEME CAN TAKE ON DIFFERENT MEANINGS DEPENDING ON ITS TONE. THE FOUR TONES IN MANDARIN CHINESE ARE SHOWN

VOICE QUALITY VOICE QUALITY IS A BROAD TERM THAT REFERS TO THE EXTRALINGUISTIC ASPECTS OF A SPEAKER’S VOICE WITH REGARD TO IDENTITY, PERSONALITY, HEALTH, AND EMOTIONAL STATE. VOCAL FOLD MASS, VOCAL TRACT LENGTH, TRACHEAL LENGTH, JAW AND TONGUE SIZE, AND NASAL CAVITY VOLUME MAY INDICATE INFORMATION ABOUT AGE, SEX, PHYSIQUE, AND HEALTH.

“High fidelity on the line: please say ‘ahh’” THIS IS THE TITLE OF AN INTERESTING ARTICLE BY STEN TERNSTRÖM IN THE FALL 2008 ISSUE OF ECHOES. SPECTRA OF SPEECH SOUNDS ARE ESPECIALLY RICH UP TO 4000 Hz, AND FALL OFF RAPIDLY ABOVE 5000 Hz. BUT HIGH HARMONICS CAN BE MEASURED UP TO 20 kHz. EARLY TELEPHONES TRANSMITTED ONLY Hz WITH LITTLE LOSS IN INTELLIGIBILITY (SEE FILTERED SPEECH IN LESSON 3). IN 2000, A WIDE-BAND STANDARD FOR TELEPHONY WAS DEFINED UP TO Hz, A BIG IMPROVEMENT OVER THE OLD “TELEPHONE SOUND.” HOPEFULLY CELL-PHONE SOUND WILL SOON SOUND MUCH BETTER. VOICES HEARD IN LIVE PERFORMANCE MAY SOUND A LITTLE “DULL” OF “FADED” BEYOND THE 15 TH ROW, BECAUSE HIGH FREQUENCIES ARE SLIGHTLY DIMINIISHED.

NORMAL, “YAWNY”, AND “TWANGY” VOICE Story, Titze, and Hoffman (2001) did a 3-dimensional study of the vocal tract using MRI to determine the shape when vowels /i/, /ae/, /α/, and /u/ were spoken with NORMAL, “YAWNY”, and “TWANGY” voice. Relative to NORMAL speech, the ORAL CAVITY is widened and the TRACT is lengthened for YAWNY vowels. F1 and F2 moved closer together. TWANGY vowels were characterized by shortened TRACT length, widened LIP OPENING, and a slightly constricted ORAL CAVITY. F1 and F2 moved farther apart.

Story, Titze and Hoffman, 2001)

Story, Titze Hoffman, 2001)

ACCENTS “TWO COUNTRIES SEPARATED BY A COMMON LANGUAGE” Have you ever misunderstood someone or been misunderstood by someone who speaks with a different accent? The sounds that an American hears as 'Bob the clerk' may be heard by an Australian as 'barb the clock'.

The two most important parameters in determining different vowel sounds are the first two formants, which are frequency bands with increased power. These are the two axes on the graph. The axes are traditionally plotted backwards, as here, so that they approximately correspond to the axes long used by phoneticians and linguists: F1 (vertical) approximately corresponds to the jaw height (which correlates negatively with the extent of the mouth opening). F2 (horizontal) approximately corresponds to the position (forward or back) of the constriction of the vocal tract where the tongue is close to the roof of the mouth. Other important parameters are the length of the vowel and other formants

F1 AND F2 FOR ENGLISH VOWEL SOUNDS SPOKEN BY AUSTRALIAN SPEAKERS F1 CORRELATES WITH MOUTH OPENING; F2 CORRELATES WITH TONGUE PLACEMENT

AMERICAN SPEAKER AUSTRALIAN SPEAKER For the Australians in this sample, the words "hud" and "hard" have a similar sound, the main difference is the length. For this sample of Americans, it is "hud" and "heard" that are distinguished by length. For an Australian, a long bud is a bard, for an American, it's a bird.

TO PARTICIPATE IN THIS SURVEY BY WOLFE, SMITH AND COLLEAGUES, CLICK ON