Speech Communications (Chapter 7)

Presentation transcript:

Speech Communications Chapter 7

Speech Communications
- The Nature of Speech
- Criteria for Evaluating Speech
- Components of Speech Communication System
- Synthesized Speech

The Nature of Speech (1/2)
- Speech production: respiratory system, articulators
- Types of speech sound
  - Phoneme: the shortest segment of speech; changing it changes the meaning
  - Classification: vowels, consonants, diphthongs
- Phoneme → Syllable → Word → Sentence

The Nature of Speech (2/2)
- Depicting speech
  - Waveform, spectrum
  - Sound spectrogram (Fig 8-1)
- Intensity of speech
  - Average intensity (speech power): vowels > consonants
  - Intelligibility: consonants are more important
- Frequency composition of speech
  - Low-frequency content: men > women (Fig 8-2)
  - Shouting: frequency rises

Criteria for Evaluating Speech
- Speech intelligibility
  - Methods
    - Listener repeats the presented sounds
    - Listener answers questions
  - Tests
    - Nonsense syllables
    - Isolated words (phonetically balanced, PB)
    - Sentences
- Speech quality (naturalness)
  - Preference
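
The "repeat the presented sounds" method can be scored as the percentage of test items the listener repeats correctly. The following is a minimal sketch only; the function name, the case-insensitive comparison, and the example words are assumptions made for illustration and do not come from the slides.

```python
def intelligibility_score(presented, repeated):
    """Return the percentage of presented items that were repeated correctly.

    Assumption: a response counts as correct when it matches the presented
    item exactly, ignoring case and surrounding whitespace.
    """
    if len(presented) != len(repeated):
        raise ValueError("each presented item needs exactly one response")
    if not presented:
        raise ValueError("at least one test item is required")
    correct = sum(
        p.strip().lower() == r.strip().lower()
        for p, r in zip(presented, repeated)
    )
    return 100.0 * correct / len(presented)


# Hypothetical isolated-word test: one of four words is misheard.
presented = ["ship", "bead", "cane", "dock"]
repeated = ["ship", "beat", "cane", "dock"]
score = intelligibility_score(presented, repeated)
print(score)
```

The same scoring applies to nonsense syllables or sentences as long as each item has one response.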

Components of Speech Communication System
- Speaker
- Message
- Transmission System
- Noise
- Hearer

Components of Speech Communication System (1/7)
- Speaker
  - Enunciation (clear articulation)
  - Superior speakers
    - Longer syllable duration
    - Greater intensity
    - More total time with speech sounds
    - Frequencies varied

Components of Speech Communication System (2/7)
- Message
  - Phoneme confusion
    - Commonly confused letter groups: DVPBGCET, FXSH, KJA, MN
    - Avoid single letters; use a word-spelling alphabet
  - Word characteristics
    - Familiar words
    - Long words
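
A word-spelling alphabet replaces each easily confused letter with a longer, more distinctive word. The sketch below assumes the ICAO radiotelephony alphabet as the word list; the slides do not say which alphabet is meant, and the `spell` helper is a hypothetical name used only for illustration.

```python
# Assumption: the ICAO radiotelephony spelling alphabet is used as the word list.
ICAO_WORDS = (
    "Alfa Bravo Charlie Delta Echo Foxtrot Golf Hotel India Juliett "
    "Kilo Lima Mike November Oscar Papa Quebec Romeo Sierra Tango "
    "Uniform Victor Whiskey X-ray Yankee Zulu"
).split()

# Map "A" -> "Alfa", "B" -> "Bravo", and so on.
SPELLING = {chr(ord("A") + i): word for i, word in enumerate(ICAO_WORDS)}


def spell(text):
    """Spell out text letter by letter; non-letters pass through unchanged."""
    return [SPELLING.get(ch.upper(), ch) for ch in text if not ch.isspace()]


# The letters D, V and P sound alike; their code words do not.
print(spell("DVP"))
```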

Components of Speech Communication System (3/7)
- Message
  - Context features
    - Sentence: meaningful > nonsense
    - Set size: a large vocabulary is less intelligible than a small one (Fig 7-3)
    - Guidelines
      - Use a small vocabulary
      - Use standard sentences
      - Avoid short words
      - Familiarize the user with the words

Components of Speech Communication System (4/7)
- Transmission System
  - Filtering (frequency distortion) (Fig 7-4)
    - High-pass: cutoff < 600 Hz
    - Low-pass: cutoff > 4000 Hz
  - Amplitude distortion (Fig)
    - Peak clipping → quality ↓, intelligibility ≈ unchanged
    - Center clipping → intelligibility ↓
    - To improve intelligibility: peak clipping → amplify (consonant/vowel ratio ↑)
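
The two kinds of amplitude distortion can be illustrated on a short list of sample values. This is a minimal sketch under stated assumptions: peak clipping is modeled as limiting every sample to ±`limit`, and center clipping as zeroing every sample whose magnitude does not exceed `limit`. The function names, threshold, gain, and sample values are hypothetical.

```python
def peak_clip(samples, limit):
    """Limit every sample to the range [-limit, +limit]."""
    return [max(-limit, min(limit, s)) for s in samples]


def center_clip(samples, limit):
    """Zero every sample whose magnitude does not exceed limit."""
    return [s if abs(s) > limit else 0.0 for s in samples]


def amplify(samples, gain):
    """Scale every sample by a constant gain."""
    return [gain * s for s in samples]


# Hypothetical signal: small values stand in for weak consonant energy,
# large values for strong vowel energy.
samples = [0.1, -0.2, 0.9, -1.0, 0.3]

peaks_removed = peak_clip(samples, 0.5)      # loud peaks flattened
center_removed = center_clip(samples, 0.5)   # weak portions discarded
boosted = amplify(peaks_removed, 2.0)        # clip, then reamplify

print(peaks_removed)
print(center_removed)
print(boosted)
```

After clipping and reamplifying, the weak samples are larger relative to the peaks than they were in the original signal, which is one way to picture the raised consonant-to-vowel ratio. Center clipping, by contrast, removes exactly those weak portions.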

Components of Speech Communication System (5/7)
- Noise
  - Articulation Index (AI) (Fig 7-7)
    - 1/3-octave bands, S-N (speech-to-noise) difference, weighted sum
    - Intelligibility (Fig 7-8, Tab 7-1)
  - Preferred-Octave Speech Interference Level (PSIL)
    - Mean of the 500, 1000, and 2000 Hz octave-band levels
    - SIL: mean of ..., ..., ...
    - Intelligibility (vs. distance) (Fig 7-9)
    - Subjective rating (Fig 7-10, Tab 7-2)
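
Both noise measures reduce to simple arithmetic over band levels. The sketch below is illustrative only: the band weights and all level values are hypothetical placeholders rather than the published AI weights, and limiting each speech-to-noise difference to the range 0 to 30 dB is an assumption about the band method, not something stated on the slide.

```python
PSIL_BANDS_HZ = (500, 1000, 2000)


def psil(octave_levels_db):
    """Mean noise level (dB) of the octave bands centered at 500, 1000, 2000 Hz.

    octave_levels_db maps a band center frequency in Hz to its level in dB.
    """
    return sum(octave_levels_db[f] for f in PSIL_BANDS_HZ) / len(PSIL_BANDS_HZ)


def articulation_index(speech_db, noise_db, weights, max_diff_db=30.0):
    """Weighted sum of per-band speech-minus-noise differences.

    Assumption: each difference is limited to the range 0..max_diff_db
    before weighting. The weights are supplied by the caller.
    """
    if not (len(speech_db) == len(noise_db) == len(weights)):
        raise ValueError("speech, noise and weights need one value per band")
    total = 0.0
    for s, n, w in zip(speech_db, noise_db, weights):
        diff = min(max(s - n, 0.0), max_diff_db)
        total += w * diff
    return total


# Hypothetical octave-band noise levels in dB.
noise_octaves = {500: 60.0, 1000: 66.0, 2000: 63.0}
print(psil(noise_octaves))

# Hypothetical three-band example with made-up weights.
ai = articulation_index(
    speech_db=[70.0, 65.0, 60.0],
    noise_db=[50.0, 70.0, 45.0],
    weights=[0.01, 0.01, 0.01],
)
print(ai)
```

In the second band the noise exceeds the speech, so that band contributes nothing to the sum.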

Components of Speech Communication System (6/7)
- Noise
  - Preferred Noise Criterion Curve (PNC) (Fig 7-11, Tab 7-3)
  - Reverberation (Fig 7-12)
    - Reverberation time: time for the sound to decay by 60 dB
    - Reverberation time ↑ → intelligibility ↓
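
Reverberation time, defined as the time for a sound to decay by 60 dB, can be read off a measured decay curve. This is a minimal sketch assuming the curve is given as paired lists of times and levels and that linear interpolation between measurements is acceptable; the function name and the data are hypothetical.

```python
def reverberation_time(times_s, levels_db, drop_db=60.0):
    """Time at which the level first falls drop_db below its starting value.

    times_s and levels_db are equal-length lists describing a decay curve.
    Returns None if the curve never decays that far.
    """
    if len(times_s) != len(levels_db) or not times_s:
        raise ValueError("need matching, non-empty time and level lists")
    target = levels_db[0] - drop_db
    for i in range(1, len(times_s)):
        l0, l1 = levels_db[i - 1], levels_db[i]
        if l1 <= target:
            # Interpolate linearly between the two surrounding measurements.
            t0, t1 = times_s[i - 1], times_s[i]
            fraction = (l0 - target) / (l0 - l1)
            return t0 + fraction * (t1 - t0)
    return None


# Hypothetical decay: 90 dB at t = 0 s, falling 30 dB per second.
rt = reverberation_time([0.0, 1.0, 3.0], [90.0, 60.0, 0.0])
print(rt)
```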

Components of Speech Communication System (7/7)
- Hearer
  - Age (Fig 7-13)
  - Wearing of hearing protection

Synthesized Speech
- Types
- Uses
- Performance
- Preference
- Guidelines

Synthesized Speech
- Types
  - Synthesis by analysis
    - Digitized human speech → compressed data format
    - Drawbacks: limited to what has been encoded and stored; lack of coarticulation
  - Synthesis by rule
    - Drawback: poorer quality