Speech Communications Chapter 7. Speech Communications  The Nature of Speech    Criteria for Evaluating Speech    Components of Speech Communication.

Slides:

Advertisements

Similar presentations

Acousteen Herman J.M. Steeneken Subjective Intelligibility Assessment Dr. Herman J.M. Steeneken.

Advertisements

Speech Perception Dynamics of Speech

專題研究語音訊號處理專題助教：余典翰指導教授：李琳山 2013/07/30.

3. SPEECH RECOGNITION, ANALYSIS, AND SYNTHESIS MUSIC 318 MINI-COURSE ON SPEECH AND SINGING Science of Sound, Chapter 16 The Speech Chain, Chapters 7, 8.

What makes a musical sound? Pitch n Hz * 2 = n + an octave n Hz * ( …) = n + a semitone The 12-note equal-tempered chromatic scale is customary,

Room Acoustics: implications for speech reception and perception by hearing aid and cochlear implant users 2003 Arthur Boothroyd, Ph.D. Distinguished.

Lecture 22 Frequency Response Hung-yi Lee Filter Outline (Chapter 11) Amplitude Ratio Phase Shift Highpass Filter Frequency Response Bode Plot Draw frequency.

CENTER FOR SPOKEN LANGUAGE UNDERSTANDING 1 PREDICTION AND SYNTHESIS OF PROSODIC EFFECTS ON SPECTRAL BALANCE OF VOWELS Jan P.H. van Santen and Xiaochuan.

Introduction to Phonology. Introduction to Phonetics Human listeners can hear speech as a sequence of sounds, and each sound can be represented by a written.

Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.

Speech Perception Overview of Questions Can computers perceive speech as well as humans? Does each word that we hear have a unique pattern associated.

Event Sampling 事件取樣法. 關心重點為「事件」本身明確的焦點行為清楚掌握主題 - 當「事件」出現時才開始記錄記錄程序等待目標事件的發生開始記錄事件結束，停止記錄.

Speech perception Relating features of hearing to the perception of speech.

數位學習經驗分享「 E 化教學教室與虛擬攝影棚」推廣經驗分享暨觀摩高高屏活動義守大學應用數學系郎正廉.

SPPA 6010 Advanced Speech Science 1 The Source-Filter Theory: The Sound Source.

研究法簡介何明洲中山醫學大學心理系. Single Factor – Two Levels Independent groups design: use random assignment –IV, manipulated –Between-subject Matched groups design:

Chapter 3 Data and Signals

Hint of Homework 4 jinnjy. Outline Hint of exercise 3.18.

Department of Electronic Engineering City University of Hong Kong EE3900 Computer Networks Data Transmission Slide 1 Continuous & Discrete Signals.

錄音筆,MP3 撥放器, 隨身碟之原理及規格. 定義錄音筆 – 以錄音為首要功能 MP3 撥放器 – 以播放音樂為首要功能隨身碟 – 以行動碟為功能.

Hormones and Toy Preferences Chap 16. 緒論 Congenital Adrenal Hyperplasia (CAH) exposed to high levels of androgen in prenatal and early postnatal Toy Preference.

網路廣告 Web Advertising. 2 商業廣告不被認知認知熟悉 / 信任沒有交易過零星交易固定交易.

SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?

SPSS 分析簡介何明洲中山醫學大學心理系. 資料在 SPSS 上之排列 Between-subject design, one factor with three levels.

溶劑可以溶解反應物，形成均勻的反應系統；溶劑用來調整反應物的濃度與反應溫度，控制速率與方向；溶劑萃取，分離特定的化合物。溶劑，特別是有機溶劑，是環境污染的主要來源。綠色（永續）化學逐漸形成一種新的科學理念。溶劑的選擇與化學反應的設計，必須加上環境因素的考量。化學家已發展出許多有機溶劑替代液體及綠色的合成方法：

SPPA 4030 Speech Science1 Sound Physics. SPPA 4030 Speech Science2 Outline  What is sound?  Graphic representation of sound  Classifying sounds  The.

Optimization And Differential Equations 最佳化與微分方程 Peng-Jen Lai ( 賴鵬仁 ) Department of Mathematics National Kaohsiung Normal University ( 高雄師範大學數學系 ) ( 高雄師範大學數學系.

Auditory, Tactual, and Olfactory Displays

Chapter 2 Frequency Distributions 次數分配

SPPA 403 Speech Science1 Unit 3 outline The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs.

第五章IIR數位濾波器設計濾波器的功能乃對於數位信號進行處理﹐ 以滿足系統的需求規格。其作法為設計一個系統的轉移函數﹐或者差分方程式﹐使其頻率響應落在規格的範圍內。本章探討的是其中一種方法﹐稱為Infinite impulse register(IIR)。 IIR架構說明。各種不同頻帶(Band)濾波器的設計方法。

1 Chemical and Engineering Thermodynamics Chapter 1 Introduction Sandler.

Human Psychoacoustics shows ‘tuning’ for frequencies of speech If a tree falls in the forest and no one is there to hear it, will it make a sound?

Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.

Speech Communications (Chapter 7) Prepared by: Ahmed M. El-Sherbeeny, PhD 1.

Human Capabilities Part - B. Speech Communications (Chapter 7) Prepared by: Ahmed M. El-Sherbeeny, PhD 1.

Formatting and Baseband Modulation

LE 460 L Acoustics and Experimental Phonetics L-13

Digital Sound and Video Chapter 10, Exploring the Digital Domain.

Introduction to Interactive Media 10: Audio in Interactive Digital Media.

IE341: Human Factors Engineering Prof. Mohamed Zaki Ramadan Lecture 6 – Auditory Displays.

Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.

By: Sepideh Abolghasem Shabnam Alaghehband Mina Khorram May 2006.

Chapter 7 SPEECH COMMUNICATIONS

1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.

ECE 4710: Lecture #9 1 PCM Noise  Decoded PCM signal at Rx output is analog signal corrupted by “noise”  Many sources of noise:  Quantizing noise »Four.

Chapter 3.2 Speech Communication Human Performance Engineering Robert W. Bailey, Ph.D. Third Edition.

Filtering. What Is Filtering? n Filtering is spectral shaping. n A filter changes the spectrum of a signal by emphasizing or de-emphasizing certain frequency.

Dynamics of speech the diagnostic audiometer test environment patient’s and clinician’s role speech-threshold testing most comfortable loudness and uncomfortable.

Speech Perception 4/4/00.

Noise Pollution and Control

Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.

CH 5 Reduction of Multiple Subsystems 5.1 Introduction 實際系統複雜由許多子系統組成 (1 子系統 1 方塊 ) 分別求子系統的數學模型連結各方塊呈現整體系統如何預測 ? Tp, Ts, Tr, %OS with step input.

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.

Temporal masking of spectrally reduced speech: psychoacoustical experiments and links with ASR Frédéric Berthommier and Angélique Grosgeorges ICP 46 av.

Feedback Filters n A feedback filter processes past output samples, as well as current input samples: n Feedback filters create peaks (poles or resonances)

An Introduction to the and Text Mark-up Chander Tseng 曾國奕 October 2015.

IIT Bombay 17 th National Conference on Communications, Jan. 2011, Bangalore, India Sp Pr. 1, P3 1/21 Detection of Burst Onset Landmarks in Speech.

A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.

The Speech Chain (Denes & Pinson, 1993)

Suprasegmental Properties of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.

Acoustic Phonetics 3/14/00.

Figures for Chapter 8 Candidacy Dillon (2001) Hearing Aids.

Speechreading Based on Tye-Murray (1998) pp

Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.

King Saud University College of Engineering IE – 341: “Human Factors Engineering” Fall – 2016 (1st Sem H) Human Capabilities Part – C. Speech.

Ch.1: Introduction to audio signal processing

Noise Aperiodic complex wave

Speech Communications

Presentation transcript:

Speech Communications Chapter 7

Speech Communications  The Nature of Speech    Criteria for Evaluating Speech    Components of Speech Communication System    Synthesized Speech  

The Nature of Speech 1/2  發聲 : 呼吸系統, Articulators  Types of Speech Sound  Phoneme ( 音素 ) − shortest segment of speech if change → meaning change if change → meaning change  分類 : 母音 (vowel), 子音 (consonant) 雙母音 (diphthongs) 雙母音 (diphthongs)  Phoneme →Syllable →Word → Sentence

The Nature of Speech 2/2  Depicting Speech  Waveform, Spectrum  Sound spectrogram Fig 8-1 Fig 8-1 Fig 8-1  Intensity of Speech  Average intensity (speech power): 母音＞子音  Intelligibility: 子音較重要  Frequency Composition of Speech  低頻 : 男＞女 Fig 8-2 Fig 8-2 Fig 8-2  Shouting: frequency 上升

Criteria for Evaluating Speech  Speech Intelligibility ( 能解度 )  方法 − Repeat 呈現的聲音 − 回答問題  Test − Nonsense syllables − Isolated words (phonetically balanced, PB) − Sentences  Speech quality (Naturalness)  Preference

Components of Speech Communication System  Speaker  Message  Transmission System  Noise  Hearer

Components of Speech Communication System  Speaker  Enunciation ( 清晰的聲音 )  Superior Speakers − Longer syllable duration − Greater intensity − More total time with speech sounds − Frequencies varied 1/7

Components of Speech Communication System  Message  Phoneme Confusion − DVPBGCET, FXSH, KJA, MN − Avoid single letters, Word-spelling alphabet  Word Characteristics − Familiar words − Long words 2/7

Components of Speech Communication System  Message  Context Features − Sentence: meaningful ＞ nonsense − Set size: 字多＜字少 Fig 7-3 Fig 7-3 Fig 7-3 − Guidelines  用較少的字  Standard sentence  Avoid short word  Familiarize user 3/7

Components of Speech Communication System  Transmission System  Filtering (Frequency distortion) Fig 7-4 Fig 7-4 Fig 7-4 − High-pass: cutoff ＜ 600 Hz − Low-pass: cutoff ＞ 4000 Hz  Amplitude Distortion Fig Fig Fig − Peak clipping  Quality , Intelligibility ≈ − Center clipping  Intelligibility  − 提高 Intelligibility: Peak clipping  Amplify ( 子音 / 母音  ) 4/7

Components of Speech Communication System  Noise  Articulation Index (AI) Fig 7-7 Fig 7-7 Fig 7-7 − 1/3 octave, S-N, weighted sum − Intelligibility Fig 7-8 Tab 7-1 Fig 7-8Tab 7-1 Fig 7-8Tab 7-1  Preferred-Octave Speech Interference Level (PSIL) − Mean of 500, 1000, 2000 Hz (octave) − SIL: Mean of , ,... − Intelligibility (vs. distance) Fig 7-9 Fig 7-9 Fig 7-9 − Subjective rating Fig 7-10 Tab 7-2 Fig 7-10Tab 7-2 Fig 7-10Tab 7-25/7

Components of Speech Communication System  Noise  Preferred Noise Criterion Curve (PNC) Fig 7-11Fig 7-11 Tab 7-3 Tab 7-3 Fig 7-11Tab 7-3  Reverberation Fig 7-12 Fig 7-12 Fig 7-12 − Reverberation time: Decay 60 dB − Reverberation time   Intelligibility  6/7

Components of Speech Communication System  Hearer  Age Fig 7-13 Fig 7-13 Fig 7-13  Wearing of Hearing Protection 7/7

Synthesized Speech  種類  Uses  Performance  Preference  Guidelines

Synthesized Speech  種類  Synthesis by Analysis − Digitized human speech  compressed data format  compressed data format − 缺點 : 限於 encoded & stored Lack of coarticulation Lack of coarticulation  Synthesis by Rule − 缺點 : quality 較差