09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Digital Speech.

Slides:



Advertisements
Similar presentations
Ease of Access and Assistive Technology on Windows 7 Computer Access for Individuals with Visual Impairments.
Advertisements

Acoustic/Prosodic Features
03/18/2005ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 4: Digital.
Introduction to Acoustics Words contain sequences of sounds Each sound (phone) is produced by sending signals from the brain to the vocal articulators.
Speaker Recognition Sharat.S.Chikkerur Center for Unified Biometrics and Sensors
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
6/3/20151 Voice Transformation : Speech Morphing Gidon Porat and Yizhar Lavner SIPL – Technion IIT December
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Mobile Agents Using Sound Daniel Hägglund
Digital Audio Editing
02/17/05ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 2: Video Processing.
02/25/2005ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 2: Video Processing.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
10/14/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Image Processing.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
05/06/2005ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Final Remarks.
01/28/2005 ENEE408G Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing TA: Hung-Quoc Lai,
03/04/2005ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 3: Digital.
Analysis & Synthesis The Vocoder and its related technology.
09/02/2005 ENEE408G Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing TA: Hung-Quoc Lai,
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
Digital signal Processing Digital signal Processing ECI Semester /2004 Telecommunication and Internet Engineering, School of Engineering, South.
02/04/2005ENEE408G Spring 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Image Processing.
Human Psychoacoustics shows ‘tuning’ for frequencies of speech If a tree falls in the forest and no one is there to hear it, will it make a sound?
5. Multimedia Data. 2 Multimedia Data Representation  Digital Audio  Sampling/Digitisation  Compression (Details of Compression algorithms – following.
Brief description of recording by microphone. Prepartion for recording from microphone – Step 1 4. Selection of recording source: Microphone, which needs.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
Multimedia Specification Design and Production 2013 / Semester 2 / week 3 Lecturer: Dr. Nikos Gazepidis
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Creating Multimedia Interaction with Windows Media Technologies 7.
Activity 1 Record and edit your voice using Audacity 1.Download Audacity (a free and open source audio editing software from
Group Members: Sam Marlin, Jonathan Brown Faculty Adviser: Tom Miller.
HELPA ELPA! 2011 Justin Johnson PPS. 4 Tasks 1. Learn to set up the headphones. 2. learn how to get onto the practice test and take it. 3. learn about.
There are two methods to get to the practice tests: 1.Through the secure browser 2.Through a Web browser January 2014 Smarter Balanced Practice Test Workshop.
09/30/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 2: Digital Audio.
Advanced Topics in Speech Processing (IT60116) K Sreenivasa Rao School of Information Technology IIT Kharagpur.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio
Objective 3.02 Train the system and input simple documents using speech writing techniques.
(Extremely) Simplified Model of Speech Production
Setting up your computer’s microphone Begin by double clicking on the volume icon within the task bar.
 Audacity has many different uses  Record live audio  Copy or splice sound tracks together  Change the speed or pitch of a recording  Import and.
Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal.
Introduction Part I Speech Representation, Models and Analysis Part II Speech Recognition Part III Speech Synthesis Part IV Speech Coding Part V Frontier.
Audacity.  Tutorial: tutorial.com/17-audacity-tutorial.htmhttp:// tutorial.com/17-audacity-tutorial.htm.
Introduction to Audio Recording. Summary Questions 1.What is the first step in setting up your computer to record? Copyright © Texas Education Agency,
Speech Recognition Created By : Kanjariya Hardik G.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
Lesson 4 Alternative Methods Of Input.
Alternative Methods Of Input
Standard Methods of Input.
Speech Recognition There are different kinds of voice or speech "engines" that take the sounds of your voice and match it with words. The engine is software.
Talking with computers
Vocoders.
Introduction to Audio Recording
Introduction to Audio Recording
Lesson 4 Alternative Methods Of Input.
Using Speech Recognition for Input: A Powerful and Readily Available Tool Dr. Donna Olsen Instructional Technologist Central Wyoming College
Speech Recognition There are different kinds of voice or speech "engines" that take the sounds of your voice and match it with words. The engine is software.
Dialog Design 4 Speech & Natural Language
The Vocoder and its related technology
Activity 1 Record and edit your voice using Audacity
Lesson 4 Alternative Methods Of Input.
Speech Processing Final Project
Digital Audio Application of Digital Audio - Selected Examples
Auditory Morphing Weyni Clacken
Presentation transcript:

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Digital Speech Processing

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 2 Outline of Design Project 1 Part I : Speech Analysis Part II : Speech Coding: Linear Predictive Vocoder Part III: Speech Recognition by IBM ViaVoice Part IV: Speech Synthesis Part V : Human Computer Interface Part VI: Mobile Computing and Pocket PC Programming

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 3 Adjust the Microphone Device Use Sound Recorder By accessories  entertainment  sound recorder Select Line-In 2/Mic 2 By Edit  audio properties  sound recording  Volume

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 4 Part I. Speech Analysis (1) Human Vocal Apparatus

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 5 Part I. Speech Analysis (2) Vocal Tract Model

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 6 Part I. Speech Analysis (3) COLEA toolbox: Waveform on Time Domain Spectrogram Pitch and Formant Tracking LPC Spectra Record your own voice and analyze pitch and formants.

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 7 Part I. Speech Analysis (4)

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 8 Part I. Speech Analysis (5) Gender Identification: Use Auditory Toolbox to obtain Linear Predictive coefficients. Design your algorithm to identify the gender of samples in the training set. Test your algorithm on 9/26 by new samples.

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 9 Pat II. Linear Predictive Vocoder: Encoder Encoder:

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 10 Part II. Linear Predictive Vocoder:Decoder

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 11 Part III. Speech Recognition IBM ViaVoice ViaVoice Training: Operate PC by ViaVoice

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 12 Part III. IBM ViaVoice Training Start from BLUE word. Keep specking, the recognized words become GRAY. If you hear sounds or the BLUE sign stop in a specific word, return to the blue word and read the BLACK sentence again.

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 13 Part III. IBM ViaVoice Dictation Speak Pad Menu Bar: 1. Menu Button 2. Microphone State 3. Status Area 4. ViaCenter Help 5. Current User

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 14 Part IV. Speech Synthesis Text-To-Speech and Talking Head Vowel Synthesis Demo

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 15 Part V. Human Computer Interface CSLU Human Computer Interface Rapid Application Developer (RAD) Start  Speech Toolkit  RAD MIT Galaxy System JUPITER: Weather Information System TEL: PEGASUS: Airline Flight Planning System TEL:

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 16 Part VI. Pocket PC Programming Apply what you learned from previous parts and design a simple application related to digital speech processing by Microsoft eMbedded Tools for Pocket PC.

09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 17 Announcement Matlab task: Part II C++ task: Part VI Check out Pocket PC