Teaching Tool For French Speech Pronunciation Capstone Design Project 2008 Joseph Ciaburri Advisor: Professor Catravas.

Slides:

Advertisements

Similar presentations

APPROACHES TO T&L Language

Advertisements

DCSP-13 Jianfeng Feng

Acoustic/Prosodic Features

Sinew Technology Co., Ltd. DTS II Digital Language Training System.

Time-Frequency Analysis Analyzing sounds as a sequence of frames

Natalie Fong English Centre, The University of Hong Kong Good Practices in a Second Language Classroom: An Alternating Use of ICT in Independent Learning.

Analysis and Digital Implementation of the Talk Box Effect Yuan Chen Advisor: Professor Paul Cuff.

Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),

1 Acoustic Sampling Of Instruments Dan Starr Capstone Design Project Advisors: Prof. Catravas Prof. Postow.

Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.

Sampling Chapter 2 ME 392 Sampling Chapter 2 ME January 2012 week 4 Joseph Vignola.

Xkl: A Tool For Speech Analysis Eric Truslow Adviser: Helen Hanson.

4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,

PDACS Michelle Berger John Curtin Trey Griffin Aaron King Michael Nordfelt Jeffrey Whitted.

Effects in frequency domain Stefania Serafin Music Informatics Fall 2004.

Unit II Four Language Skills: Aural and Oral Reading and Writing.

Evaluation of Speech Detection Algorithm Project 1b Due February 14th.

Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.

A PRESENTATION BY SHAMALEE DESHPANDE

Adapted from CTAE Resources Network PROFITT Curriculum Basic Computer Skills Module 1 Hardware.

Introduce about sensor using in Robot NAO Department: FTI-FHO-FPT Presenter: Vu Hoang Dung.

Design and Development of an Accelerometer based Personal Trainer System By Emer Bussmann B.E. Electronic Engineering April 2008.

Representing Acoustic Information

LE 460 L Acoustics and Experimental Phonetics L-13

Digital Sound and Video Chapter 10, Exploring the Digital Domain.

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

Senior Project – Electrical Engineering Tool for Improving Non-Native French Speech Pronunciation Joseph Ciaburri Advisor – Professor Catravas,

[1] Processing the Prosody of Oral Presentations Rebecca Hincks KTH, The Royal Institute of Technology Department of Speech, Music and Hearing The Unit.

Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.

Speech Recognition ECE5526 Wilson Burgos. Outline Introduction Objective Existing Solutions Implementation Test and Result Conclusion.

Privacy Protection for Life-log Video Jayashri Chaudhari, Sen-ching S. Cheung, M. Vijay Venkatesh Department of Electrical and Computer Engineering Center.

ECE 598: The Speech Chain Lecture 7: Fourier Transform; Speech Sources and Filters.

Wireless and Mobile Computing Transmission Fundamentals Lecture 2.

1 BILC SEMINAR 2009 Speech Recognition: Is It for Real? Tony Mirabito Defense Language Institute English Language Center (DLIELC) DLIELC.

The Computer and the Human Body The Computer and the Human Body.

♥♥♥♥ 1. Intro. 2. VTS Var.. 3. Method 4. Results 5. Concl. ♠♠ ◄◄ ►► 1/181. Intro.2. VTS Var..3. Method4. Results5. Concl ♠♠◄◄►► IIT Bombay NCC 2011 : 17.

American Speechsounds How to Use the Program. AmericanSpeechsounds Why use American Speechsounds? Practice the problem sounds of American English Learn.

Basics of Neural Networks Neural Network Topologies.

Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )

Digital Image Processing Chapter 4 Image Enhancement in the Frequency Domain Part I.

Perceptual Analysis of Talking Avatar Head Movements: A Quantitative Perspective Xiaohan Ma, Binh H. Le, and Zhigang Deng Department of Computer Science.

In and Out are opposites. This is something to keep in mind when considering Input and Output. INPUT OUTPUT Ask: Does this device send information in?

Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.

ICVGIP 2012 ICVGIP 2012 Speech training aids Visual feedback of the articulatory efforts during acquisition of speech production by a hearing-impaired.

Counting How Many Words You Read

Fourier and Wavelet Transformations Michael J. Watts

Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.

The Discrete Fourier Transform

Real-Time Speech Pitch Shifting on an FPGA

Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:

ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.

Bryant Tober. Problem Description  View the sound wave produced from a wav file  Apply different modulations to the wave file  Hear the effect of the.

Speech Processing Dr. Veton Këpuska, FIT Jacob Zurasky, FIT.

1st Trial (Late 2015) Results Future Directions

Topic: Waveforms in Noesis

Ch. 2 : Preprocessing of audio signals in time and frequency domain

Speech Processing AEGIS RET All-Hands Meeting

ARTIFICIAL NEURAL NETWORKS

Speech Processing AEGIS RET All-Hands Meeting

Vision for Robotic Applications

Fourier and Wavelet Transformations

CHAPTER 3 DATA AND SIGNAL

Speech Recognition Christian Schulze

CS 2610 Project Presentation Presented By- Zuha Agha and Tazin Afrin

LECTURE 18: FAST FOURIER TRANSFORM

Analysis of Audio Using PCA

ECE 791 Project Proposal Project Title: Developing and Evaluating a Tool for Converting MP3 Audio Files to Staff Music Project Team: Salvatore DeVito.

Voice Manipulator Department of Electrical & Computer Engineering

Keyword Spotting Dynamic Time Warping

LECTURE 18: FAST FOURIER TRANSFORM

Presentation transcript:

Teaching Tool For French Speech Pronunciation Capstone Design Project 2008 Joseph Ciaburri Advisor: Professor Catravas

Motivation Use feedback that allows for self diagnosis Make tool as simple as possible for student Improve French pronunciation through the repetition of visual and aural aids Tony Blair Congratulating Nicolas Sarkozy on Election Win Interview with Domnique Villepin

USER Window 1 Native Speaker Audio SpeechSpeech MicrophoneWebcam VideoVideo Data Acquisition Window 2 Audio and Video of User Speaking Data DataData Window 3 Diagnostics Video Audio Video Audio Audio/Visual Databank Proposed Learning System

Design Specifications Goals Read in audio and video at the same time Play back audio and video at the same time within 1 second Minimize system requirements Implement diagnostics that are sensitive to pronunciation differences Provide pronunciation feedback via bulls eye Simplicity

Microphone and Webcam to Data Acquisition USER Webcam VIDEOVIDEO Data Acquisition Data Auto light compensation Average Frame Rate 15 frames per second Large file stored as a variable Length of video is short ~5 Seconds Camera lights up when recording USER Microphone Data Acquisition SpeechSpeech Data Webcam Microphone Sampling rate used is adjustable up to 44.1khz Saved as a variable Reads in simultaneously with video Microphone Built into webcam Auto noise cancellation Power comes from computer Able to crop to only speech

Repetition of User USER Window 2 Video of User Data Acquisition Data AUDIOAUDIO VIDEOVIDEO Play back from variable Allows for a quicker load time Less than 3 seconds to load video Audio and Video do not play in sync Play length ~ 5 Seconds Keeps memory requirements low

Diagnostics Data Acquisition DataData Window 3 Diagnostics USER AudioAudioVideoVideo Can create and graph spectrogram data Allows for determination of vowels using the formants and consonants using the transitions of the formants Can create and graph cepstrum data Inverse Fourier Transform of the log of the Fourier Transform Can find fundamental frequency Can find Zero Crossings Zero Crossings show silence versus speech Bulls eye allows for two inputs, along the x and y, graphed as a percent distance from the center

Results Time Domain: Non Native SpeakerTime Domain: Native Speaker Spectrogram: Non Native SpeakerSpectrogram: Native Speaker

Results Continued Cepstrum: Non Native SpeakerCepstrum: Native Speaker Zero : Non Native Speaker Zero Crossings: Native Speaker

Design Specifications Goals Read in audio and video at the same time Play back audio and video at the same time within 1 second Implement diagnostics that are sensitive to pronunciation differences Provide pronunciation feedback via bulls eye Simplicity Minimize system requirements Accomplished Can read synchronized audio and video into MATLAB Can play back audio or video separately, or unsynchronized audio and video in MATLAB Can plot diagnostics and find fundamental frequency Can plot on bulls eye All in one webcam as well as keeping the whole program in MATLAB

In Progress Identifying specific components of speech that specific to French –Vowels –Consonants Quantifying these components and using them on the bulls eye Creating a GUI Gather more video samples

Future Research Integrating other languages Evaluation –Use of non-native speakers –Use of native speakers Testing in the use of facial communication in oral communication Basis for comparison of other audible signals

Acknowledgements Professor Rudko Professor Hanson Professor Streignitz Professor Cotter Professor Catravas Professor Chilcoat Professor Pickering Professor Spallholz