Speech Database/Tool System And Preliminary Accent study.

Slides:

Advertisements

Similar presentations

Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),

Advertisements

PHONETICS AND PHONOLOGY

Basic Spectrogram Lab 8. Spectrograms §Spectrograph: Produces visible patterns of acoustic energy called spectrograms §Spectrographic Analysis: l Acoustic.

Classification of Music According to Genres Using Neural Networks, Genetic Algorithms and Fuzzy Systems.

LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.

1 Software Testing and Quality Assurance Lecture 30 – Testing Systems.

The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.

TEAM-1 JACKIE ABBAZIO SASHA PEREZ DENISE SILVA ROBERT TESORIERO Face Recognition Systems.

Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.

Study of Word-Level Accent Classification and Gender Factors

The identification of interesting web sites Presented by Xiaoshu Cai.

Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.

Keystroke Biometric System Client: Dr. Mary Villani Instructor: Dr. Charles Tappert Team 4 Members: Michael Wuench ; Mingfei Bi ; Evelin Urbaez ; Shaji.

 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.

Using Technology to Teach Pronunciation A review of the research from Melike Yücel Eleonora Frigo Laurie Wayne Ling 578, Winter 2010, Dr. Arnold.

Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio

CISB113 Fundamentals of Information Systems Data Management.

Performance Comparison of Speaker and Emotion Recognition

Chapter 2 What is Evidence?. Objectives Discuss the concept of “best available clinical evidence.” Describe the general content and procedural characteristics.

Objectives: Terminology Components The Design Cycle Resources: DHS Slides – Chapter 1 Glossary Java Applet URL:.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.

Lucent Technologies - Proprietary 1 Interactive Pattern Discovery with Mirage Mirage uses exploratory visualization, intuitive graphical operations to.

Lecture 1 Phonetics – the study of speech sounds

C - IT Acumens. COMIT Acumens. COM. To demonstrate the use of Neural Networks in the field of Character and Pattern Recognition by simulating a neural.

Acoustic Phonetics 3/14/00.

Spectral subtraction algorithm and optimize Wanfeng Zou 7/3/2014.

BIOMETRICS VOICE RECOGNITION. Meaning Bios : LifeMetron : Measure Bios : LifeMetron : Measure Biometrics are used to identify the input sample when compared.

CS 445/656 Computer & New Media

Computer Literacy BASICS

Multimedia: making it Work

E303 Part II The Context of Language Research

Homework 1 Hints.

Core Elements Engineering - Midrange

Actuaries Climate Index™

MATCH A Music Alignment Tool Chest

(Winter 2017) Instructor: Craig Duckett

Text-To-Speech System for English

Automation System For Checking Protein Prediction

Biometrics Reg: AMP/HNDIT/F/F/E/2013/067.

Chapter 2 Sociological Research Methods

Classification of modulation

IMPAIRED-USER INPUT SCENARIOS FOR KEYSTROKE BIOMETRIC AUTHENTICATION

OCR GCSE ICT Data capture methods.

OCR GCSE ICT Data capture methods.

Actuaries Climate Index™

November 8th, 2017 Matthew Davis and John Fink

Data Analysis in Particle Physics

Analyzing Language in a Speech: The Montgomery Bus Boycott Speech

N. Capp, E. Krome, I. Obeid and J. Picone

Introduction to Computer Programming

AD HOC Query (Report) Tool

Optimizing Efficiency + Funding

Weka Package Weka package is open source data mining software written in Java. Weka can be applied to your dataset from the GUI, the command line or called.

Reporting An In-Depth Guide.

2-1-1 Automated Verifications

David Cyphert CS 2310 – Software Engineering

Environmental Monitoring: Coupling Function Calculator

Audio and Speech Computers & New Media.

Hybrid Finger print recognition

Spreadsheets, Modelling & Databases

Using GOLD to Tracking L2 Development

Applied Linguistics Chapter Four: Corpus Linguistics

This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.

Anthor: Andreas Tsiartas, Prasanta Kumar Ghosh,

Name of Research Program: TRUST, SPRING 2011

The Internet and Electronic mail

Creating and Editing a Presentation

This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.

Wiley CPA Exam Review Practice Software 13.0

Auditory Morphing Weyni Clacken

Calibration Infrastructure Design

Presentation transcript:

Speech Database/Tool System And Preliminary Accent study. Dr. Charles Tappert a.k.a (Project Manager) Arthur Phidd, DPS a.k.a (The Client) Padmashree Thimmappa Shankar Vijayakumar Richard Sauther May 6th 2005 11/12/2018

Overview Create a tool & database for collecting speech samples and data mining processing. Ability to record new voice files. Upload these files on to the server. Retrieve these files when needed. Play the files. Analyze the files using Pronunciation Affinity Matrix (PAM) to determine the possible accent. Analyze the files for further research using available Speech Filing System (SFS) to decompose spectrograms into data elements for data mining. 11/12/2018

Specification of Spectrographic Tools Ability to perform spectral analysis of the speech signal. Segment a portion of signal from the background noise. Ability to view and store various voice data and functionality for research purposes. Spectrographic tool that provides access to the actual numerical data (e.g., the energy in a particular frequency band in a particular time interval) that can be processed later in an application. 11/12/2018

Human Computer Interaction User fills in the demographic information. He can upload his voice file. He can play back any voice file stored in the database as indicated by the drop down list. He can hear the voice file and choose values for certain key words in the voice file for various accents. The best chosen accent is recognized and displayed. Voice owner information is displayed for comparison. For further analysis of voice file, he can download the Speech Filing system installer, install it and run the voice file to get various types of spectrograms and other voice data. 11/12/2018

Choices are “Academic” or Screen shots Participation form Choices are “Academic” or “Natural”. 11/12/2018

Once submitted… 11/12/2018

Classification using Pronunciation Affinity Matrix (PAM) In the various Asian dialects “T” & “TH” are commonly Pronunced as “D” Vowel preceeding “ry” ending is typically dropped. The letter “V” is a “B” Pronunciation in Spanish 11/12/2018

Accent determined with reference to values chosen File upload index 11/12/2018

Analyze voice files using SFS 11/12/2018

Spectrogram generated using Speech Filing System. 11/12/2018

Actual Voice data retrieved from the spectrogram 11/12/2018

Smooth Fundamental Frequency track 11/12/2018

Noise Analysis data 11/12/2018

Users of this Application This application will be mainly used for experimental research in areas such as : 1.Speech Recognition and Accent determination. 2.Voice Biometric Studies. 3.Speaker Authentication applications. 11/12/2018

Research Next Steps Build up a sizeable corpus of voice samples across the four pronunciation nationalities in the PAM matrix. Identify examiners that come from the same cross section of nationalities found in PAM Perform more identification exams to validate the effectiveness of the selected words/phrase Create a data-mart of the numerical equivalent of the spectrograms of each voice sample in the corpus. Select a data mining classification algorithm to effectively classify the accents. (maybe focus on the correlation between energy levels, specific words, and accents or stress patterns and accents) 11/12/2018

Thank you! 11/12/2018