Hierarchy of Design Voice Controlled Remote Voice Input Control Path Speech Processing IR Interface.

Slides:



Advertisements
Similar presentations
Chapter : Digital Modulation 4.2 : Digital Transmission
Advertisements

Analysis and Digital Implementation of the Talk Box Effect Yuan Chen Advisor: Professor Paul Cuff.
1. INTRODUCTION In order to transmit digital information over * bandpass channels, we have to transfer the information to a carrier wave of.appropriate.
IR Control Materials taken from a variety of sources including IR Remote for the Boe-Bot by Andy Lindsay.
5/4/2006BAE Analog to Digital (A/D) Conversion An overview of A/D techniques.
EET 2351 Lecture 2 Professor: Dr. Miguel Alonso Jr.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
Conversion Between Video Compression Protocols Performed by: Dmitry Sezganov, Vitaly Spector Instructor: Stas Lapchev, Artyom Borzin Cooperated with:
MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
(Voice Activated Home Control System). Project Summary Control any IR activated device –Via voice command Learnable –Learn IR Code –Learn Voice Command.
VAHCS Voice Activated Home Control System By: Kyle Joseph Troy Resetich Advisors: Dr. Malinowski Dr. Schertz.
Pulse Modulation CHAPTER 4 Part 3
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
331: STUDY DATA COMMUNICATIONS AND NETWORKS.  1. Discuss computer networks (5 hrs)  2. Discuss data communications (15 hrs)
Why to Apply Digital Transmission?
Universal Voice Activated Remote Control (UVARC) Thanh Phan Dat Le Mohammad Safaiezeab Brandon Wilgor Peter Ralston.
A/D Conversion No. 1  Seattle Pacific University Analog to Digital Conversion Based on Chapter 5 of William Stallings, Data and Computer Communication.
Fundamentals of Digital Communication
DIGITAL VOICE NETWORKS ECE 421E Tuesday, October 02, 2012.
ECE 4371, Fall, 2014 Introduction to Telecommunication Engineering/Telecommunication Laboratory Zhu Han Department of Electrical and Computer Engineering.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Time-Domain Methods for Speech Processing 虞台文. Contents Introduction Time-Dependent Processing of Speech Short-Time Energy and Average Magnitude Short-Time.
Digital Speech Transmission and Recovery. Overall System Output (speaker) Channel (coax cable) Receiver Circuit Input (microphone) Transmitter Circuit.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
Multimedia Specification Design and Production 2013 / Semester 2 / week 3 Lecturer: Dr. Nikos Gazepidis
Physical Layer Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF.
© Janice Regan, CMPT 128, CMPT 371 Data Communications and Networking Digital Encoding.
Concepts of Multiplexing Many input signals to one transmission media Reduces the number of channels or conductors running from point A to point B Added.
CE Digital Signal Processing Fall 1992 Waveform Coding Hossein Sameti Department of Computer Engineering Sharif University of Technology.
1 PCM & DPCM & DM. 2 Pulse-Code Modulation (PCM) : In PCM each sample of the signal is quantized to one of the amplitude levels, where B is the number.
: Data Communication and Computer Networks
British Computer Society (BCS)
Kashif BashirWWW.Taleem.greatnow.com Chapter 4 Digital Transmission.
Dan Lopez Dan Lopez Ben Rohner Ben Rohner Erin Loutzenhiser Erin Loutzenhiser.
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES
1 Speech Synthesis User friendly machine must have complete voice communication abilities Voice communication involves Speech synthesis Speech recognition.
IR Communication Materials taken from a variety of sources including IR Remote for the Boe-Bot by Andy Lindsay.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
IR Communication Materials taken from a variety of sources including IR Remote for the Boe-Bot by Andy Lindsay.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
Floyd, Digital Fundamentals, 10 th ed Digital Fundamentals Tenth Edition Floyd © 2008 Pearson Education Chapter 1.
Microphone Array Project ECE5525 – Speech Processing Robert Villmow 12/11/03.
Universal Voice Activated Remote Control (UVARC) Thanh Phan Dat Le Mohammad Safaiezeab Brandon Wilgor Peter Ralston.
◦ We sometimes need to digitize an analog signal ◦ To send human voice over a long distance, we need to digitize it, since digital signals are less prone.
Monitoring Volume Level Application - End of Project Presentation Made by: Roi Abecasis Maxim Meltsin Supervisor: Boaz Mizrahi.
4.2 Digital Transmission Pulse Modulation Pulse Code Modulation
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Chapter : Digital Modulation 4.2 : Digital Transmission
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
BEER BOT Dalton Verhagen. Sound Sensor Designed to find the direction a specified sound source is coming from Determines this with a time of arrival algorithm.
Power Analyzer Option History of Power Analysis Software  Strong history of power analysis going back to late 1990’s  Software evolved.
Toshiba IR Test Apparatus Project Ahmad Nazri Fadzal Zamir Izam Nurfazlina Kamaruddin Wan Othman.
Physical Layer (I) Data Encoding Techniques Advanced Computer Networks.
Fundamentals of Communications. Communication System Transmitter: originates the signal Receiver: receives transmitted signal after it travels over the.
Analog Communication Systems Amplitude Modulation By Dr. Eng. Omar Abdel-Gaber M. Aly Assistant Professor Electrical Engineering Department.
PULSE MODULATION.
COMPUTER NETWORKS and INTERNETS
Principios de Comunicaciones EL4005
Vocoders.
INTRODUCTION TO TELEPHONY BY : ITZIK CHOEN
IR Control Materials taken from a variety of sources including IR Remote for the Boe-Bot by Andy Lindsay.
PCM & DPCM & DM.
Microphone Array Project
UNIT 7: INFRARED SENSORS
UNIT 7: INFRARED SENSORS
Auditory Morphing Weyni Clacken
Presentation transcript:

Hierarchy of Design Voice Controlled Remote Voice Input Control Path Speech Processing IR Interface

Characteristics of Speech n Amplitude variations n Frequency variations n Continuous in frequency domain n Most of the energy is within 100Hz to 4kHz n Requires >8kHz sampling for intelligible speech

Our Speech Algorithm n Isolated word - cannot distinguish important areas in a stream of uninterrupted speech n “Small” vocabulary - in the zero to tens of words region - Up, Down, Power, Surf n Training Required - tells the device what the command sounds like n Speaker Dependent - re-training required for separate user

The Voice Input n Condenser microphone n Signal is amplified approximately 6000x n Sampling rate ~8 kHz n 8 bit linear conversion

Word Boundary Detection n Samples continuously n Has the threshold level been reached? n Begin analyzing the data n Is the threshold level being reached very often? n Stop analyzing the data

Zero Crossings n One transition from positive to negative or vice-versa n Algorithm to determine the frequency of the signal n Frequency inversely proportional to the period

Energy Analysis n The energy of the signal is the amplitude squared (Parseval’s theorem). n we used absolute value of amplitude. n Real-time calculation (as it is received).

The Recognition Process Compare the characteristics of the sample against Command1 Compare the characteristics of the sample against Command2 Compare the characteristics of the sample against Command3 Compare the characteristics of the sample against Command4 The command most similar to the recognized word. The command most similar to the recognized word. The command that was spoken

The Infra-Red Beam n Detects and stores codes for common Sony TVs n Utilizes blind copycat method of IR memory, no decoding occurs n Method easily modified to other IR protocols

General A/V IR coding schemes n 38-40kHz carrier at  940nm wavelength n Carrier output is gated by bit stream. n Most protocols use Pulse Width Modulation for bit encoding. – Logic ‘1’s coded as T (un-modulated) followed by 2T (modulated). Where T  550  s – Logic ‘0’s coded as T (un-modulated) followed by T (modulated). n Various bit lengths, start and end sequences.

Gated modulation to Carrier

Bit stream for “Power” command

Bit stream for “Channel up” command

Common North American IR code sequence

The Control Path n Implemented in two Moore state machines n Training/Initialization n Active/Recognition

The Surf Function n Start and stop the function with the utterance of the command SURF n Enables a three-second preview of each channel n Risk of developing carpal- tunnel syndrome decreases sharply!