Sean Powers Florida Institute of Technology ECE 5525 Final: Dr. Veton Kepuska Date: 07 December 2010 Controlling your household appliances through conversation.



Agenda: Problem Statement; How It Works; System Architecture; Future Works; Demonstration; Questions

Problem Statement: Imagine you leave your house in a hurry because you are running late for work, have to pick up your kids from school, or for any other reason, and you forget to turn off the stove. You remember three blocks away and do not have time to turn around. You dial a phone number that is assigned to your house and ask your house to turn the stove off for you. Your house confirms that the stove will be turned off, and you are relieved that you won't come home to a potential fire. This is the essence of the Phoning Home system.

How it works: Asterisk Server (VoIP); Phoning Home Services (WCF); Speech Recognition (CMU Sphinx); Client Services (WCF + MCU). Phoning Home is broken into four main components: the Voice over IP (VoIP) server, which forwards the caller's speech and uses text-to-speech (TTS) to speak to the user; the Phoning Home web services, which handle communication for all Phoning Home households; the speech recognition server, which is responsible for recognizing the user's phone speech; and the client services, which handle an individual household's devices.
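The division of labor among the four components can be pictured with a minimal Python sketch. Everything here is an assumption made for illustration: the class and function names (`ClientServices`, `PhoningHomeServices`, `recognize`, `handle_call`) are hypothetical, the "audio" is already text, and the real system uses Asterisk, WCF services, and CMU Sphinx rather than in-process Python objects.

```python
from dataclasses import dataclass, field


@dataclass
class ClientServices:
    """Hypothetical stand-in for one household's client services."""
    devices: dict = field(default_factory=dict)  # device name -> "on" / "off"

    def set_device(self, name: str, state: str) -> None:
        if name not in self.devices:
            raise KeyError(f"unknown device: {name}")
        self.devices[name] = state


def recognize(audio: str) -> list:
    """Placeholder for the speech recognition server.

    The 'audio' is already text here; a real system would decode speech.
    """
    return audio.lower().split()


class PhoningHomeServices:
    """Hypothetical stand-in for the web services shared by all households."""

    def __init__(self) -> None:
        self.households = {}  # dialed number -> ClientServices

    def register(self, number: str, client: ClientServices) -> None:
        self.households[number] = client

    def handle_call(self, dialed_number: str, audio: str) -> str:
        """Return the sentence the VoIP server would speak back via TTS."""
        client = self.households[dialed_number]
        words = recognize(audio)
        # Assumed command shape: "... <device> <on|off>".
        if len(words) < 2 or words[-1] not in ("on", "off"):
            return "Sorry, I did not understand."
        device, state = words[-2], words[-1]
        try:
            client.set_device(device, state)
        except KeyError:
            return f"I could not find the {device}."
        return f"The {device} will be turned {state}."
```

In this sketch the VoIP server's role is reduced to calling `handle_call` with the dialed number and the caller's words, then speaking the returned string.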

Answering incoming calls: Asterisk is an open source communications project that turns a computer into a voice communications server. Its features include the ability to answer incoming calls, generate outgoing calls, play and generate tones, integrate with web services, record calls, and provide call details such as caller identification.

Phone Server Application

Speech Recognition Application: Carnegie Mellon University (CMU) Sphinx-4 Speech Recognition Engine. The Sphinx-4 framework consists of three primary modules: the FrontEnd, the Decoder, and the Linguist. The FrontEnd takes input signals and parameterizes them into a sequence of features. The Linguist translates any type of standard language model, along with information from the Dictionary and structural information from one or more sets of acoustic models, into a SearchGraph. The Decoder uses the features from the FrontEnd and the SearchGraph from the Linguist to perform the actual decoding and produce the Results.

Sphinx 4

A Wireless Appliance Reducing Energy (AWARE), Machine B: Each device that can be controlled via the Phoning Home system is known as A Wireless Appliance Reducing Energy (AWARE) device. Every device is controlled wirelessly by the Phoning Home Master Control, which is an Atmel ATmega16 microcontroller connected via USB to the Client Services.

Phoning Home System Overview

Sequence Diagram

Issues: One issue I had to overcome when developing the demonstration was recognizing telephone speech. There are significant differences between microphone and telephone speech. From the Sphinx documentation: “The issue with telephone audio is that it has limited range of frequencies. Unlike usual microphone recording that includes frequencies from 1 Hz to 8000 [Hz], telephone audio is passed through frequency filters. As a result telephone audio contains frequencies from 200 Hz to 3500 Hz. That makes it impossible to recognize telephone audio with usual microphone acoustic model. You need to use specialized models to recognize it.” I ended up using the 8 kHz VoxForge acoustic model.
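The effect of the telephone channel described in the quotation can be illustrated with a small pure-Python sketch. It is not how Sphinx or Asterisk process audio: it simply zeroes every DFT bin outside the quoted 200 Hz to 3500 Hz band, using a naive O(n²) transform on a short test signal. The function names (`dft`, `band_limit`, `tone_magnitude`) and the idealized brick-wall filter are assumptions made for the example.

```python
import cmath
import math

# Pass band quoted from the Sphinx documentation.
TELEPHONE_BAND_HZ = (200.0, 3500.0)


def dft(samples):
    """Naive discrete Fourier transform (fine for short test signals)."""
    n = len(samples)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(samples))
        for k in range(n)
    ]


def band_limit(samples, sample_rate, band=TELEPHONE_BAND_HZ):
    """Zero all DFT bins outside `band` and return the filtered real signal."""
    n = len(samples)
    low, high = band
    spectrum = dft(samples)
    for k in range(n):
        # Bins above n/2 mirror the lower half (negative frequencies).
        freq = min(k, n - k) * sample_rate / n
        if not low <= freq <= high:
            spectrum[k] = 0j
    # Inverse DFT; the imaginary part is numerical noise for a real input.
    return [
        sum(c * cmath.exp(2j * math.pi * k * t / n) for k, c in enumerate(spectrum)).real / n
        for t in range(n)
    ]


def tone_magnitude(samples, sample_rate, freq_hz):
    """Normalized magnitude of the DFT bin nearest to `freq_hz`."""
    n = len(samples)
    k = round(freq_hz * n / sample_rate)
    return abs(dft(samples)[k]) / n
```

Applied to a 16 kHz signal containing a 1000 Hz tone and a 6000 Hz tone, `band_limit` keeps the first and removes the second, which is the kind of spectral mismatch a microphone-trained acoustic model would face. An 8 kHz model covers frequencies up to 4000 Hz, which is enough for the telephone band.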

Future Works: Easy collection and storage of multiple utterances, which can be used to improve acoustic models. The Asterisk server is capable of handling multiple calls simultaneously. The server stores and catalogs utterances automatically (database).

Future Works: Although Phoning Home is designed to let you call your house to control your appliances, it would be very useful to combine it with Dr. Kepuska's Wake-up-Word (WuW) technology so that you can control your appliances from inside the house as well. In a commercial product, Phoning Home would need extensive security measures to confirm that the person calling the house has the appropriate credentials to control its appliances. This could be as simple as a password or as complex as speaker recognition that confirms the caller is a permitted user.

Demonstration
Virtual Phone Number: 1 (321)

#JSGF V1.0;
/**
 * JSGF Digits Grammar for Phoning Home
 */
grammar digits;
public = room [number] ;
= [please | kindly | could you | oh mighty computer | operator];
= (turn | switch);
= (lights | lamps);
= (on | off);
= (oh | zero | one | two | three | four | five | six | seven | eight | nine);
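The rule names in the grammar above appear to have been lost, so the exact order of the rule references in the public rule is unknown. As a rough sketch of the kind of command such a grammar could accept, the Python matcher below uses the word lists from the grammar and assumes the order: optional polite prefix, action, `room [number]`, digit, device, state. That ordering, the constant names, and the `parse_command` function are all hypothetical; the sketch matches text, not audio, and is not the Sphinx-4 JSGF implementation.

```python
import re

# Word lists taken from the grammar in the slide.
POLITE = ["please", "kindly", "could you", "oh mighty computer", "operator"]
ACTION = ["turn", "switch"]
DEVICE = ["lights", "lamps"]
STATE = ["on", "off"]
DIGIT_VALUE = {
    "oh": 0, "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4,
    "five": 5, "six": 6, "seven": 7, "eight": 8, "nine": 9,
}


def _alt(words):
    """Build a non-capturing regex alternation from a list of phrases."""
    return "(?:" + "|".join(re.escape(w) for w in words) + ")"


# ASSUMED rule order (the original rule references are missing):
# [polite] action "room" ["number"] digit device state
COMMAND = re.compile(
    rf"^(?:{_alt(POLITE)}\s+)?"
    rf"(?P<action>{_alt(ACTION)})\s+"
    rf"room\s+(?:number\s+)?(?P<digit>{_alt(list(DIGIT_VALUE))})\s+"
    rf"(?P<device>{_alt(DEVICE)})\s+"
    rf"(?P<state>{_alt(STATE)})$"
)


def parse_command(utterance):
    """Return the parsed command as a dict, or None if it does not match."""
    text = " ".join(utterance.lower().split())  # normalize case and spacing
    match = COMMAND.match(text)
    if match is None:
        return None
    return {
        "action": match["action"],
        "room": DIGIT_VALUE[match["digit"]],
        "device": match["device"],
        "state": match["state"],
    }
```

Under these assumptions, a phrase such as "please turn room number three lights off" parses into an action, a room number, a device, and a state, while anything outside the grammar returns `None`.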

Questions