Using Commonsense Reasoning to Improve Voice Recognition.

Slides:



Advertisements
Similar presentations
Speech-to-Text Technology on Mac OS X Computer Access for Individuals with Disabilities.
Advertisements

Read&Write (v7) GOLD A literacy aid for everyone with learning disabilities / difficulties.
Read&Write GOLD Jackie Prentice. Objectives GOLD See the key features of Read&Write GOLD in order to familiarise yourself with the functionality of the.
Introduction to Computational Linguistics
Google Web Speech API Implementation Case Study: English Skill Online Practice Prajaks Jitngernmadan Faculty of Informatics, Burapha University.
WSCD INTRODUCTION  Query suggestion has often been described as the process of making a user query resemble more closely the documents it is expected.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Voice-enabled Image Identification System Design Aashish P. Shrestha Ming Ming Zheng Multimedia Signal Processing, University of Bridgeport, Connecticut.
R EAD & W RITE G OLD : T EXT H ELP S YSTEMS I NC.: T EXT TO S PEECH S OFTWARE By: Ashley, Kathryn, Rine, and Samantha.
Multimedia in Presentations Justin Gray COMM 165.
“ Walk to here ” : A Voice Driven Animation System SCA 2006 Zhijin Wang and Michiel van de Panne.
1 Subsea Reliability and Integrity Management Reliability and Integrity Management – Have very similar intent - almost the same definition – Treated separately.
Natural Language Processing Ellen Back, LIS489, Spring 2015.
User Interfaces. User Interface What do we mean by a user interface? The user is the person who is using the computer. A user interface is what he or.
1 EmuPlayer Music Recommendation System Based on User Emotion Using Vital-sensor KMSF- sunny 親: namachan さん.
Describe the purpose, components, and use of speech recognition systems.
Setup Guide for Win 7 Speech Recognition 6/30/2014 Debbie Hebert, PT, ATP Central AT Services.
Common Sense Computing MIT Media Lab Interaction Challenges for Agents with Common Sense Henry Lieberman MIT Media Lab Cambridge, Mass. USA
Speech Recognition. My computer doesn’t understand me……….. Software is now mainstream Many people use it within office/home setting for inputting text.
Artificial Intelligence and Virtual Librarianship David Bennett Robert Morris University Library National Forum 2003.
T raining on Read&Write GOLD Dick Powers
Center for Human Computer Communication Department of Computer Science, OG I 1 Designing Robust Multimodal Systems for Diverse Users and Mobile Environments.
--Caesar Cat.  Write an optical character recognition application that identifies and recognizes printed text within an image.
Interaction Design Session 12 LBSC 790 / INFM 718B Building the Human-Computer Interface.
Computers and Disability Case Study IB Computer Science II Paul Bui.
Train the system and input simple documents using speech writing techniques.
Using Speech Recognition Copyright 2006 South-Western/Thomson Learning.
1 These courseware materials are to be used in conjunction with Software Engineering: A Practitioner’s Approach, 5/e and are provided with permission by.
Voice Recognition (Presentation 2) By: Priya Devi A. S/W Developer, Xsys technologies Bangalore.
NoteSearch - Find what you’re looking for. Prototype Team B.
Microsoft Assistive Technology Products Brought to you by... Jill Hartman.
1 These courseware materials are to be used in conjunction with Software Engineering: A Practitioner’s Approach, 5/e and are provided with permission by.
Speech Recognition MIT SMA 5508 Spring 2004 Larry Rudolph (MIT)
CMAS Assessment Interface Training for Teachers Lake County School District Spring 2014.
Using Running Records to Inform Instruction
Objective 3.02 Train the system and input simple documents using speech writing techniques.
Using Google's Web Speech API with Moodle for language learning tasks
ANALYZING READING BELIEFS By: Dereque Falls. Reading Methods  Evaluating Reading Beliefs- An Interview  Breaking Down Written Language  Continuum of.
1/10 Problem Frame Analysis Eunyoung Cho Kyu Hou Minho Jeung Heejoon Jung Oct. 25, 2005.
DATA REFLECTION: Providing Generally Effective Instruction Oregon Reading First Cohort B Project Level Data Erin Chaparro, Ph.D. Jean Louise Mercier Smith,
Robotic Assistance. The PROBLEM Providing assistance for the Blind –What do we mean by “Blind?” Stereotypical blindness Visually impaired What assistance.
Graphical User Interface on Analysis of Mechanics and Dynamics of Biopolymers in Living Cells Peter Russel, Biomedical Engineering Shubham Agrawal, Computer.
LISTENING, SPEAKING, & PRONUNCIATION ESL 911. TEACHER Sharon Not ‘Sharon’
EDU 620: Meeting Individual Student Needs With Technology Dr. Moerland.
Free English Grammar Check Tool? -Golden Advice!
1 Speech Recognition. 2 Introduction What is Speech Recognition? - Voice Recognition? Where can it be used? - Dictation - System control/navigation -
CS 397 Review of Presentations 1.Presentation 2.Depth a.Slides b.Delivery.
Test1 Here some text. Text 2 More text.
4.0.2 Microsoft Word Screen Components Quiz
1888 PressRelease - Ghotit is happy to announce the release of Ghotit Real Writer & Reader 6 for Windows.
Speech Recognition There are different kinds of voice or speech "engines" that take the sounds of your voice and match it with words. The engine is software.
Human – Computer Communication
and the Techniques Required
and the Techniques Required
Speech Recognition There are different kinds of voice or speech "engines" that take the sounds of your voice and match it with words. The engine is software.
Retrieval of audio testimonials via voice search
إستراتيجيات ونماذج التقويم
[type text here] [type text here] [type text here] [type text here]
Your text here Your text here Your text here Your text here Your text here Pooky.Pandas.
David Cyphert CS 2310 – Software Engineering
1/2/2019 9:19 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
Your text here Your text here Your text here Your text here
[type text here] [type text here] [type text here] [type text here]
Integration: Definite Integration
HoloSync: Exploring Discoverable Conversational Interfaces for Model State Control ALI SIDDIQUI.
Microsoft Cognitive Services with Power BI
2.) How are each of these illustrations related to one another?
Order of Operations (BIDMAS): Inserting brackets
Spontaneous Voice Driven Interaction with Avatars: Discriminating Alerting and Referential Contexts of Sentinel Words The Problem The Solution e-WUW:
WordPress Unit Web Coordinators
Presentation transcript:

Using Commonsense Reasoning to Improve Voice Recognition

Speech Recognition Disambiguate phonetically similar words Improve error correction interfaces Improve overall recognition accuracy By integrating the Microsoft Speech API and ConceptNet, we hope to:

Speech Recognition Here is how our prototype is going to work: I was at the school the other day. I walked up to the (text being dictated appears here)

Speech Recognition User Speaks: I was at the school the other day. I walked up to the principal

Speech Recognition Speech is Processed: I was at the school the other day. I walked up to the principle principal the principle of of the principal Voice recognition engine suggestions

Speech Recognition The Context is analyzed using ConceptNet I was at the school the other day. I walked up to the principle principal the principle of of the principal chalkboard textbook teacher principal Words in ConceptNet for “school”

Speech Recognition Options are calculated and ranked: I was at the school the other day. I walked up to the principal principle principal the principle of of the principal chalkboard textbook teacher principal principle the principle of of the principal The voice recognition input is intelligently sorted by how much each word makes sense, contextually