Gaussian Mixture-Sound Field Landmark Model for Robot Localization Talker: Prof. Jwu-Sheng Hu Department of Electrical and Control Engineering National.

Slides:

Advertisements

Similar presentations

Chunyi Peng, Guobin Shen, Yongguang Zhang, Yanlin Li, Kun Tan BeepBeep: A High Accuracy Acoustic Ranging System using COTS Mobile Devices.

Advertisements

Advanced Speech Enhancement in Noisy Environments

LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.

Microwave Imaging using Indirect Synthetic Reference Beam Holography

MODULATION SPECTRUM EQUALIZATION FOR ROBUST SPEECH RECOGNITION Source: Automatic Speech Recognition & Understanding, ASRU. IEEE Workshop on Author.

Speaker Adaptation for Vowel Classification

A New Household Security Robot System Based on Wireless Sensor Network Reporter :Wei-Qin Du.

Fig. 2 – Test results Personal Memory Assistant Facial Recognition System The facial identification system is divided into the following two components:

Advances in WP1 and WP2 Paris Meeting – 11 febr

Advisor: Prof. Tony Jebara

Cross Strait Quad-Regional Radio Science and Wireless Technology Conference, Vol. 2, p.p. 980 – 984, July 2011 Cross Strait Quad-Regional Radio Science.

Power Consumption Measurement and Clock Synchronization on Low-Power Wireless Sensor Networks Author : Yu-Ping Chen, Quincy Wu 1.

Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.

LE 460 L Acoustics and Experimental Phonetics L-13

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.

HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang Wen-Yi Chu Department of Computer Science & Information.

A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST

INTRODUCTION  Sibilant speech is aperiodic.  the fricatives /s/, / ʃ /, /z/ and / Ʒ / and the affricatives /t ʃ / and /d Ʒ /  we present a sibilant.

1 Location Estimation in ZigBee Network Based on Fingerprinting Department of Computer Science and Information Engineering National Cheng Kung University,

A Shaft Sensorless Control for PMSM Using Direct Neural Network Adaptive Observer Authors: Guo Qingding Luo Ruifu Wang Limei IEEE IECON 22 nd International.

SPECTRO-TEMPORAL POST-SMOOTHING IN NMF BASED SINGLE-CHANNEL SOURCE SEPARATION Emad M. Grais and Hakan Erdogan Sabanci University, Istanbul, Turkey  Single-channel.

بسم الله الرحمن الرحيم Prof. Dr. ADNAN AFFANDI Supervised by.

Speech Enhancement Using Spectral Subtraction

Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009.

REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.

Ekapol Chuangsuwanich and James Glass MIT Computer Science and Artificial Intelligence Laboratory,Cambridge, Massachusetts 02139,USA 2012/07/2 汪逸婷.

LOG-ENERGY DYNAMIC RANGE NORMALIZATON FOR ROBUST SPEECH RECOGNITION Weizhong Zhu and Douglas O’Shaughnessy INRS-EMT, University of Quebec Montreal, Quebec,

Experimental Results ■ Observations:  Overall detection accuracy increases as the length of observation window increases.  An observation window of 100.

Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.

1 Robust Endpoint Detection and Energy Normalization for Real-Time Speech and Speaker Recognition Qi Li, Senior Member, IEEE, Jinsong Zheng, Augustine.

Sensorless Control of the Permanent Magnet Synchronous Motor Using Neural Networks 1,2Department of Electrical and Electronic Engineering, Fırat University.

A New Cost Effective Sensorless Commutation Method for Brushless DC Motors Without Phase Shift Circuit and Neutral Voltage 南台科大電機系 Adviser : Ying-Shieh.

1 Blind Channel Identification and Equalization in Dense Wireless Sensor Networks with Distributed Transmissions Xiaohua (Edward) Li Department of Electrical.

Department of Electrical Engineering Southern Taiwan University of Science and Technology Robot and Servo Drive Lab. 2015/11/20 Simple position sensorless.

College of Engineering Anchor Nodes Placement for Effective Passive Localization Karthikeyan Pasupathy Major Advisor: Dr. Robert Akl Department of Computer.

USE OF IMPROVED FEATURE VECTORS IN SPECTRAL SUBTRACTION METHOD Emrah Besci, Semih Ergin, M.Bilginer Gülmezoğlu, Atalay Barkana Osmangazi University, Electrical.

Doppler Spread Estimation in Frequency Selective Rayleigh Channels for OFDM Systems Athanasios Doukas, Grigorios Kalivas University of Patras Department.

Robust Feature Extraction for Automatic Speech Recognition based on Data-driven and Physiologically-motivated Approaches Mark J. Harvilla1, Chanwoo Kim2.

Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.

Design of PCA and SVM based face recognition system for intelligent robots Department of Electrical Engineering, Southern Taiwan University, Tainan County,

Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,

Performance Comparison of Speaker and Emotion Recognition

Face Image-Based Gender Recognition Using Complex-Valued Neural Network Instructor :Dr. Dong-Chul Kim Indrani Gorripati.

Turning a Mobile Device into a Mouse in the Air

A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.

Benedikt Loesch and Bin Yang University of Stuttgart Chair of System Theory and Signal Processing International Workshop on Acoustic Echo and Noise Control,

Experimental Ranging With Mica2 Motes M. Allen, E. Gaura, R. Newman, S. Mount Cogent Computing, Coventry University The experimental work here makes use.

Detection of nerves in Ultrasound Images using edge detection techniques NIRANJAN TALLAPALLY.

A WIRELESS PASSIVE SENSOR FOR TEMPERATURE COMPENSATED REMOTE PH MONITORING IEEE SENSORS JOURNAL VOLUME 13, NO.6, JUNE 2013 WEN-TSAI SUNG, YAO-CHI HSU Ching-Hong.

Smartphone-based Wi-Fi Pedestrian-Tracking System Tolerating the RSS Variance Problem Yungeun Kim, Hyojeong Shin, and Hojung Cha Yonsei University Bing.

Research Methodology Proposal Prepared by: Norhasmizawati Ibrahim (813750)

Sensor-Assisted Wi-Fi Indoor Location System for Adapting to Environmental Dynamics Yi-Chao Chen, Ji-Rung Chiang, Hao-hua Chu, Polly Huang, and Arvin Wen.

An improved SVD-based watermarking scheme using human visual characteristics Chih-Chin Lai Department of Electrical Engineering, National University of.

Traffic State Detection Using Acoustics

Spectral and Temporal Modulation Features for Phonetic Recognition Stephen A. Zahorian, Hongbing Hu, Zhengqing Chen, Jiang Wu Department of Electrical.

Radio Coverage Prediction in Picocell Indoor Networks

Feature Mapping FOR SPEAKER Diarization IN NOisy conditions

Adnan Quadri & Dr. Naima Kaabouch Optimization Efficiency

Robust Data Hiding for MCLT Based Acoustic Data Transmission

Speech and Audio Processing

Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing

朝陽科技大學資訊工程系謝政勳 Application of GM(1,1) Model to Speech Enhancement and Voice Activity Detection 朝陽科技大學資訊工程系謝政勳

ELEG 3124 Signals and Systems

AUDIO SURVEILLANCE SYSTEMS: SUSPICIOUS SOUND RECOGNITION

A maximum likelihood estimation and training on the fly approach

Presented by Chen-Wei Liu

Presenter: Shih-Hsiang(士翔)

Real-time Uncertainty Output for MBES Systems

Combination of Feature and Channel Compensation (1/2)

Presentation transcript:

Gaussian Mixture-Sound Field Landmark Model for Robot Localization Talker: Prof. Jwu-Sheng Hu Department of Electrical and Control Engineering National Chiao-Tung University Hsinchu, Taiwan

Outline Introduction Overall System Architecture Robot Localization Methodology Architecture Gaussian Mixture-Sound Field Landmark Model (GM-SFLM) The Flow Chart of the Robot Localization System Experimental Condition Experimental Result Conclusion and Future Work

Introduction This investigation proposes a robust robot localization system. The system contains a novel Gaussian Mixture-Sound Field Landmark Model (GM-SFLM) and can localize the robot accurately in noisy indoor environments. Advantages: –The proposed method depends nothing on the geometry relation between source locations and two microphones. –It is able to cover both near-field and far-field problems. –It can overcome the microphone ’ s mismatch and the coherence problems. –The experiment demonstrates that when the robot is completely non-line-of-sight, this system still provides high detection accuracy. –High accuracy, low-cost, easy to implement and environmental adaptation. The GM-SFLM is realized into a quadruped robot system by using embedded Ethernet technology

Overall System Architecture The overall system contains a dog-like pet robot (named “ eRobot ” ) and a robot localization agent mounted in an arbitrarily indoor position Photo of eRobot Overall System Diagram

Photo of tiny network bridge module on the robot

The overall system including the PC-side remote control.

Robot Localization Methodology Architecture First stage: Pre-Recording Stage - The eRobot moves and barks in the location of interest when the environment is quiet to obtain the pre-recorded database. Second stage: Silent Stage - The environment noise is recorded and the characteristic of environment noise is collected. The GM-SFLM parameters are trained in this stage. Third stage: Barking Stage - The GM-SFLM is duplicated into the location detector to decide the robot ’ s location. Robot localization methodology architecture

where and are the weighting factors, and are the phase difference and magnitude ratio GMM relatively. Gaussian Mixture-Sound Field Landmark Model (GM-SFLM) An environmental sound field could be perceived by animals or human beings through the phase differences and magnitude ratio among sound receiving sensors. The employed characteristics are usually called interaural time difference (ITD) and interaural level difference (ILD). Both ITD and IID represent meaningful physical quantities for a sound field perception. The proposed GM-SFLM at each location is defined as the linear combination of the phase difference GMM and the magnitude ratio GMM.

The Flow Chart of the Robot Localization System

Experimental Condition The experiment was conducted by utilizing two microphones in an environment. The spacing of two microphones is chosen as 10 cm. There are 12 location blocks defined in the experiment. Considering the size of the eRobot, the radius of the each location blocks is selected as 20 cm. The environment is complexly and it contains a partition room.

Experimental Condition The experiment was performed in three different SNR conditionsThe SNR ranges of the three different cases are listed in the left Table. The received signals were sampled at 8 KHz, and the window for STFT (Short Time Fourier Transform) contained 256 zero padding samples and 32ms speech signals, totaling 512 samples. The processed frame and the overlapping condition is shown in the left Figure. The SNR ranges of the three different conditions A processed frame and overlapping condition

Experimental Result Experimental result in condition one using GM-SFLM Experimental result in condition two using GM-SFLM Experimental result in condition three using GM-SFLM

Conclusion and Future Work This work proves that the proposed method can capture the sound field characteristic to achieve a very high localization correct rate in noisy indoor environments. The accurate and robust experimental results indicate a promising direction of using the sound as a mean of localization where the devices are relatively inexpensive. However, several issues can be explored further, such as the relation of the proposed model to the acoustic scattering theory and 3D landmarks. These areas will be the work of continuing research of the authors.