A Study of Sparse Non-negative Matrix Factor 2-D Deconvolution Combined With Mask Application for Blind Source Separation of Frog Species 1 Reporter :

Slides:



Advertisements
Similar presentations
Independent Component Analysis
Advertisements

EE645: Independent Component Analysis
Color Imaging Analysis of Spatio-chromatic Decorrelation for Colour Image Reconstruction Mark S. Drew and Steven Bergner
From Single Channel and Two-Channel Data 32nd Annual International Conference of the IEEE EMBS Combining EMD with ICA for Extracting Independent Sources.
Franz de Leon, Kirk Martinez Web and Internet Science Group  School of Electronics and Computer Science  University of Southampton {fadl1d09,
Digital Audio Processing Lab, Dept. of EEThursday, June 17 th Data-Adaptive Source Separation for Audio Spatialization Supervisors: Prof. Preeti Rao and.
Nonsmooth Nonnegative Matrix Factorization (nsNMF) Alberto Pascual-Montano, Member, IEEE, J.M. Carazo, Senior Member, IEEE, Kieko Kochi, Dietrich Lehmann,
Reporter : Chia-Cheng Chen Advisor : Wen-Ping Chen 1 Network Application Laboratory Department of Electrical Engineering National Kaohsiung University.
REAL-TIME INDEPENDENT COMPONENT ANALYSIS IMPLEMENTATION AND APPLICATIONS By MARCOS DE AZAMBUJA TURQUETI FERMILAB May RTC 2010.
Volkan Cevher, Marco F. Duarte, and Richard G. Baraniuk European Signal Processing Conference 2008.
Fifth International Conference on Information
Independent Component Analysis (ICA)
Page 0 of 8 Time Series Classification – phoneme recognition in reconstructed phase space Sanjay Patil Intelligent Electronics Systems Human and Systems.
Application of Statistical Techniques to Neural Data Analysis Aniket Kaloti 03/07/2006.
Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,
Subband-based Independent Component Analysis Y. Qi, P.S. Krishnaprasad, and S.A. Shamma ECE Department University of Maryland, College Park.
ICA-based Blind and Group-Blind Multiuser Detection.
Mutual Information for Image Registration and Feature Selection
Independent Component Analysis (ICA) and Factor Analysis (FA)
DIGITAL SIGNAL PROCESSING IN ANALYSIS OF BIOMEDICAL IMAGES Prof. Aleš Procházka Institute of Chemical Technology in Prague Department of Computing and.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Survey on ICA Technical Report, Aapo Hyvärinen, 1999.
Entropy and some applications in image processing Neucimar J. Leite Institute of Computing
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIXFACTORIZATION AND SPECTRAL MASKS Jain-De,Lee Emad M. GraisHakan Erdogan 17 th International.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
ERP DATA ACQUISITION & PREPROCESSING EEG Acquisition: 256 scalp sites; vertex recording reference (Geodesic Sensor Net)..01 Hz to 100 Hz analogue filter;
Detection and Segmentation of Bird Song in Noisy Environments
畳み込み非負値行列因子分解を用いた 音声パターンの教師無し学習と音素分類 Unsupervised learning of speech patterns and phone classifi- cation using Convolutive Non-negative Matrix Factorization.
-1- ICA Based Blind Adaptive MAI Suppression in DS-CDMA Systems Malay Gupta and Balu Santhanam SPCOM Laboratory Department of E.C.E. The University of.
SPECTRO-TEMPORAL POST-SMOOTHING IN NMF BASED SINGLE-CHANNEL SOURCE SEPARATION Emad M. Grais and Hakan Erdogan Sabanci University, Istanbul, Turkey  Single-channel.
Independent Component Analysis on Images Instructor: Dr. Longin Jan Latecki Presented by: Bo Han.
Heart Sound Background Noise Removal Haim Appleboim Biomedical Seminar February 2007.
College of Engineering WiFi and WCDMA Network Design Robert Akl, D.Sc. Department of Computer Science and Engineering Robert Akl, D.Sc. Department of Computer.
Implementing a Speech Recognition System on a GPU using CUDA
Hongyan Li, Huakui Wang, Baojin Xiao College of Information Engineering of Taiyuan University of Technology 8th International Conference on Signal Processing.
Independent Component Analysis Zhen Wei, Li Jin, Yuxue Jin Department of Statistics Stanford University An Introduction.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 1) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Blind Separation of Speech Mixtures Vaninirappuputhenpurayil Gopalan REJU School of Electrical and Electronic Engineering Nanyang Technological University.
May 3 rd, 2010 Update Outline Monday, May 3 rd 2  Audio spatialization  Performance evaluation (source separation)  Source separation  System overview.
Survey of ICASSP 2013 section: feature for robust automatic speech recognition Repoter: Yi-Ting Wang 2013/06/19.
Local Non-Negative Matrix Factorization as a Visual Representation Tao Feng, Stan Z. Li, Heung-Yeung Shum, HongJiang Zhang 2002 IEEE Presenter : 張庭豪.
MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Basics of Neural Networks Neural Network Topologies.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
Basic Concepts of Audio Watermarking. Selection of Different Approaches Embedding Domain  time domain  frequency domain DFT, DCT, etc. Modulation Method.
A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds Paris Smaragdis, Madhusudana Shashanka, Bhiksha Raj NIPS 2009.
1 Blind Channel Identification and Equalization in Dense Wireless Sensor Networks with Distributed Transmissions Xiaohua (Edward) Li Department of Electrical.
A DISTRIBUTION BASED VIDEO REPRESENTATION FOR HUMAN ACTION RECOGNITION Yan Song, Sheng Tang, Yan-Tao Zheng, Tat-Seng Chua, Yongdong Zhang, Shouxun Lin.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 2) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Full-rank Gaussian modeling of convolutive audio mixtures applied to source separation Ngoc Q. K. Duong, Supervisor: R. Gribonval and E. Vincent METISS.
Non-negative Matrix Factor Deconvolution; Extracation of Multiple Sound Sources from Monophonic Inputs International Symposium on Independent Component.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 3) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Independent Component Analysis Independent Component Analysis.
Analysis of Traction System Time-Varying Signals using ESPRIT Subspace Spectrum Estimation Method Z. Leonowicz, T. Lobos
Introduction to Independent Component Analysis Math 285 project Fall 2015 Jingmei Lu Xixi Lu 12/10/2015.
An Introduction of Independent Component Analysis (ICA) Xiaoling Wang Jan. 28, 2003.
MINUET Musical Interference Unmixing Estimation Technique Scott Rickard, Conor Fearon Department of Electronic & Electrical Engineering University College.
Spatial Covariance Models For Under- Determined Reverberant Audio Source Separation N. Duong, E. Vincent and R. Gribonval METISS project team, IRISA/INRIA,
Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs C.G. Puntonet and A. Prieto (Eds.): ICA 2004 Presenter.
DISPLACED PHASE CENTER ANTENNA SAR IMAGING BASED ON COMPRESSED SENSING Yueguan Lin 1,2,3, Bingchen Zhang 1,2, Wen Hong 1,2 and Yirong Wu 1,2 1 National.
Advisor: Chang, Chin-Chen Student: Chen, Chang-Chu
Blind Extraction of Nonstationary Signal with Four Order Correlation Kurtosis Deconvolution Name: Chong Shan Affiliation: School of Electrical and Information.
Spoken Digit Recognition
Brain Electrophysiological Signal Processing: Preprocessing
Basic Concepts of Audio Watermarking
Bayesian Nonparametric Matrix Factorization for Recorded Music
INTRODUCTION TO THE SHORT-TIME FOURIER TRANSFORM (STFT)
Emad M. Grais Hakan Erdogan
Presentation transcript:

A Study of Sparse Non-negative Matrix Factor 2-D Deconvolution Combined With Mask Application for Blind Source Separation of Frog Species 1 Reporter : Jain-De Lee Advisor : Wen-Ping Chen Department of Electrical Engineering National Kaohsiung University of Applied Sciences Network Application Laboratory

Outline Introduction and Motivation Introduction and Motivation Background Background Research Method Research Method Experiment Results Experiment Results Conclusion and Future Works Conclusion and Future Works Research Results Research Results 2

Introduction Current Technology of the Ecological Survey ◦ Sensor networks ◦ Wireless network Advantage ◦ Reduce the cost of human resource and time ◦ Save and share the raw data conveniently Disadvantage ◦ Large amount of raw data needs to be analyzed Voiceprint Recognition System ◦ Analyze raw data fast 3

Introduction 4

Introduction Blind Source Separation ◦ Cocktail party problem 5

Introduction Independent Subspace Analysis : ◦ M. A. Casey and A. Westner[2000]  Proceedings of the International Computer Music Conference ◦ Md. K. I. Molla and K. Hirose[2007]  IEEE Transactions on Audio, Speech, and Language Processing Wiener Filter : ◦ L. Bonaroya and F. Bimbot[2003]  International Symposium on Independent Component Analysis and Blind Signal Separation ◦ E. M. Grais and H. Erdogan[2011]  IEEE Digital Signal Processing, Sedona, Arizona Non-negative Matrix Factorization: ◦ P. Smaragdis[2004]  International Symposium on Independent Component Analysis and Blind Source Separation 6

Introduction Independent Component Analysis Combined with Other Methods : ◦ J. Lin and A. Zhang[2005]  NDT & E International ◦ M. E. Davies and C. J. James[2007]  Signal Processing ◦ X. Cheng, N. Li, Y. Cheng and Z. Chen[2007]  International Conference on Bioinformatics and Biomedical Engineering ◦ B. Mijović, M. D. Vos, I. Gligorijević, J. Taelman and S. V. Huffel[2010]  IEEE Transactions on Biomedical Engineering 7

Motivation Single Channel Blind Source Separation Preprocessing of Voiceprint Recognition System Improve Quality of Separated Signals 8

Background Post-processing Reconstruct Signal Blind Source Separation ICA 、 NMF Pre-processing Whitening 、 T-F Representation 9

Background Independent Component Analysis ◦ Looking for components of statistically independent from observational signals and estimating de-mixing matrix ◦ Constraint conditions  The components are statistically independent  At most one gaussian source is allowed  At least as many sensor responses as source signals ◦ Processing steps  Pre-processing  Centering  Whitening  Measurement of non-Gaussian component 10

Background Measurement of Non-Gaussian Component ◦ Kurtosis ◦ Mutual Information ◦ Neg-entropy Random Variable ykurt(y) Gaussiankurt(y) = 0 Non-Gaussian Super-Gaussiankurt(y) > 0 Sub-Gaussiankurt(y) < 0 11 J(Y): Neg-Entropy H(Y gauss ): Entropy of Gaussian Distribution H(Y): Entropy of Random Variable

12 Background Non-negative Matrix Factorization

Background ◦ Cost function  Based on Euclidean Distance  Based on Kullback–Leibler Divergence 13 V : Original Signal : Reconstructed Signal

Background 14

Background Sparse Non-negative Matrix Factor 2-D Deconvolution (SNMF2D) ◦ Obtain temporal structure and the pitch change ◦ Control the sparse degree of non-negative matrix factorization Non-negative Matrix Factor 2-D Deconvolution ◦ τ basis matrix and φ coefficient matrix ◦ Shift operator Sparse Coding ◦ Take a few units to represent the data effectively ◦ Parts-based representations 15

Background 16 0

Background Sparse Non-negative Matrix Factor 2-D Deconvolution ◦ Cost function  Based on Euclidean Distance  Based on Kullback–Leibler Divergence λ:Sparse Factor f():Sparse Function 17

Background 18

Research Method 19

Research Method Pre-processing ◦ Time-domain signal converses to time-frequency signal  Analysis windows  Window function  Signal conversion 20

研究方法 21

研究方法 Reconstructed Signal of Latouche's Frog Reconstructed Signal of Sauter's Brown Frog 22

Mask Correction 23

Mask Correction Binary Mask ◦ The reconstructed signal converses to binary mask ◦ Find a suitable threshold T M(x,y): Binary Mask G(x,y): Reconstructed Signal 24

Mask Correction Otsu Method ◦ Create a histogram Element Number 25

Mask Correction TTTTTT L Element 26

Mask Correction 27

Mask Correction Signal Extraction V(x,y):Original Mixed-Signal S(x,y): Extraction of Signal 28

Mask Correction 29

Mask Correction Find a Ratio of Mixture Components , G T (x,y): Sum of reconstructed signals G i (x,y): Reconstructed Signal R i (x,y): Ratio of mixture N: Total Numbers of reconstructed Signals 30

Mask Correction Signal correction , : Revised Signals : Extraction of Signals 31

Mask Correction Signal correction , : Corrected Signals 32

Mask Correction 33

34

Post-processing Phase Information IDFT Window Function 35

Experiment Results Parameter ItemsParameter Value STFT Window Size512 samples Window Overlapping50% Window FunctionHamming Window Frequency Bin512 SNMF2D Basis Matrix[1…3] Coefficient Matrix[1…5] Sparse Factor5 Frog Species8 Mixtrue Items7 36

Experiment Results 37 Performance Measurement—SDR(Signal-to-Distortion Ratio)

38

Experiment Results 39 Performance Measurement — SIR(Source-to-Interference Ratio)

40

Experiment Results MethodIterationsVariance SNMF2D SNMF2D+MASK

Experiment Results Parameter ItemsParameter Value Frame Length512 samples Frame Overlapping50% Window FunctionHamming Window Frequency Bin512 Feature ParametersMel-Frequency Cepstral Coefficient Feature Dimensions15D Test Syllable410 42

Experiment Results Recognition Experiment MethodIterations Total Syllable Correct Syllable Accuracy(%) SNMF2D SNMF2D+MASK

Conclusion and Future Works The proposed method ◦ Improve the quality of separated signals effectively ◦ Use less time to improve the quality of separated signals ◦ Enhance the recognition rate of separated signals, and the average recognition rate can be improved 29.84% 44

Conclusion and Future Works Future Works ◦ Study of de-noise methods ◦ Determine the numbers of species of raw data ◦ Study of the initial value setting ◦ Collect various sound of species. Then, Improve the recognition rate 45

Research Results 46 Competition ◦ 第七屆數位訊號處理創思設計競賽 — 入圍 Patent ◦ 蛙聲混音分離方法 — 審查中

47 Thank you for your attention !!