Presentation is loading. Please wait.

Presentation is loading. Please wait.

Speech Recognition System Jaime Díaz Raiza Muñiz.

Similar presentations


Presentation on theme: "Speech Recognition System Jaime Díaz Raiza Muñiz."— Presentation transcript:

1 Speech Recognition System Jaime Díaz Raiza Muñiz

2 System Overview Closed-Set Speaker ID system Two active states Speaker training Speaker ID Project partition Jaime – DSP (feature extraction & comparison). Raiza – control, memory & video output.

3 Block Diagram Extractor DistanceMemory Control Unit Register Display Feature_compa re Extracte d Features Extract Write Address Data_out Enable Done Distance Sync Add_User ID Reset (To all blocks) ID ActionReset_Reg 2 User Line_Count Pixel_Count RGB VGA VGA_Out Voice Reset Add_User

4 Extractor Block Processes ~ 3.5 sec audio Outputs 16 Spec. Coeff. Issue: number of samples Need to process small chunks Pipelining to reduce gates Customization  less portable Hamming Window DFT Mel Filters DCT AC’97 Log Voice Spectral Coefficients

5 Distance Block Compares Spec. Coef. (SC) Input vs Stored (Speech) Outputs a distance metric Comparison: Dynamic Time Warping Calc. Euclidean distance bet the SC of input vs stored for each time interval. Dist = Σ smallest dist in each TI row and column of the distance matrix.

6 -4 -3 -4 X7 X9 X7 X8 I8 I8 S7 S675776512889896 S9 -3 -2 -4 -------SSIIIXX- Distance Metric Calculation Example

7 Control Block Tells all other blocks what to do. Drives the direct user I/O interface ADD or ID user inputs. Video output Drives Memory Read/Write cycles Supplies Distance Block stored SC vectors.

8 Other Blocks Memory – store/read user SC as needed Register – tell Control requested action Video interface – feedback to the user

9 Thank You! Questions?


Download ppt "Speech Recognition System Jaime Díaz Raiza Muñiz."

Similar presentations


Ads by Google