Project Progress Report. CSIE Year 4, B93902054 洪志豪; CSIE Year 4, B93902076 林宜鴻.


Flow chart

Language Model
1. Normalize the text (UTF-8) -> normalize_utf8
2. Convert from UTF-8 to Big5 -> iconv
3. Segment the text according to the dictionary -> textSegment
4. Build the language model -> ngram-count
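Step 2 above can be sketched as a small shell helper. This is only an illustration: the function name utf8_to_big5 and the file names in the usage note are hypothetical, and it assumes the local iconv supports the BIG5 encoding name.

```shell
# Convert a UTF-8 text file to Big5 (step 2 of the language model flow).
# $1 = input file in UTF-8, $2 = output file in Big5.
# Assumption: iconv is installed and knows the encoding name BIG5.
utf8_to_big5() {
    iconv -f UTF-8 -t BIG5 "$1" > "$2"
}
```

For example, utf8_to_big5 corpus_utf8.txt corpus_big5.txt would produce the Big5 file that is then passed to textSegment.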

Acoustic Model - HERest
1. Meaning: adjust (re-estimate) the HMM models
2. Command: HERest -C herest.cfg -S train.scp -L /share/data/TrainingLabel -H hmmset.mmf rcd.lst
3. Running it more times can give higher accuracy
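Running HERest several times in a row can be scripted. The sketch below wraps the command from the slide in a loop; the function name reestimate and the default of 3 passes are hypothetical choices, and it assumes that HERest, when no -M option is given, writes the updated hmmset.mmf back in place so that each pass starts from the previous result.

```shell
# Run HERest for a given number of passes (default 3, an arbitrary choice).
# Assumption: without -M, HERest overwrites hmmset.mmf in place,
# so every pass continues from the models produced by the previous one.
reestimate() {
    passes="${1:-3}"
    i=1
    while [ "$i" -le "$passes" ]; do
        echo "HERest pass $i of $passes"
        # Same arguments as the command on the slide; stop on the first failure.
        HERest -C herest.cfg -S train.scp -L /share/data/TrainingLabel \
               -H hmmset.mmf rcd.lst || return 1
        i=$((i + 1))
    done
}
```

Calling reestimate 5 would run five passes. Keeping a copy of hmmset.mmf before starting is a sensible precaution, since the file is assumed to be overwritten.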

Combine Acoustic and Language Model(1) Prerequisites
1. Convert bigram.lm from Big5 to ASCII
2. Command: /share/program/big52ascii/chenc bigram_Big5.lm > bigram_ascii.lm
3. Replace <s> with !ENTER and </s> with !EXIT
Useful vi commands for the replacement:
:1,$ s/<s>/!ENTER/g
:1,$ s/<\/s>/!EXIT/g
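The same replacement can also be done non-interactively with sed, which is convenient inside a script. This is a minimal sketch; the function name rename_sentence_markers and the output file name in the usage note are hypothetical.

```shell
# Rewrite the SRILM sentence markers to the HTK names.
# Reads a language model on stdin and writes the rewritten text to stdout.
rename_sentence_markers() {
    # First expression handles <s>, second handles </s> (slash escaped).
    sed -e 's/<s>/!ENTER/g' -e 's/<\/s>/!EXIT/g'
}
```

For example, rename_sentence_markers < bigram_ascii.lm > bigram_htk.lm writes the result to a new file and leaves the input unchanged.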

Combine Acoustic and Language Model(2) HBuild
1. Meaning: convert the SRILM format to the HTK network format
2. Command: HBuild -n bigram_ascii.lm htkword.dic wd_net
3. htkword.dic maps each word to its pronunciation. EX: [B7C2] k_e e
4. wd_net is the word network that we need. EX: I=9996 W=[A64C][C3D2] J= S=9551 E=2135 l=-7.52

Combine Acoustic and Language Model(3) HBuild(Cont.) - Questions
htkword.dic has no entry that corresponds to <unk>.
Solution: do not add the -u flag when training the language model. But is that appropriate?

Decode-HVite(1)
1. Meaning: decode the testing corpus
2. Command: HVite -t 250 -H hmmset.mmf -S testing.scp -C hvite.cfg -w wd_net -i result.mlf -p 0.0 -s 0.0 htkword.dic rcd.lst
3. The result is in result.mlf. EX: !ENTER [B37D]

Decode-HVite(2) - Questions
If we use HERest to train hmmset.mmf with phone.lst, it fails with an error: -> cannot load file n_o
Solution: use rcd.lst instead

Divide Mixtures by HHEd
But the hmmset.mmf given by the TA already has 32 mixtures for each state.
What config should be used?
Old DSP homework: the same config that HERest uses
My trial: it can run without a config

Initialize Acoustic Model
Two methods for two different conditions:
With labels: use HInit and HRest
Without labels: use HCompV
Both of them raise a similar question

Reference
1. HTK_Book.pdf
2. SpeechProject2007Winter.ppt
3. dsp_hw1.ppt (2006_spring_DSP)
4. HTK_Training.pdf
5. SpeechProj board on ptt2
