Step 6 - Creating Flat Start Monophones

Define Prototype Model

The first step in Hidden Markov Model ("HMM") training is defining a prototype model called "proto". The focus here is to create a model structure, the parameters are not important. Create a file called proto in your 'voxforge/tutorial' directory containng the following:

~o <VecSize> 25 <MFCC_0_D_N_Z>
~h "proto"
<BeginHMM>
<NumStates> 5
<State> 2
    <Mean> 25
      0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
    <Variance> 25
      1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 3
    <Mean> 25
      0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
    <Variance> 25
      1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 4
    <Mean> 25
      0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
    <Variance> 25
      1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<TransP> 5
0.0 1.0 0.0 0.0 0.0
0.0 0.6 0.4 0.0 0.0
0.0 0.0 0.6 0.4 0.0
0.0 0.0 0.0 0.7 0.3
0.0 0.0 0.0 0.0 0.0
<EndHMM>

For details of what all this means, see the HTK book.

You also need a configuration file. Create a file called config in your 'voxforge/tutorial' directory and containing the following data:

TARGETKIND = MFCC_0_D_N_Z
TARGETRATE = 100000.0
SAVECOMPRESSED = T
SAVEWITHCRC = T
WINDOWSIZE = 250000.0
USEHAMMING = T
PREEMCOEF = 0.97
NUMCHANS = 26
CEPLIFTER = 22
NUMCEPS = 12

Note: the target kind in you proto file (the "MFCC_0_D_N_Z" on the first line), needs to match the TARGETKIND in your config file.

You also need to tell HTK where all your feature vector files are located (those are the mfcc files you created in the last step). You do this with an HTK script file. Therefore, create a file called train.scp.

The next step is to create a new folder called hmm0.

Then create a new version of proto in the hmm0 folder - using the HTK HCompV tool as follows:

HCompV -A -D -T 1 -C config -f 0.01 -m -S train.scp -M hmm0 proto

This creates two files in the hmm0 folder:

Flat Start Monophones

create hmmdefs

Create a new file called hmmdefs in your 'voxforge/tutorial/hmm0' folder:
- Copy the monophones0 file to your hmm0 folder;
- rename the monophones0 file to hmmdefs;
For each phone in hmmdefs:

put the phone in double quotes;

add '~h ' before the phone (note the space after the '~h'); and

copy from line 5 onwards (i.e. starting from "<BEGINHMM>" to "<ENDHMM>") of the hmm0/proto file and paste it after each phone.

Leave one blank line at the end of your file.

This creates the hmmdefs file, which contains "flat start" monophones.

Create macros File

The final step in this section is to create the macros file.

A new file called macros should be created and stored in your 'voxforge/tutorial/hmm0' folder:

create a new file called macros in hmm0;
copy vFloors to macros
copy the first 3 lines of proto (from ~o to <DIAGC>) and add them to the top of the macros file

It should look something like this when you have finished:

~o
<STREAMINFO> 1 25
<VECSIZE> 25<NULLD><MFCC_D_N_Z_0><DIAGC>
~v varFloor1
<Variance> 25
6.580434e-01 3.732679e-01 3.525515e-01 4.770429e-01 4.332327e-01 4.544640e-01 5.620689e-01 2.553866e-01 4.001572e-01 3.416671e-01 2.128212e-01 2.660224e-01 1.668585e-02 1.700366e-02 1.616409e-02 1.768895e-02 1.718035e-02 2.098122e-02 2.326025e-02 1.677738e-02 2.010739e-02 1.595870e-02 1.417548e-02 1.510511e-02 1.447709e-02

Re-estimate Monophones

Next, create 9 new folders named consecutively in your 'voxforge/tutorial' folder: hmm1 to hmm9.

The Flat Start Monophones are re-estimated using the HERest tool. The purpose of this is to load all the models in the hmm0 folder (these are contained in the hmmdefs file), and re-estimate them using the MFCC files listed in the train.scp script, and create a new model set in hmm1. Execute the HERest command from your 'voxforge/tutorial' directory:

HERest -A -D -T 1 -C config -I phones0.mlf -t 250.0 150.0 1000.0 -S train.scp -H hmm0/macros -H hmm0/hmmdefs -M hmm1 monophones0

The files created by this command are:

This process is repeated 2 more times, creating new model sets in hmm2 and hmm3, respectively:

HERest -A -D -T 1 -C config -I phones0.mlf -t 250.0 150.0 1000.0 -S train.scp -H hmm1/macros -H hmm1/hmmdefs -M hmm2 monophones0

The files created by this command are:

HERest -A -D -T 1 -C config -I phones0.mlf -t 250.0 150.0 1000.0 -S train.scp -H hmm2/macros -H hmm2/hmmdefs -M hmm3 monophones0

The files created by this command are:

Comments

Error [+7321] CreateInsts: Unknown label α¼å+α¼áα¡ì

By prithviraj - 5/4/2018 Hi

Error in HERest

By prithviraj - 5/3/2018 Hi

Struck with ERROR [+6510] LOpen: Unable to open label file ../train/mfcc/sample1.lab

By sekarpdkt - 8/27/2017 - 7 Replies Hi I am struck here

Regarding creation of proto an vfloor

By Gururaj - 12/16/2016 - 1 Replies hi all..

ERROR [+7321]

By Nazik - 11/27/2016 - 3 Replies

problem in hmmdefs files(HErest)

By windclay - 1/22/2016 - 2 Replies I have trained data with 0 to 16 mixtures.

How to write proto file?

By visitor1 - 11/25/2015 - 1 Replies hi

proto file

By ibr - 8/7/2015 - 2 Replies if i have a different set of words than the ones in this example would the same included proto file works fine? if not what a good source to explain the meaning of vectors that composes the proto file, the PDF about HTK toolkit doesn't seems to give a proper explaination

HErest

By Negar - 1/10/2015 - 2 Replies Hello, I have run the first reestimation of the monophones at step6, without receiving error. Now in hmm1 I have the hmmdefs and macro. But the hmm1/hmmdefs is totally identical to what I have in hmm0/hmmdefs!

error 7060 2321

By marlena - 11/28/2013 Hi!

Cannot open Source File varFloor1

By Ankur Rana - 1/30/2013 - 1 Replies On executing the command, i am getting following error message:

Error [+7321] CreateInsts: Unknown label .

By Abdou - 11/27/2012 - 3 Replies Hi i'm in the 6th step and when i execute cammand HERest i got this error Error [+7321] CreateInsts: Unknown label . can u help plz?

ERREUR HERest

By Aghilas (ENP) - 4/25/2012 slt,

New prototype model

By bejimed - 3/2/2012 - 1 Replies The prototype model proposed in this tutorial contain 5 states , so what are the differents modification in order to have a new and specially the new transP matrice protype model contain more than 5 states ( exemple 7 states)

ERROR [+5021]

By Stewie - 11/22/2011 When you've faced this problem,

Discrete model

By Siwar - 11/11/2011 - 2 Replies hello

Error using HERest

By Gerald - 5/10/2011 - 6 Replies Hey,

error with HERest

By Visitor - 4/8/2011 - 2 Replies hi i followed the steps till here properly...but when i ran HERest then it gave me the following error..could anyone tell me what is wrong..also i have the same names for my wav files and the label files.. thank you..

OOT Flat start

By Visitor - 12/26/2010 - 1 Replies Hi ! i wanna create speech to text (again), with manual labelling and not using flat-start.

unable to use HcompV

By aspirant - 10/19/2010 - 1 Replies hi..

Error in Tutorial ***** D'HO!!!

By Carlo - 10/2/2010 Ok, I'm Italian student...

Error 7031

By novision - 6/14/2010 - 2 Replies

big problem

By mmm - 6/1/2010 - 5 Replies hi

6510 error

By Milos - 3/5/2010 - 1 Replies I have the same problem:

error with HERest need help

By LAROUI Ahmed Ridha - 3/5/2010 - 2 Replies hello every body,

ERROR [+6550] LoadHTKList: Label Name Expected FATAL ERROR

By SomeoneWhoNeedsUrgentHelp! - 1/26/2010 - 1 Replies Hi!, well the title says it all,

HERest error [+7036][+7060][+2321]

By Mathspeedy(Boutch) - 1/4/2010 - 12 Replies Hi!, i'm having a theses errors when trying the following:

error [+6210]

By puphe_88 - 11/28/2009 - 5 Replies i'm running htk in windows and i've a problem in HCompv

Incompatible sample kind

By khushami827 - 7/20/2009 - 1 Replies HVite -H model\hmm15\macros -H model \hmm15\hmmdefs -S trainlist\test.scp -l * -i recout.mlf -w def\net.slf -p 0.0 -s 5.0 def\dict tiedlist ERROR [+3231] ProcessFile: Incompatible sample kind MFCC_D_Z_0 vs MFCC_D_N_Z_ 0 FATAL ERROR - Terminating program HVite

ERROR [+5013] ReadString: String too long

By khushami827 - 7/20/2009 - 3 Replies HERest -A -D -T 1 -C config\config - I def\phones0.mlf -t 250.0 150.0 1000.0 -S trainlist\trainlist.scp -H model\hmm0 \macros -H model\hmm0\hmmdefs -M model\hmm1 def\monophones0

«Previous Page • 1 2 • Next Page»


Username	Password