VoxForge
HTK toolkit cannot process your prompts.txt file directly. You have two options, you can create a separate 'label' file for each line your prompts.txt file in the following format:
*/sample1 |
Or you can create a Master Label File (MLF) - which is a single file that contains a label entry for each line in your prompts.txt file. This is the easiest approach, and the one we will use for this tutorial.
Download the Julia script prompts2mlf.jl to your voxforge/bin directory to generate the mlf file from your prompts.txt file. Execute the prompts2mlf script from your 'voxforge/tutporial' folder as follows:
julia ../bin/prompts2mlf.jl prompts.txt words.mlf |
This script generates a words.mlf file.
Next you need to execute the HLEd command to expand the Word Level Transcriptions to Phone Level Transcriptions - i.e. replace each word with its phonemes, and put the result in a new Phone Level Master Label File This is done by reviewing each word in the MLF file, and looking up the phones that make up that word in the dict file you created earlier, and outputing the result in a file called phones0.mlf (which will not have short pauses ("sp"s) after each word phone group).
First, create the mkphones0.led edit script in your
'voxforge/tutorial' folder:
EX |
(note: remember to include a blank line at the end of this script)
Then execute the following HLEd command from your
'voxforge/tutorial' folder:
Linux:
$HLEd -A -D -T 1 -l '*' -d dict -i phones0.mlf mkphones0.led words.mlf |
Windows:
C:>HLEd -A -D -T 1 -l * -d dict -i phones0.mlf mkphones0.led words.mlf |
Which
creates the phones0.mlf
file
Next, we need to create a second phones1.mlf file (which will include short pauses
(“sp”) after each word phone group). First create the
mkphones1.led in your
'voxforge/tutorial' folder as follows:
EX |
(note: remember to include a blank line at the end of this script)
Then run the HLEd command again from your 'voxforge/tutorial' folder as follows:
Linux:
$HLEd -A -D -T 1 -l '*' -d dict -i phones1.mlf mkphones1.led words.mlf |
Windows:
C:>HLEd -A -D -T 1 -l * -d dict -i phones1.mlf mkphones1.led words.mlf |
Which creates the phones1.mlf file.