VoxForge
Hi,
In this tutorial we use the flat start scheme for HMM initialization and parameter estimation. What's the difference if we had a labeled training data for example TIMIT database. In this case we need phones0.mlf or phones1.mlf ?
Best Regards,
Hamid.
Hi hamid,
>In this tutorial we use the flat start scheme for HMM initialization and
>parameter estimation. What's the difference if we had a labeled training
>data [...]
from the HTK Book:
8.3 Flat Starting with HCompV
One limitation of using HInit for the initialisation of sub-word models is that it requires labelled training data. For cases where this is not readily available, an alternative initialisation strategy is to make all models equal initially and move straight to embedded training using HERest. The idea behind this so-called flat start training is similar to the uniform segmentation strategy adopted by HInit since by making all states of all models equal, the first iteration of embedded training will effectively rely on a uniform segmentation of the data.
So I think if you want to use labelled training data, you need to us HInit, using the process described in section 8.2 (Initialisation using HInit) o the HTK book.