VoxForge
Hello,
I am using HTK to do phone alignment in English.
I found out that padding audio files with true silence (with the pad option of Sox) at beginning and end of an audio file completely looses HTK although I use a word network with 'sil'.
This happens when using USEPOWER=T (Use power not magnitude in fbank analysis)
When setting USEPOWER to false, then the alignment is OK.
Does anyone have an explanation on this? (on the difference between energy and magnitude with tru silence) ?
Thanks in advance
> I found out that padding audio files with true silence (with the pad option of Sox) at beginning and end of an audio file completely looses HTK although I use a word network with 'sil'.