VoxForge
--- (Edited on 8/28/2007 10:44 am [GMT-0400] by kmaclean) ---
--- (Edited on 8/28/2007 10:44 am [GMT-0400] by kmaclean) ---
--- (Edited on 8/28/2007 10:45 am [GMT-0400] by kmaclean) ---
--- (Edited on 8/28/2007 10:45 am [GMT-0400] by kmaclean) ---
Hi Ken,
Sorry for the misunderstanding.
A premise for my
study is that I should use only tools non created by me, for this
reason I'm searching for a created language model, but I see that it
will not be possible.
I found a language model in this website for CMU Sphinx in ARPA format but I don't know if it could be used for julius or Julian and how.
I want to use Julius or Julian in order to recognise some
speech audio files, not from dictation and make a profiling during its
execution.
I could use the QuickStart but I would like to use an audio file as a input, not by microphone, it is possible?
Thank you again,
Anna
--- (Edited on 8/28/2007 10:45 am [GMT-0400] by kmaclean) ---
--- (Edited on 8/28/2007 10:46 am [GMT-0400] by kmaclean) ---
Thank you for all the information, Ken.
I will try with these links and I will let you know.
About publishing the thread I think it's a great idea! Please feel free to do it!
Best regards and thanks for your help!
Anna
--- (Edited on 8/28/2007 10:46 am [GMT-0400] by kmaclean) ---
This paper might be of interest to those trying to compare Sphinx and HTK:
A Comparison of Public-Domain Software Tools for Speech Recognition (2003)
K. Samudravijaya and Maria Barot
School of Technology and Computer Science, Tata Institute of Fundamental Research, Mumbai, India
HTK and Sphinx are two freely downloadable software packages with the capability of implementing a large vocabulary, speaker independent, continuous speech recognition system in any language. While HTK has been in use by various groups for about a decade, and has gone through the refinement cycles necessary for a commercial software, Sphinx was released about a year ago and is still undergoing development in a university environment. However, due to certain advanced features and the license for unrestricted use, Sphinx appears to be more attractive. These two software packages have been compared by implementing a Hindi speech recognition system. Although recognition accuracies of the two systems are comparable, we observe that the acoustic modeling of Sphinx is superior.
(my emphasis added).
Ken
--- (Edited on 9/19/2007 8:15 pm [GMT-0400] by kmaclean) ---
It probably was wow effect back in 2003 :) It's very hard to build good acoutic model in HTK due to complicated process and bad defaults but things like discriminative training (MMI/MPE), distributed training and complicated topologies as well as many many more features make HTK far more superior thank sphinx. Sphinx is easy to start with and drive but HTK can be really perfect only if used properly.
--- (Edited on 9/19/2007 11:06 pm [GMT-0500] by nsh) ---
Hi nsh!
I looking for a brief tutorial or step-by-step procedure to do discriminative training using MMI, MPE(MWE), MCE, etc; withing the HTK tutorial/demo in order to explain and understand more datailed that techniques.
thans a lot!
--- (Edited on 5/19/2010 2:44 pm [GMT-0500] by carlosdfresh) ---