VoxForge
Hi,
I'm interest in speech recognition. There are a lot of project of english language speech recognition and i want one of them adapt to lithuanian speech. I take an interest in this project, so would be exciting to consult with you.
In french ! I'm not a programmer also, but if you need any help, I can do it too, by contributing myself with my voice, and also by promoting contributions across french-speaking forums related to Open Source movement, etc.
Perhaps I have good news for all the non programmers out there. Setting up any new language actually requires quite a bit of work that can be done without any programming skills.
You can for instance figure out whether or not there exists a phonetic dictionary for your language. If so perhaps it's a good idea to start a new thread (called "language X") and post the link to the dictionary there.
If it doesn't exist, it needs to be made unfortunately!
What also needs to be made are prompt files containing non copyrighted short sentences such as a in the English prompt files. Preferably they should use modern spelling rules. Also it's nice if they contain all the sounds of your language. You can even records some if you want and store them yourself, ready to be submitted when VoxForge is ready for "language X" (but make sure the quality is okay, because we don't want you to waste your time).
Finally promotion is always good! If you have a blog for instance, you can write a tiny bit about VoxForge and post a link (if you know how, use interesting keywords such as "open source" "speech recognition" etc. - in your language). Your blog/forum post might bring someone else to Voxforge to help you out with your language. You never know.
Robin
Hi!
I am trying to figure out what is available to start putting together a Hungarian corpus, but I am not sure if I understand your terminus technicus.
* phonetic dictionary: a dictionary where every word [wurd] is written like this? Does this correspond to the pronunciation dictionary used in the tutorial or to the lexicon?
* prompts: are books allowed? is it necessary to segment them as the prompts file in the tutorial?
* what about audiobooks + text?
* licensing: what source material is allowed besides public domain? I presume GPL is fine, but what about CC (and which type of CC?), or MIT-like licences?
I am sure that once you answer me, I'll have some more question! :)
Cheers,
V
> phonetic dictionary: a dictionary where every word [wurd] is written like this? Does this correspond to the pronunciation dictionary used in the tutorial or to the lexicon?
yes
> prompts: are books allowed? is it necessary to segment them as the prompts file in the tutorial?
yes, it's necessary to segment them and it's the biggest problem with books
> what about audiobooks + text?
ok, but we prefere raw wav, not mp3
>licensing: what source material is allowed besides public domain? I presume GPL is fine, but what about CC (and which type of CC?), or MIT-like licences?
GPL is better, though any other free speech is also suitable to start. See the discussion on this forum.
Basically to start you can just record yourself (10 minutes) and a few your friends (5 x 10 minutes). About dictionary, you can build it with text2pho from
http://tkltrans.sourceforge.net/
Hi V,
one clarification:
> Does this correspond to the pronunciation dictionary used in the tutorial or to the lexicon?
The pronunciation dictionary used in the Tutorial and How-to is based on the ISIP Switchboard corpus (contains around 27,500 words). Whereas the QuickStart and nightly AM builds is based on version 0.6 of the CMU Pronunciation Dictionary (contains around 130,000 words). Unfortunately, the Switchboard and CMU pronunciation dictionaries use slightly different phoneme syntax. This is enough to make them incompatible from a Grammar and Acoustic Model testing perspective (see ticket #52).
Ken
Hi evryone.
I am very interested in starting a project to develop Swedish open source speech recognition. We have quite a lot of tools and corpora, but unfortunately not much time to handle it at the moment. Is there a time plan for when there will be possibilities for people to donate their recordings etc in other languages?If I (for example) arrange a phone number etc and take care of the recordings and so on, would it then be possible to admit information and resources from these web pages for such a project? I am very interested in creating speaker recognition for Swedish as well.
Best regards
Jonas
Gothenburg University, Sweden