VoxForge
Hi Ken,
I am trying to upload the Sergey-20080422.tgz file into the voxforge1.org FTP server but I always get "Login incorrect" error. I tryed to use username "u41386649-vfpublic" with password "vfpublic" and my own username and password.
What might be wrong?
Thanks a lot,
--Sergey
--- (Edited on 4/22/2008 9:11 am [GMT-0500] by Sergey) ---
Hi Sergey,
If your file is less than 50 meg, you can also upload it to this forum (I just updated this forum to accept attachments). Anything greater than 50 meg will not upload ... unfortunately, the CMS I use does not tell you that it has rejected a submission if it is too large.
Ken
--- (Edited on 4/22/2008 5:32 pm [GMT-0400] by kmaclean) ---
Hi Sergey,
Was the audio for your submission originally in uncompressed WAV format (or a lossless compressed format like FLAC, etc.), or did you convert it from MP3 (or any other lossy compressed format like OGG, etc.) to wav?
Ken
--- (Edited on 4/27/2008 10:24 pm [GMT-0400] by kmaclean) ---
Hi Sergey,
Although we prefer the source audio be uncompressed (i.e. WAV) or lossless compressed (i.e. FLAC) audio formats (these give a bit better recognition rates - see this page for some preliminary analysis), I will add your submission to the VoxForge repository, but label it as coming from an MP3 source.
We've got lots of uncompressed LibriVox audio chapters that need processing located here:
The directory needs to be cleaned up, but if you are interested in segmenting more audio, please let me know. Check the AudioBook Submission Notification Forum to see if a submission has already been processed.
Once we have the Sequitur G2P models trained with the VoxForge Pronunciation dictionary, we should be able to automate most of the segmentation process.
Ken
--- (Edited on 4/28/2008 3:02 pm [GMT-0400] by kmaclean) ---
Wow! I didn't know you have that much uncompressed audio. That is very impressive.
I am definitely interested in segmenting more audio, time permitting.
Is there any time estimate when Sequitur G2P models will be trained with the VoxForge Pronunciation dictionary?
--Sergey
--- (Edited on 4/29/2008 8:18 am [GMT-0500] by Sergey) ---
Hi Sergeym
>Is there any time estimate when Sequitur G2P models will be trained with the
>VoxForge Pronunciation dictionary?
Another day or two - it has been running since last night on an AMD Athlon 64, 3500 (single core), with 3 Gig, using Fedora FC6.
Ken
--- (Edited on 4/29/2008 10:18 am [GMT-0400] by kmaclean) ---