Download French corpus
I was looking at the submitted French speech data and saw that only part of it is available for download in the Voxforge repository. The rest seems to be in the upload directory, which is access-restricted, so there is no easy way to retrieve the whole corpus other than manually from the download page.
Is there a specific reason for that, or is there a way to get the corpus easily? I saw this post where Ken says:
Unfortunately I have not moved any German audio to subversion.
However, here is a quick and dirty way to get the audio:
1. $ wget -r -l2 http://www.voxforge.org/home/downloads/speech/german-speech-files -A "ralfherzog*"
this will create a directory called www.voxforge.org
2. search the directory for *.zip files using Gnome's search tool, and drag the results to the directory you want.
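Step 2 can also be done from the shell instead of Gnome's search tool. This is only a minimal sketch, assuming a POSIX `find` and `cp` are available; the function name `collect_zips` and the destination directory are hypothetical and not part of wget or Voxforge:

```shell
#!/bin/sh
# collect_zips SRC DEST: copy every *.zip found under SRC into DEST.
# (Hypothetical helper name, used here for illustration only.)
collect_zips() {
    src=$1
    dest=$2
    # Create the destination if it does not exist yet.
    mkdir -p "$dest" || return 1
    # -type f skips directories; -exec ... {} \; copies one file at a time,
    # which is slower than batching but works with any POSIX find and cp.
    find "$src" -type f -name '*.zip' -exec cp {} "$dest"/ \;
}
```

After the wget run, something like `collect_zips www.voxforge.org "$HOME/voxforge-zips"` should gather the archives in one place. Note that archives with the same file name in different subdirectories would overwrite each other, and DEST should not be inside SRC.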
I'm not a wget expert, but I don't think this will fetch files that are outside the specified directory. Any help?
Thanks a lot!