VoxForge
Hi. What is the preferred method of downloading the audio files? Sure, a person could manually click on each link but that takes forever. Is there an FTP site? Thanks.
--Mike
--- (Edited on 2/6/2012 7:59 pm [GMT-0600] by mdeisher) ---
>What is the preferred method of downloading the audio files?
wget the files you want from:
http://www.repository.voxforge1.org/downloads/SpeechCorpus/Trunk/Audio/
--- (Edited on 2/6/2012 9:26 pm [GMT-0500] by kmaclean) ---
Thanks!
Using wget will get one file. wget -r seems to bring in way to much data. Using "-np" helps a little. Is there a known way just to bring in the files in a single directory or directory hierarch?
Is there any documentation, like a map of the server, so we can figure out where the relevant files are located?
--- (Edited on 2/6/2012 10:19 pm [GMT-0600] by mdeisher) ---
>Is there [...] a map of the server,
Submissions as we received them (i.e. 'Original' audio files) are stored here:
Original/ 07-May-2009 14:07 -
Downsampled versions of the same audio files are stored here:
16kHz_16bit/ 19-Jul-2011 22:21 - 8kHz_16bit/ 19-Jul-2011 22:25 -
These are the ones you want, and they are listed in one big directory list
--- (Edited on 2/7/2012 6:20 pm [GMT-0500] by kmaclean) ---
>all the languages are together in these directories?
nope - just english, but they all follow the same pattern - go to the language's download page and follow the links.
Note there is a backlog of audio to be processed (i.e. yet to be downsampled and reviewed to ensure that they are not spam...).
--- (Edited on 2/7/2012 8:25 pm [GMT-0500] by kmaclean) ---
Thanks! So it looks like:
English:
http://www.repository.voxforge1.org/downloads/SpeechCorpus
Dutch:
http://www.repository.voxforge1.org/downloads/Dutch
Spanish:
http://www.repository.voxforge1.org/downloads/es
etc., etc.
--- (Edited on 2/7/2012 7:38 pm [GMT-0600] by mdeisher) ---
--- (Edited on 2/7/2012 7:43 pm [GMT-0600] by mdeisher) ---