General Discussion

how to download audio files
User: mdeisher
Date: 2/6/2012 7:59 pm
Views: 6215
Rating: 10

Hi.  What is the preferred method of downloading the audio files?  Sure, a person could manually click on each link but that takes forever.  Is there an FTP site?  Thanks.

--Mike

--- (Edited on 2/6/2012 7:59 pm [GMT-0600] by mdeisher) ---

Re: how to download audio files
User: kmaclean
Date: 2/6/2012 8:26 pm
Views: 133
Rating: 14

>What is the preferred method of downloading the audio files?

wget the files you want from:

http://www.repository.voxforge1.org/downloads/SpeechCorpus/Trunk/Audio/

--- (Edited on 2/6/2012 9:26 pm [GMT-0500] by kmaclean) ---

Re: how to download audio files
User: mdeisher
Date: 2/6/2012 10:19 pm
Views: 89
Rating: 13

Thanks!

Using wget will get one file.  wget -r seems to bring in way too much data.  Using "-np" helps a little.  Is there a known way to bring in just the files in a single directory or directory hierarchy?

Is there any documentation, like a map of the server, so we can figure out where the relevant files are located?

--- (Edited on 2/6/2012 10:19 pm [GMT-0600] by mdeisher) ---

Re: how to download audio files
User: kmaclean
Date: 2/7/2012 5:20 pm
Views: 84
Rating: 14

>Is there [...] a map of the server,

Submissions as we received them (i.e. 'Original' audio files) are stored here:

[DIR] Original/               07-May-2009 14:07      -

Downsampled versions of the same audio files are stored here:

[DIR] 16kHz_16bit/            19-Jul-2011 22:21      -
[DIR] 8kHz_16bit/             19-Jul-2011 22:25      -

These are the ones you want, and they are all listed in one big directory listing.

--- (Edited on 2/7/2012 6:20 pm [GMT-0500] by kmaclean) ---
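
One possible way to limit wget to the files in a single directory, as asked above, is sketched here. `fetch_dir` is a hypothetical helper name, the `WGET` override exists only so the command can be previewed without downloading anything, and the file pattern (for example `'*.tgz'`) is an assumption, since the thread does not say what extension the audio archives use. The flags themselves are standard GNU wget options.

```shell
# Sketch only: fetch the files listed directly in one directory.
# fetch_dir is a hypothetical helper, not something provided by the site.
fetch_dir() {
  # $1 = URL of the directory listing
  # $2 = accept pattern (assumed, e.g. '*.tgz')
  if [ "$#" -lt 2 ]; then
    echo "usage: fetch_dir DIRECTORY_URL PATTERN" >&2
    return 2
  fi
  url="${1%/}/"      # ensure exactly one trailing slash so wget treats it as a directory
  pattern="$2"
  # -r -l 1   recurse, but only one level below the starting listing
  # -np       never ascend to the parent directory
  # -nd       save files flat instead of recreating the server's tree
  # -c        resume partially downloaded files
  # -w 1      wait one second between requests
  # -A / -R   keep files matching the pattern, discard the index pages
  ${WGET:-wget} -r -l 1 -np -nd -c -w 1 -A "$pattern" -R 'index.html*' "$url"
}
```

Running `WGET=echo fetch_dir http://www.repository.voxforge1.org/downloads/SpeechCorpus/Trunk/Audio/ '*.tgz'` prints the arguments instead of downloading; point the URL at whichever directory listing actually holds the files you want. For a whole hierarchy rather than one directory, replacing `-l 1` with a larger depth and dropping `-nd` should keep the download below the starting directory, since `-np` still prevents climbing to the parent.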

Re: how to download audio files
User: mdeisher
Date: 2/7/2012 6:15 pm
Views: 139
Rating: 14

Thanks, Ken!  So all the languages are together in these directories?  If so, that makes it easy.

--Mike

--- (Edited on 2/7/2012 6:15 pm [GMT-0600] by mdeisher) ---

Re: how to download audio files
User: kmaclean
Date: 2/7/2012 7:25 pm
Views: 120
Rating: 12

>all the languages are together in these directories?

Nope, just English, but they all follow the same pattern: go to the language's download page and follow the links.

Note that there is a backlog of audio still to be processed (i.e. yet to be downsampled and reviewed to ensure that it is not spam).

--- (Edited on 2/7/2012 8:25 pm [GMT-0500] by kmaclean) ---

Re: how to download audio files
User: mdeisher
Date: 2/7/2012 7:38 pm
Views: 100
Rating: 12

Thanks!  So it looks like:

English:

http://www.repository.voxforge1.org/downloads/SpeechCorpus

Dutch:

http://www.repository.voxforge1.org/downloads/Dutch

Spanish:

http://www.repository.voxforge1.org/downloads/es

etc., etc.

--- (Edited on 2/7/2012 7:38 pm [GMT-0600] by mdeisher) ---

--- (Edited on 2/7/2012 7:43 pm [GMT-0600] by mdeisher) ---

Re: how to download audio files
User: kmaclean
Date: 2/8/2012 3:52 pm
Views: 1850
Rating: 13

yes

--- (Edited on 2/8/2012 4:52 pm [GMT-0500] by kmaclean) ---
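
The per-language pattern confirmed above could be captured in a small lookup like the following sketch. `lang_base_url` is a hypothetical helper name, and only the three mappings that appear in this thread are included; other languages would need to be checked on their own download pages.

```shell
# Sketch only: map a language name to its download root, using the
# three URLs listed in this thread. lang_base_url is a hypothetical name.
lang_base_url() {
  base='http://www.repository.voxforge1.org/downloads'
  case "$1" in
    English) echo "$base/SpeechCorpus" ;;
    Dutch)   echo "$base/Dutch" ;;
    Spanish) echo "$base/es" ;;
    *)
      # Anything else is not covered by the thread, so fail loudly.
      echo "unknown language: $1" >&2
      return 1
      ;;
  esac
}
```

For example, `lang_base_url Spanish` prints the Spanish root, which can then be browsed or passed to wget.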
