Re: More on Collecting Speech Audio for Free GPL Speech Corpus
Sorry for replying to a 2 year old statement. This was taken from http://www.voxforge.org/home/forums/message-boards/audio-discussions/more-on-collecting-speech-audio-for-free-gpl-speech-corpus
which was an email to Joe Picone, ISIP (Institute for Signal and Information Processing)
>And further, if clean speech is required, should I be using noise reduction software on the submitted audio. My attempts at noise reduction have not been that successful, with the resulting 'musical noise' that replaces the removed noise giving me very poor recognition results.
I have taken quite noisey wav files and run them through Adobe SoundBooth CS4's noise reduction.
You can adjust this noise/musical noise shifting to a middle ground of 'acceptible.'
I am not a salesman for Adobe. I am however struggling in creating a quality ASR for a hospital in southern India.
I'm in quite the naturally noisey environment, and the doctors will be using the inexpensive noisey headphone mike.
A sound engineer told me you should record in the best environment possible. With quality data you can do what ever you want.
The point of an effective ASR is to understand human speech and it will be done at a hospital under working conditions.
If I use noise cancelling filter on one user, the software will have to be adaptable to each different condition each user has, which will require different filters.
How do you incorporate that?
It does leave me a little bewildered at times.
Thanks for your thoughts,
--- (Edited on 2/6/2009 3:57 am [GMT-0600] by Visitor) ---