VoxForge
Before you begin, you need to make sure that the room you are recording in is as quiet as possible. In addition, make sure you turn off you speakers while recording - to avoid acoustic feedback in your audio files.
See this FAQ entry for more information.
Next you need to adjust your microphone for optimal pickup of your voice.
If you have a headset microphone, this should be easy to do. Your microphone should be a bit to the side and below your mouth (so the microphone won't pick-up your breathing), and no more than a half inch (1-2 cm) away.
See this FAQ entry for more information.
Now you need to test your recording levels.
Start Audacity.
Make sure your microphone volume in Audacity is set to 1.0.
Then click Record (i.e. the red circle button) and begin speaking in your normal voice for a few seconds, and then click Stop (i.e. the yellow square button).
Look at the Waveform Display for the audio track you just created. The Vertical Ruler to the left of the Waveform Display provides you with a guide to your audio levels. Try to keep your recording levels between 0.5 and -0.5, averaging around 0.3 to -0.3. It is OK to have a few spikes go outside the 0.5 to -0.5 range, but avoid having any go beyond the 1.0 to -1.0 range, as this will generate distortion. If necessary, adjust Audacity's microphone volume to keep your audio within the proper ranges.
VoxForge collects speech audio at the highest Sample Rate that your Sound Card can support (up to a Sampling Rate of 48kHz, at 16 Bits Per Sample). You'll need to look at your Sound Card's manual to determine the maximum it supports. If you don't have your manual, this FAQ entry might help you get the required information (another FAQ entry provides some caveats with respect to your sound card and recording rates).
In Audacity, you set the Project Sampling Rate and Sample Format in your Preferences (under the 'File' menu selection). Click the 'Quality' tab:
Next click the 'Audio I/O' tab, and then:
Then click the 'File Formats' tab, and then
Click OK to save your settings.
Now you need to exit and re-start Audacity to make these Project Setting changes active. Look
at Project rate
selector on the bottom left hand corner of the Audacity window, make
sure it says 48000.
Each line of the prompts file (which you got in Step 1) corresponds to the transcribed contents of one audio file. The first column contains the name of the audio file. The following words on the same line contain the text you will record in the audio file (see example below):
vf5-21 So cheer up, and give us your paw vf5-22 This time he did not yap for mercy vf5-23 And the air was growing chilly |
Record your first audio track by clicking Record in Audacity and saying the words in the first line of your prompts file (skipping the name of the file). With our current example you would say:
So cheer up, and give us your paw |
Leave a one second pause before and after you speak (these pauses will help in determining noise levels in your recordings). Speak normally, not too fast or too slow, and clearly. Speak as you would if you were reading the text aloud to someone else, with appropriate pauses corresponding to the punctuation in the text. If you are unsure how to pronounce a word, this FAQ entry might help.
Click the 'Stop' icon when you are completed.
Don't exhale until after you have clicked stop (most people have a tendency to breath out after speaking a sentence) otherwise your microphones will pick up your breath noises.
Review your waveform to ensure that your recording levels are between 0.5 and -0.5, averaging around 0.3 to -0.3. It is OK to have a few spikes outside the 0.5 to -0.5 range, but don't have any go beyond the 1.0 to -1.0 range.
If your waveform looks good, then listen to the track (press 'Play' in Audacity) to make sure your pronunciation is clear and that you do not hear any non-speech noises (i.e. breathing noises, lip smacking, or background noises, ...).
Click here for Tips for Recording VoxForge Prompts with Audacity.
Record your second audio track by clicking Record again (in Audacity) and say the words in the second line of your prompts file. Audacity will start a second track leaving your first track untouched.
Review your waveform and listen to each track before proceeding to the next one. You can select a particular track to listen to by clicking 'solo' on the track menu of that particular track, and then clicking play in Audacity. Remember to click 'solo' again when you are done, to unselect the track.
Repeat the same process for each line in your prompts file, adding a new track for each line.
When you have finished recording all the prompts in the prompt file, then click 'File' on the Audacity menu, and click 'Export Multiple...'. The change the following settings on your 'Export Multiple' window:
Then click 'Export'.
This will generate a series of audio files consecutively named 'vf5-01', 'vf5-02', etc. in your 'train' directory.