General Discussion

Nested
Different depths of voice
User: FrozenFire
Date: 8/27/2008 9:34 pm
Views: 5262
Rating: 5
Hi there, I have two completely different natural depths of voice. One is my dictation voice, which is of a higher pitch, and my other is my regular deep voice, which is significantly deeper than my dictation voice. In my contributions, I have used my dictation voice, however I've been thinking of switching to my speaking voice (The deep one) for an equal number of contributions. Would this screw up processing for my contributions? Or are the sets analyzed separately?

--- (Edited on 8/27/2008 9:34 pm [GMT-0500] by Visitor) ---

Re: Different depths of voice
User: dano
Date: 8/28/2008 12:37 am
Views: 246
Rating: 6

Hi FrozenFire

What I know: each sentence is 'compiled' into the language model. I think the language model should recognize more voices: even if you speak two totally different voices in one time. Contributing (even more) speech and tell it to your friends is the best way to help open source speech recognition!

Daniël

--- (Edited on 8/28/2008 12:37 am [GMT-0500] by dano) ---

Re: Different depths of voice
User: kmaclean
Date: 9/1/2008 7:18 pm
Views: 85
Rating: 13

Hi FrozenFire,

>In my contributions, I have used my dictation voice, however I've been

>thinking of switching to my speaking voice [...] Would this screw up

>processing for my contributions?

As a general rule, a speech recognition engine best recognizes the same type of speech as it was trained on.  Therefore, it would seem that submitting your dictation voice would give you the best recognition results when you to dictate (since that is the voice you will be using).

However, the training process does vocal tract length normalization to take things like this into account (since many people also have different speaking styles depending on the context).  I'm not sure how that would help in contexts like yours where your dictation voice greatly differs from your speaking voice.

I would say that if you feel you've contributed enough of your dictation voice (which will benefit you... once we have enough speech) then by all means contribute with your speaking voice, to provide some variety (and thus benefit others).

Ken

--- (Edited on 9/1/2008 8:18 pm [GMT-0400] by kmaclean) ---

Re: Different depths of voice
User: kmaclean
Date: 9/1/2008 7:19 pm
Views: 1979
Rating: 4

Hi Dano,

>each sentence is 'compiled' into the language model

I think you mean acoustic model.

Ken

--- (Edited on 9/1/2008 8:19 pm [GMT-0400] by kmaclean) ---

PreviousNext