General Discussion

Nested
Re: Acoustic model 0.1.2
User: kmaclean
Date: 9/9/2008 11:43 am
Views: 237
Rating: 13

Hi dano,

>all things of scripts and example configurations and doc etc. are all in the

>file

OK

>I just took the ubuntu packages (which are very usable to my opinion.) 

Agree. But it seems to be set up for an Ubuntu package update process (I'm not that familiar with the ways packages are updated in Ubuntu).  If you are not using Ubuntu or Debian, then you've got to search for things...

>Maybe I should the files move in the home directory of the package? 

Yes, just move the relevant files to the Home directory - i.e. README (to get started and create grammars) and simple bash startup script.

Ken

--- (Edited on 9/9/2008 12:43 pm [GMT-0400] by kmaclean) ---

Re: Acoustic model 0.1.2
User: nsh
Date: 9/10/2008 4:39 pm
Views: 262
Rating: 2

> Do these numbers include the problem prompts too, or did you omit them?  i.e. is all we have to do to get to 97% is remove or correct the offending submission prompts?

Surely not, we have to correct transcriptions, optimize training paramters, train MLLT and LDA transformations.

Btw, David recently tested voxforge-en model on wsj test set, so here is the real result:

 test 20k, Sphinx3.7, bigrams:

==> voxforge_s3_test20k/voxforge_s3_test20k.align <==
TOTAL Words: 5645 Correct: 4557 Errors: 1230
TOTAL Percent correct = 80.73% Error = 21.79% Accuracy = 78.21%
TOTAL Insertions: 142 Deletions: 159 Substitutions: 929

test 5k, Sphinx3.7, trigrams:

==> voxforge_s3_test5k/voxforge_s3_test5k.align <==
TOTAL Words: 5354 Correct: 4880 Errors: 562
TOTAL Percent correct = 91.15% Error = 10.50% Accuracy = 89.50%
TOTAL Insertions: 88 Deletions: 65 Substitutions: 409

 Original WSJ results:

 SI-84 (14 hours), 2800 senones, 8 Gaussians, trigrams, Sphinx3, test 5k:

==> si84_sphinx3/si84_sphinx3.align <==
TOTAL Words: 5354 Correct: 5079 Errors: 325
TOTAL Percent correct = 94.86% Error = 6.07% Accuracy = 93.93%
TOTAL Insertions: 50 Deletions: 56 Substitutions: 219

SI-284 (80 hours?), 3000 senones, 32 Gaussians, bigrams, Sphinx3, test20k:

==> si284_20k_sphinx3/si284_20k_sphinx3.align <==
TOTAL Words: 5645 Correct: 5164 Errors: 559
TOTAL Percent correct = 91.48% Error = 9.90% Accuracy = 90.10%
TOTAL Insertions: 78 Deletions: 68 Substitutions: 413

--- (Edited on 9/10/2008 4:39 pm [GMT-0500] by nsh) ---

Re: Acoustic model 0.1.2
User: nsh
Date: 9/22/2008 12:14 pm
Views: 73
Rating: 1

I managed to use Opera instead of Mozilla to write this one.

Ken, can you please update the model:

http://www.mediafire.com/download.php?atmdlrdt0om

MLLT training made this one better:

TOTAL Words: 28420 Correct: 26989 Errors: 1929
TOTAL Percent correct = 94.96% Error = 6.79% Accuracy = 93.21%
TOTAL Insertions: 498 Deletions: 362 Substitutions: 1069

 

--- (Edited on 9/22/2008 12:14 pm [GMT-0500] by nsh) ---

Re: Acoustic model 0.1.2
User: kmaclean
Date: 9/23/2008 6:52 pm
Views: 2311
Rating: 1

HI nsh,

>Ken, can you please update the model:

Thanks!

The new Sphinx Acoustic model is located here.

Ken

P.S. I am travelling all this week, so turnaround for requests may be delayed a bit...

 

--- (Edited on 9/23/2008 7:52 pm [GMT-0400] by kmaclean) ---

PreviousNext