Click here to register.

English Speech Files

Nested
ralfherzog-20071126-en22
User: ralfherzog
Date: 11/25/2007 7:04 pm
Views: 2881
Rating: 19

Speaker Characteristics:

Gender: male;
Age range: adult;
Pronunciation dialect: General American English.

Recording Information:

Microphone make: Sennheiser PC 131;
Microphone type: noise canceling headset;
Audio card make: Andrea USB adapter;
Audio card type: USB;
Audio Recording Software: Audacity 1.2.6;
O/S: Windows XP Professional.

File Info:

File type: FLAC;
Sampling rate: 48kHz;
Sample rate format: 16bit;
Number of channels: 1;
Audio Processing: no

en22-01 The word has a repeated entry.
en22-02 What is going on here?
en22-03 This comes as a surprise.
en22-04 That word is recognized.
en22-05 He is not going anywhere.
en22-06 Could I ask you to come with me?
en22-07 Do you know anything about this?
en22-08 It is the sequence of phones to be used.
en22-09 The output symbol is not specified.
en22-10 He is unfit to continue.
en22-11 He is afraid that is the point of the meeting.
en22-12 She will be joining us later.
en22-13 They need to find her as soon as possible.
en22-14 He is going to smash this thing.
en22-15 The output symbol is optional.
en22-16 Click here for more information.
en22-17 She knows what we look like.
en22-18 That won't be necessary.
en22-19 I thought I made myself clear.
en22-20 Could he come over here for a second?
en22-21 She is not saying anything.
en22-22 He is going to find out.
en22-23 The dictionary has a very simple format.
en22-24 Each word is determined from the dictionary.
en22-25 He is trying to say something.
en22-26 They are getting her to the hospital.
en22-27 That may not be a good idea.
en22-28 They would generate the following output.
en22-29 He is not going to make it.
en22-30 It is best to generate a large number of examples.
en22-31 She has to find out about those things.
en22-32 I still don't trust her.
en22-33 What did she say?
en22-34 This output can be quite illuminating.
en22-35 This is for more complex grammars.
en22-36 They won't have to.
en22-37 The sooner the better.
en22-38 He is counting on her.
en22-39 He is not a stupid man.
en22-40 This is perhaps not very informative in this case.
en22-41 Why don't you give it to me?
en22-42 You wanted to see me?
en22-43 He will make it quick.
en22-44 This is what they have to construct.
en22-45 Do you need any help?
en22-46 She is trying to help you.
en22-47 I want to talk to her alone.
en22-48 It converts all cases back to their original form.
en22-49 You have got twenty minutes to clean your desk.
en22-50 He is too far away.
en22-51 Let me get back to you.
en22-52 He can't ignore that.
en22-53 She can keep him out of jail.
en22-54 She knows how these things work.
en22-55 I would never lie to you.
en22-56 But they are lucky, because they need them.
en22-57 I will do my best.
en22-58 All examples are to be replaced by the single phone.
en22-59 She will be on the phone for the next two hours.
en22-60 Why don't you want to tell her?
en22-61 Do you really want to do this?
en22-62 I don't have to explain myself to you.
en22-63 This could be achieved by the following things.
en22-64 They will be at war against three innocent countries.
en22-65 He has been removed from office.
en22-66 He will do all the talking.
en22-67 It is just your word against mine.
en22-68 So you want revenge.
en22-69 She wants full immunity from prosecution.
en22-70 I will tell you everything.
en22-71 I came here to protect myself.
en22-72 Give me your keys.
en22-73 Let me see what I can do.
en22-74 He is the only one who can help you now.
en22-75 How is she connected to this?
en22-76 You are a traitor to your country.
en22-77 That is why he found you.
en22-78 He wants to speak to the people directly.
en22-79 We have work to do.
en22-80 There is nothing to say.
en22-81 I don't know what to say.
en22-82 What are you talking about?
en22-83 You have caused me a lot of trouble.
en22-84 They can be set up in a similar way.
en22-85 It appears he was the architect.
en22-86 They just finished setting up the feed.
en22-87 She has got to go now.
en22-88 I will take care of it.
en22-89 I am against any delay in our actions.
en22-90 This is a wireless transmitter.
en22-91 You have full support.
en22-92 Don't tell her what is going on.
en22-93 You think you will be safe out there.
en22-94 She wanted to talk to your sister.
en22-95 Shouldn't we call for help?
en22-96 I need you to get me in contact with him.
en22-97 I can defend myself.
en22-98 You have twenty minutes to report back.
en22-99 There may be some collateral damage.

Copyright (C) 2007  Ralf Herzog

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, see <http://www.gnu.org/licenses/>.

--- (Edited on 11/25/2007 7:04 pm [GMT-0600] by ralfherzog) ---

ralfherzog-20071126-en22.zip ralfherzog-20071126-en22.zip

Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: ralfherzog-20071126-en22
User: kmaclean
Date: 11/28/2007 8:08 pm
Views: 240
Rating: 23

Hi Ralph,

You've been busy! Thanks for the last bunch of submissions.

The Acoustic Model training script had a little trouble with one of your entries:

en22-69 She wants full immunity from prosecution.

Which I removed from the version in the VF Speech Corpus.  It's strange ...  it sounds OK to me, but caused the training process some problems.  

Here is the link to the version in the Corpus: 

[   ] ralfherzog-20071126-en22.tgz 28-Nov-2007 15:28 12.7M

Ken 

--- (Edited on 11/28/2007 9:08 pm [GMT-0500] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

"she wants full immunity from prosecution"
User: ralfherzog
Date: 11/29/2007 12:03 am
Views: 140
Rating: 14
Hello Ken,

I just dictated the sentence "she wants full immunity from prosecution."  And DNS 9.5 recognized this sentence 100% correctly.

You know, I am creating my own prompts (en1, en2, ..., en22, ...) using DNS.  After the creation of the prompts, I read them aloud and record my voice with Audacity.

But I am glad that the the rest of my prompts seems to be OK.  I am planning to submit more prompts in a similar style like before.  I hope that is OK.  If not, please tell me.

You know, in the long term, I want to submit a lot of prompts in the German language.  But I understand that the English-language is on top priority.  And at the moment, there is no one who is able to process the German prompts.  So I have to stick to the English-language.

I am trying to cover a lot of standard dialog situations in the English-language. So there shouldn't be a lot of problems.  At least, this is what I am hoping.  

You do a great job, Ken, and I hope that more people will join the VoxForge project.

Greetings, Ralf

--- (Edited on 11/29/2007 12:03 am [GMT-0600] by ralfherzog) ---

--- (Edited on 11/29/2007 12:07 am [GMT-0600] by ralfherzog) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: "she wants full immunity from prosecution"
User: Robin
Date: 11/30/2007 11:56 am
Views: 157
Rating: 14

I personally think that you shouldn't let the fact that there is no one to process your speech files stop you from submitting more files.  The more files (German) we have the more interesting it becomes for someone to help us process them (make a dictionary etc.)

Perhaps you can persuade someone from the Simon Project to help us out?

Maybe you should also write a small text on "Spracherkennung", Linux, open source and VoxForge.  That way we might get more German visitors.  You can publish something like that on the dev pages.

I have done that as well to attract more Dutch visitors.  So far without success, however it is now possible to find VoxForge using some Dutch search terms.

--- (Edited on 11/30/2007 11:56 am [GMT-0600] by Robin) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

How to process German prompts files?
User: ralfherzog
Date: 11/30/2007 4:45 pm
Views: 138
Rating: 19

Hi Robin,

Thanks for your answer.

Maybe you are able to help me out?  You seem to be able to understand how to build a dictionary for the Dutch language.  And you had submitted some prompts in the German language.  So this could be some kind of teamwork: I submit a lot of prompts in the German language, and you could help me to process them.

Maybe we can build a German dictionary from scratch?  Obviously, there are some German dictionaries available, but they aren't published under the GPL.  So in my opinion, maybe we can build our own German dictionary under the GPL.

Greetings, Ralf

--- (Edited on 11/30/2007 4:45 pm [GMT-0600] by ralfherzog) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: How to process German prompts files?
User: kmaclean
Date: 11/30/2007 8:22 pm
Views: 510
Rating: 21

Hi Ralf,

Timo has started some work on the German phoneset on the VoxForgeDev wiki, and was looking for feedback.  This would be an excellent place to start in creating a German Pronunciation dictionary.

Creating a (GPL) German Phoneset is an important step to creating a (GPL) German Pronunciation dictionary.  Essentially, what you can do is use this GPL Germane Phoneset in a Text-To-Speech Engine (like eSpeak, or Festival) to output the phonemes for a word (rather than speaking them), so you can use the TTS engine to give you a good first pass pronunciation dictionary for a list of words.  This is similar to the approach I used to find the pronunciations for out-of-vocabulary words for the LibriVox text I segmented.  

I believe I sent you a password a while back to for the German Trac site on VoxForge ... if not, let me know, and I can send you one (I still have not got around to fixing mod_security on the Web server to allow updates without signing-on and keep comment spammers out...).

Ken 

--- (Edited on 11/30/2007 9:22 pm [GMT-0500] by kmaclean) ---


Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

PreviousNext