VoxForge
--- (Edited on 2008-03-12 12:53 am [GMT-0500] by ralfherzog) ---
> Feature request: use Ekiga for collecting/contributing speech.
From Ekiga site:
Ekiga uses both the H.323 and SIP protocols. It supports many audio and video codecs, and is interoperable with other SIP compliant software [...]
So we should be able to use Ekiga as a SIP client front-end for the VoxForgeIVR application (Asterisk-based app) that trevarthen created.
The main problem is that our current server resources would likely not be able to support the bandwidth required of a VoIP solution. Where the web front-end server is located, there is not enough bandwidth to support VoIP (I've tried direct connections with people, and have been dropped many times). On the VoxForge repository server (which is just a regular web-hosting service), we don't have root access, which (I think) is required for an Asterisk server. If there is a way around this, please let me know.
thanks,
Ken--- (Edited on 3/12/2008 3:29 pm [GMT-0400] by kmaclean) ---
> If there is a way around this, please let me know.
Of course you can build * in a custom prefix from sources. The only difference is that you should configure it with all dirs:
./configure --prefix=... --sysconfdir=... --datadir=... and
probably some more options. There are different problems with firewall but it should work fine I suppose.
And SIP is a very nice idea of course.
--- (Edited on 3/12/2008 3:04 pm [GMT-0500] by nsh) ---
--- (Edited on 3/15/2008 3:59 pm [GMT-0500] by colbec) ---
I'd be happy to lend my & my (willing) clients voices to contributing to this project via VoIP: easiest would be to implement a plug-in on either the VoIP gateway (Asterisk, FreeSwitch) or on the client (Ekiga).
Maybe talk to the respective developers to have it implemented as a feature, since they would be able to directly benefit from the project (i.e live voice translations, verbal commands to dial from a list/emergencies as opposed to DDI's, access control, etc)
Wide range of sampling can be gathered from call-centre implementations
--- (Edited on 3/2/2009 7:26 pm [GMT-0600] by Visitor) ---
Hi freakalad,
>I'd be happy to lend my & my (willing) clients voices to contributing to this
>project via VoIP
Our current approach is to collect high quality speech using the Java applet (or some another audio editor like Audacity) and then downsample it to the target rate used by the application (telephony uses 8kHz:8bit audio, ...).
For certain applications, like VoIP, there would be an additional step that would likely involve passing the speech audio through the target VoIP codec, and then training acoustic models using this transformed speech audio (for more info on this approach see my post in this thread, and David Gelbart's post in the same thread).
Ken
--- (Edited on 3/5/2009 9:55 pm [GMT-0500] by kmaclean) ---