VoxForge
Hi,
I wonder, how the GPL would be applied to the following situation:
A huge number of users like you and me collect a speech corpus that allows to build acoustic models of similar quality compared to commercial systems. Now a hardware developer comes up with some speech recognition tool that is designed to work exactly with VoxForge's model. He doesn't distribute the model with his device, but provides a link to voxforge.org where customers may download the model.
- Is this situation covered by GPL?
If not, I'm reluctant to spend time on something that others may turn into profit without returning anything to the community.
If yes:
- Does the GPL require the device developer to distribute his device together with the model sources, OR
- Does the GPL reuqire him to distribute his device together with the model sources PLUS any sources and hardware description of his own device?
The latter situation is most desirable, because everybody who benefits from the acoustic models has to provide some improvement in return. But I doubt GPL includes anything except for directly derived code (or models). Your opinion?
--- (Edited on 12/17/2006 4:10 pm [GMT-0600] by Visitor) ---
The general rule is that the GPL only covers 'distribution' of a program or work.
So if you try to distribute modifications to the original work, you must provide users with your changes. Or if you distribute the original work in object or executable form, you must provide the source. If you never distribute the original GPL'ed work, GPL does not apply.
The GPL in section 2 says:
[...] If identifiable sections of that work are not derived from the Program, and can be reasonably considered independent and separate works in themselves, then this License, and its terms, do not apply to those sections when you distribute them as separate works. [...]
Thus, it is not the intent of this section to claim rights or contest your rights to work written entirely by you; rather, the intent is to exercise the right to control the distribution of derivative or collective works based on the Program.
(my emphasis added)
Therefore, my reading of the GPL is that if someone develops a hardware (or proprietary software-based) speech recognition engine that uses VoxForge Acoustic Models ('AM'), GPL would not prevent him from telling his users to link to the VoxForge site and get their AM there. I am not a lawyer, and I think that this question is important enough to put to the FSF to confirm my interpretation.
Having said that, the reality of the situation is that Acoustic Models, like software, are never static. There is always room for improvement. A commercial venture would most likely need to make improvements to transcriptions, clean up the audio, add their own domain specific audio to address specific customer needs. Acoustic Models would have to be localized to improve their recognition rates for specific regions. And these modifications, if 'distributed' to their customers, would need to be published pursuant to the GPL.
That is how I can see that the community would win in this situation.
Ken
--- (Edited on 12/17/2006 10:21 pm [GMT-0500] by kmaclean) ---
Autoreply from SFS:
This message has been automatically generated in response to the a
licensing question you sent to the Free Software Foundation, with subject:
"VoxForge Licensing Question".
There is no need to reply to this message right now. Your request has
been assigned an ID of [gnu.org #324354].
Please include the string:
[gnu.org #324354]
in the subject line of all future correspondence about this issue. To do
so, you may reply to this message.
This address is answered primarily by volunteers, overseen by one staff
member of FSF. We have very limited resources to answers requests.
--- (Edited on 12/17/2006 10:32 pm [GMT-0500] by kmaclean) ---
Can you provide a link where everybody can follow this issue at FSF?
Or will you post FSF's clarification as a follow-up to this thread when it comes in?
--- (Edited on 12/22/2006 10:06 am [GMT-0600] by Visitor) ---
I just sent an email to FSF and I posted the auto-reply e-mail they sent back. I am not sure how this process works at FSF ... if there is a link I will post it, if not I will post their e-mail reply here.
Ken
--- (Edited on 12/22/2006 12:37 pm [GMT-0500] by kmaclean) ---
Yoni Rabkin via RT <[email protected]> | Fri, Feb 23, 2007 at 2:31 AM | |
Reply-To:
[email protected]
| ||
|
--- (Edited on 2/23/2007 11:22 am [GMT-0500] by kmaclean) ---
--- (Edited on 2/27/2007 11:24 am [GMT-0500] by kmaclean) ---
--- (Edited on 2/27/2007 11:25 am [GMT-0500] by kmaclean) ---
Yoni Rabkin via RT <[email protected]> | Tue, Feb 27, 2007 at 10:44 AM |
Reply-To:
[email protected]
| |
Hi,
I'm still convinced that the acoustic model is a library as far as the
GPL is concerned. Allow me to recapitulate:
To the best of my understanding, the primary purpose of a speech
recognition engine is not to read the acoustic model as data. Instead,
the speech recognition engine uses the acoustic model in order to
achieve its primary purpose. Moreover, I think that the output of the
speech recognition engine is derived from the acoustic model.
If the acoustic model was "just data", switching it should not have a
profound effect on the speech recognition engine's primary
functionality.
To contrast, an audio player plays any media file just as well. Just as
a compiler would compile any program just as well. If an audio player
needed *at least* a single instance of some data X in order to perform
its primary function (playing music), I would say that the data X is as
essential as any of the other libraries the audio player requires.
I'll stress again that without legal precedence, we can continue to
argue and convince each other, but ultimately only a court can decide
such a subtle matter.
Since the speech corpus is licensed under the GPL, I am taking the
stance which I view has the best chance of protecting the wishes of the
copyright holder/s. If the speech corpus was licensed under the LGPL, I
would probably have chosen a different route.
I'm just a volunteer, and this is uncharted territory, so this is the
best I can do. But I look at it like this: once upon a time, long ago,
licensing almost *any* software under the GNU GPL was uncharted
territory, and things seem to have been working out well.
--- (Edited on 2/27/2007 11:26 am [GMT-0500] by kmaclean) ---
--- (Edited on 2/27/2007 11:28 am [GMT-0500] by kmaclean) ---
Yoni,
let me comment on two paragraphs of your preivous post.
>To the best of my understanding, the primary purpose of a speech
>recognition engine is not to read the acoustic model as data. Instead,
>the speech recognition engine uses the acoustic model in order to
>achieve its primary purpose.
Yes, but there is not "the" acoustic model (like "the" library version against which a program has to be compiled)
>If the acoustic model was "just data", switching it should not have a
>profound effect on the speech recognition engine's primary
>functionality.
There can be several similar models with no profound difference in recognition rate. A user might even be able to train his own acoustic model with better (single user) performance than the voxforge model.
Given these comments, does it still make sense to say, that the voxforge acoustic model is like a library to some software that allows to use this model, but can instead work with other acoustic models as well?
--- (Edited on 4/22/2007 6:21 pm [GMT-0500] by Visitor) ---