Speech Recognition in the News

Nested
GigaSpeech: An Evolving, Multi-domain ASR Corpus
User: kmaclean
Date: 8/23/2021 2:40 pm
Views: 1572
Rating: 0

GigaSpeech is:

An evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc

paper

Next