Catalog Reference : ELRA-ST49
22 Language Corpus
The corpus contains fixed-vocabulary utterances (e.g. days of the week) as well as fluent continuous speech. There are at least 300 callers in each language. Each utterance was verified by a native speaker to determine whether the caller followed the instructions when answering the prompts. Some of the calls in each language are transcribed orthographically.
All of the data in this corpus were collected over digital telephone lines and recorded with the CSLU T1 digital data collection system. The files were sampled at 8 kHz with 8-bit resolution and stored as µ-law files.
All of the wave files were converted to RIFF format with 16-bit linear coding.
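The conversion described above, from 8-bit µ-law samples to 16-bit linear coding in a RIFF container, can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used to build the corpus: it assumes headerless, single-channel G.711 µ-law input at 8 kHz, and the function names `ulaw_to_linear` and `ulaw_bytes_to_wav` are hypothetical. The decoder is written out by hand using only the Python standard library.

```python
import io
import struct
import wave


def ulaw_to_linear(byte):
    """Decode one 8-bit G.711 mu-law code to a signed 16-bit linear sample."""
    u = ~byte & 0xFF                  # mu-law codes are stored bit-inverted
    sign = u & 0x80                   # top bit: sign
    exponent = (u >> 4) & 0x07        # next three bits: segment number
    mantissa = u & 0x0F               # low four bits: step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def ulaw_bytes_to_wav(ulaw_data, wav_file, sample_rate=8000):
    """Write raw mu-law bytes as a mono, 16-bit linear PCM RIFF/WAVE file.

    `wav_file` may be a path or a seekable binary file object.
    The 8000 Hz default is taken from the corpus description.
    """
    samples = [ulaw_to_linear(b) for b in ulaw_data]
    with wave.open(wav_file, "wb") as w:
        w.setnchannels(1)             # assumption: one channel per call
        w.setsampwidth(2)             # 2 bytes = 16-bit linear coding
        w.setframerate(sample_rate)
        w.writeframes(struct.pack("<%dh" % len(samples), *samples))


# Small in-memory demonstration with three mu-law codes.
demo_buffer = io.BytesIO()
ulaw_bytes_to_wav(bytes([0xFF, 0x00, 0x80]), demo_buffer)
```

Because the distributed wave files are already stated to be 16-bit linear RIFF, a step like this would only be needed when working from the original µ-law recordings.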
Contents
speech corpus
Language(s): Eastern Arabic; Cantonese; Czech; Farsi; French; German; Hindi; Hungarian; Japanese; Korean; Malay; Mandarin; Italian; Polish; Portuguese; Russian; Spanish; Swedish; Swahili; Tamil; Vietnamese; English
Source Channel: telephone
Joint Copyright © 2008 ELRA & ELDA