Catalog Reference : ELRA-U-S 0084
Formosa Speech Database
This multilingual corpus contains speech in the three most frequently used languages in Taiwan: Mandarin, Min-Nan (Taiwanese) and Hakka. The project plans to record more than 1,800 speakers and hundreds of hours of speech.
- Acquisition: microphone or telephone.
- Speech recorded using a microphone: 16 kHz, 16-bit PCM files.
- Speech recorded using a telephone: 8 kHz, 8-bit μ-law files.
Each audio file has a corresponding label file containing the phonetic transcription.
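As an illustration of the two audio formats, the sketch below expands 8-bit μ-law telephone samples to 16-bit linear PCM using the standard G.711 expansion, and estimates a recording's duration from its size. It is a minimal sketch rather than part of the corpus distribution: it assumes mono audio stored as raw, headerless sample data, which this catalogue entry does not state, and the names `mulaw_to_pcm16`, `decode_mulaw` and `duration_seconds` are hypothetical.

```python
# Bytes of audio per second for each source channel, from the formats above.
# Assumption (not stated in the catalogue entry): mono, headerless sample data.
BYTES_PER_SECOND = {
    "microphone": 16000 * 2,  # 16 kHz, 16-bit PCM: 2 bytes per sample
    "telephone": 8000 * 1,    # 8 kHz, 8-bit mu-law: 1 byte per sample
}

MULAW_BIAS = 0x84  # bias of 132 used by the G.711 mu-law encoding


def mulaw_to_pcm16(code: int) -> int:
    """Expand one 8-bit mu-law code to a signed 16-bit linear sample."""
    u = ~code & 0xFF            # mu-law bytes are stored bit-inverted
    sign = u & 0x80             # top bit: sign
    exponent = (u >> 4) & 0x07  # next three bits: segment number
    mantissa = u & 0x0F         # low four bits: step within the segment
    magnitude = (((mantissa << 3) + MULAW_BIAS) << exponent) - MULAW_BIAS
    return -magnitude if sign else magnitude


def decode_mulaw(data: bytes) -> list[int]:
    """Decode a buffer of mu-law bytes into a list of 16-bit PCM samples."""
    return [mulaw_to_pcm16(b) for b in data]


def duration_seconds(num_bytes: int, channel: str) -> float:
    """Estimate audio duration from the size of the raw sample data."""
    return num_bytes / BYTES_PER_SECOND[channel]
```

For example, `duration_seconds(32000, "microphone")` and `duration_seconds(8000, "telephone")` both describe one second of audio, since the telephone format uses a quarter of the bytes per second.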
A first version of this corpus has already been released. It contains the speech of 600 speakers of Taiwanese and Mandarin, for a total of 49 hours of speech and 247,000 utterances.
The corpus is intended for the development of multilingual speech systems.
Identification
Period of coverage :
Version : v 1.0
Version history :
Applications
Application Area : Research
Contents
speech corpus
Language(s) : Mandarin (Taiwan) ; Chinese (Taiwan)
Source Channel : Microphone ; Telephone
Speech Acquisition Mode : Acoustic
Transcription Entries : Phonetic
Friday 22 November, 2024
Joint Copyright © 2008 ELRA & ELDA