Universal Catalogue  
Catalog Reference : ELRA-U-S 0084
Formosa Speech Database
This multilingual corpus contains speech in the three most frequently used languages in Taiwan: Mandarin, Min-Nan (Taiwanese) and Hakka. The project plans to record more than 1,800 speakers and hundreds of hours of speech.

- Acquisition: microphone or telephone.
- Speech recorded using a microphone: 16 kHz, 16-bit PCM files.
- Speech recorded using a telephone: 8 kHz, 8-bit μ-law files.
Each audio file has a corresponding label file containing the phonetic transcription.
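
As an illustration of the telephone format listed above, the following Python sketch expands 8-bit μ-law bytes into 16-bit linear samples using the standard G.711 expansion, and estimates a recording's duration from its size. It assumes headerless (raw) sample data, which this catalogue entry does not state; the function names are hypothetical and are not part of the corpus distribution.

```python
BIAS = 0x84  # bias constant used by G.711 mu-law coding


def ulaw_to_linear(byte: int) -> int:
    """Expand one 8-bit mu-law code to a signed 16-bit linear sample."""
    u = ~byte & 0xFF                 # mu-law codes are stored bit-inverted
    exponent = (u >> 4) & 0x07       # 3-bit segment number
    mantissa = u & 0x0F              # 4-bit step within the segment
    magnitude = (((mantissa << 3) + BIAS) << exponent) - BIAS
    return -magnitude if u & 0x80 else magnitude


def decode_ulaw(data: bytes) -> list[int]:
    """Decode a raw mu-law byte string (assumed headerless) to samples."""
    return [ulaw_to_linear(b) for b in data]


def duration_seconds(num_bytes: int, sample_rate: int, bytes_per_sample: int) -> float:
    """Duration of a raw mono recording of the given size."""
    return num_bytes / (sample_rate * bytes_per_sample)
```

Under the same assumption, `duration_seconds(n, 8000, 1)` gives the length of a telephone recording of `n` bytes, and `duration_seconds(n, 16000, 2)` that of a microphone recording.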

A first version of this corpus has already been released. It contains the speech of 600 speakers of Taiwanese and Mandarin, for a total of 49 hours of speech and 247,000 utterances.

The corpus is intended for the development of multilingual speech systems.
Identification
Period of coverage :
Version : v 1.0
Version history :
Applications
Application area : Research
Contents
speech corpus

Joint Copyright © 2008 ELRA & ELDA