Universal Catalogue  
  You are here » Universal Catalogue » Spoken Resources » Desktop/microphone
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-U-S 0049
Min-dialectal Chinese Corpus
This is a corpus of Min dialectal Chinese (Xiamen-centered).

Sampling rate: 22,050 Hz
Three channels: two standard microphones and one USB microphone (Sony C-38B, Sennhiser e835s, Logitech LPAC-5000).
Number of speakers: 36 (18 male, 18 female).
Age: 20-30.
Data: 200 long sentences, 10 digits, 26 English letters.
Transcription: character, syllable.

The processing of Chinese dialects is a global issue in the processing of the Chinese language in general since China can be divided in 8 major dialectal regions (in addition to Mandarin) and each of these can be divided in many sub-categories.

Data have also been collected for Wu and Chuan dialects. It is planned to collect data for more dialects (Xiang: Changsha-centered, Yue: Guangzhou-centered, Jin: Taiyuan-centered, etc.).
Applications
Applications possible : Speech recognition#Speech synthesis
application Area : Research
Contents Click on the arrow to display content.
 speech corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4