|
Language Resources |
|
|
|
Search Catalogue |
|
|
|
Send us information |
|
|
|
Languages |
|
|
|
|
|
Displaying 241 to 260 (of 423 products) |
Result Pages: 13 |
This Chinese corpus contains data of different speaking styles, genres and situations. It has been transcribed and annotated.
Language(s) : Chinese
|
|
|
|
This corpus contains 400 hours of accented Mandarin Chinese speech recorded from 100 speakers representing three different Tibetan areas. It has been transcribed and phonetically annotated.
Language(s) : Mandarin Chinese
|
|
|
|
This is a 120 hour multilingual and multi-accented database containing speech for Min, Yue and Beijing regions (60 speakers). It has been transcribed and phonetic annotations have been added.
Language(s) : Chinese
|
|
|
|
This is a database of infant cries. 200 infants were recorded in three different hospitals (various ages, weights, diseases, ...).
Language(s) :
|
|
|
|
This database is composed of various types of recordings. 600 recording sheets containing 80 sentences each were prepared with English long sentences, English short sentences, English words and mixed Chinese-English sentences. They were read by English Department people and non-English Department people, male and female, and recorded using hand-held microphone OR wire/wireless telephone (PSTN/GSM).
Language(s) : English (Taiwan)
|
|
|
|
This is a corpus of Min dialectal Chinese (Xiamen-centered).
Language(s) : Chinese
|
|
|
|
This is a corpus of Chuan dialectal Chinese (Chendu-centered).
Language(s) : Chinese
|
|
|
|
This database contains emotional speech in Chinese. 16 college students were asked to read 55 paragraphs containing an emotionally unbiased sentence. Physiological data were also collected (electrocardiogram, respiration, electro-dermal data, finger pulse).
Language(s) : Chinese
|
|
|
|
This speech database contains the recording of one female speaker reading texts from the Hakka Written Texts collection.
Language(s) : Chinese
|
|
|
|
This is a speech database for Urdu, an Indian language. It contains the recordings of 87 speakers (different genre, age, context).
Language(s) : Urdu
|
|
|
|
This database for Hindi comprises recordings of syllables, frequent words, digits, phonetically rich sentences, prosody rich sentences, domain-specific texts and news texts. Professional speakers (male and female) were recorded in noise free and unechoic studio conditions.
Language(s) : Hindi
|
|
|
|
This database for Punjabi comprises recordings of syllables, frequent words, digits, phonetically rich sentences, prosody rich sentences, domain-specific texts and news texts. Professional speakers (male and female) were recorded in noise free and unechoic studio conditions.
Language(s) : Panjabi, Punjabi
|
|
|
|
This database for Marathi comprises recordings of syllables, frequent words, digits, phonetically rich sentences, prosody rich sentences, domain-specific texts and news texts. Professional speakers (male and female) were recorded in noise free and unechoic studio conditions.
Language(s) : Marathi
|
|
|
|
This is a speech database containing 84,000 read sentences. Each speaker was asked to read a set of 210 sentences, and nearly 500 speakers of three different age groups, genre (50% male/female) and four major Western Indonesian accents (Javanese, Sundanese, Batak, Standard Indonesian) were recorded by telephone or microphone.
Language(s) : Indonesian
|
|
|
|
400 native speakers of Korean were recorded to compile a corpus of remote speech, using multi-channel microphone array and HAT (Head and Torso simulator).
Language(s) : Korean
|
|
|
|
This is a database of spoken Singapore English. It contains recordings of 500 speakers in noisy environments. The distance of recording is 0,5 / 1 m.
Language(s) : English
|
|
|
|
This is a corpus containing recordings of children's speech. 100 children were asked to read prompted isolated words and sentences. They were recorded in their classroom.
Language(s) : English
|
|
|
|
These corpora were designed for the study of Mandarin Chinese continuous speech and its prosody:
- phonetically-balanced corpus (18h38)
- multiple-speaker corpus (19h29)
- intonation-balanced corpus (31h19)
- stress-pattern balanced corpus (48 mns)
- lexically-balanced corpus (35h50)
- focus-balanced prosody groups corpus (7h30)
- text-type / speaking-style varied corpus (1h32)
- prosody balanced monosyllable corpus (16h50)
- comparable spontaneous/read corpus (42 mns)
Language(s) : Mandarin Chinese
|
|
|
|
This multilingual corpus contains speech for the three most frequently used languages in Taiwan: Mandarin, Min-Nan (Taiwanese) and Hakka. The project plans to record more than 1,800 speakers and hundreds of hours.
Language(s) : Mandarin (Taiwan) - Chinese (Taiwan)
|
|
|
|
This meeting corpus has Singapore accent and bilingual speech.
Technical characteristics of the meeting room:
- 24 audio channels (sampling rate of 16kHz, 16 bits).
- 9 video channels (full D1 resolution, DV compression).
Language(s) :
|
|
|
|
Displaying 241 to 260 (of 423 products) |
Result Pages: 13 |
|
|