|
Language Resources |
Displaying 301 to 320 (of 423 products) |
Result Pages: 16 |
This is a non-native Arabic corpus containing sentences uttered by English speakers.
Language(s) : Arabic
|
|
|
|
This is a non-native Japanese database containing 35,000 utterances. The 140 speakers come from 10 different countries.
Language(s) : Japanese
|
|
|
|
This multilingual collection contains spontaneous speech data that were collected and transcribed for the training and testing of Verbmobil systems. The languages covered are German, English and Japanese.
3,200 dialogs were collected from 1,658 speakers.
Language(s) : German - Japanese - English
|
|
|
|
The VODIS database is a French and German natural speech corpus recorded in a moving car.
Language(s) : French - German
|
|
|
|
CORP-ORAL is a spontaneous speech corpus of European Portuguese (60 h).
Recordings come with orthographic transcription, phonetic transcription and prosodic marking.
Language(s) : Portuguese (Portugal)
|
|
|
|
The GTRP project aimed at recording the pronunciations of words in varieties of Dutch. The recordings have been phonetically transcribed in accordance with the International Phonetic Alphabet.
This database contains more than 1.1 million transcribed elements.
Language(s) : Dutch
|
|
|
|
The IFA corpus is a database of hand-segmented Dutch speech. Eight speakers (4 male, 4 female) were recorded for a total of 5.5 hours. It contains informal as well as read speech.
All material is segmented and labeled at the phoneme level.
Language(s) : Dutch
|
|
|
|
The El Paso English survey is a three-hour collection of interviews (20 minutes per interview). It was designed to analyze lexical and phonetic features of European-American El Pasoans.
Language(s) : English
|
|
|
|
The Dutch Schiphol (Amsterdam Airport) Information Office corpus contains two-agent human-human dialogues in which an assistant answers a customer's requests about airport activities and facilities.
Language(s) : Dutch
|
|
|
|
The OVIS corpus is a Dutch corpus containing sentences on travel information (train schedules in the Netherlands). It consists of 10,000 syntactically and semantically annotated sentences (3,738 pairs of system questions and user answers, 441 full dialogues, 80 different speakers).
Language(s) : Dutch
|
|
|
|
The DIAMOND corpus contains two-agent human-human dialogues in Dutch, in which instructors explain to users how to configure and operate a fax machine.
Language(s) : Dutch
|
|
|
|
The HCRC Map Task Corpus consists of 128 dialogues that have been recorded (video and audio), transcribed, and annotated.
Language(s) : English
|
|
|
|
This Dutch Map Task corpus contains the recordings of 16 conversations (4 pairs of speakers). The sound files are in wav format and the orthographic transcriptions in rtf format.
Language(s) : Dutch
|
|
|
|
The Japanese map task corpus is a collection of 23 hours of digital recordings, digitized maps and orthographic transcriptions. It contains a total of 128 dialogues by 64 native Japanese speakers.
Language(s) : Japanese
|
|
|
|
The Swedish map task corpus is a small database of high quality. It contains 50 minutes of word-labelled conversations, for a total of 8,000 words. Four speakers of Central Standard Swedish were recorded performing map tasks in an anechoic room.
Language(s) : Swedish
|
|
|
|
The AEMT database is a collection of 16 dialogues recorded using the protocol of the HCRC Map Task.
Language(s) : English (USA)
|
|
|
|
The IViE corpus contains speech data and intonation transcriptions from nine urban dialects of British English in five speaking styles. The complete set contains 36 hours of recordings; a prosodically labelled sub-corpus of 4 hours has also been produced.
Language(s) : English (United Kingdom)
|
|
|
|
The DCIEM Map Task Corpus contains 216 spoken dialogues and their orthographic transcriptions. They were collected during a sleep deprivation experiment.
Language(s) : English
|
|
|
|
The SITAL ADAM corpus contains 450 transcribed travel agent-client dialogues, covering both human-human and human-machine interactions. Each dialogue is annotated at five levels of linguistic information: prosody, morphosyntax, syntax, semantics and pragmatics.
Language(s) : Italian
|
|
|
|
The Greenflag corpus is an American English corpus of military air traffic control speech.
Language(s) : English (USA)
|
|
|
|