|
Language Resources |
Displaying 221 to 240 (of 423 products) |
It contains more than 35 hours of recordings from 46 French speakers (24 female, 22 male) engaged in conversations with friends.
Language(s) : French
|
|
|
|
The Big ASC is planned to contain auditory-visual (AV) recordings from 1100 speakers, covering varieties of Australian English across various regions and social backgrounds.
Language(s) : English (Australia)
|
|
|
|
This is a multimodal corpus that contains audio and video recordings from 50 male and 50 female speakers under five different driving conditions.
Language(s) : English
|
|
|
|
This corpus contains 25 high-quality X-ray films. It consists of recordings of 14 English and French speakers from Canada. Each speaker read about 30 phonetically contrastive sentences.
Language(s) : English (Canada) - French (Canada)
|
|
|
|
MC-WSJ-AV contains audio and video recordings of about 45 speakers reading sentences in several scenarios including a single stationary speaker, a single moving speaker and multiple concurrent speakers.
Language(s) : English
|
|
|
|
This is an audio-visual database. It contains recordings from 21 French Canadian speakers (11 male and 10 female). Each speaker was recorded reading broadcast news for about 5 hours.
Language(s) : French (Canada)
|
|
|
|
This corpus represents Italian spontaneous speech events collected from 1965 onwards to support studies of Italian intonation. It is divided into two sub-corpora illustrating adult speech and early acquisition.
Language(s) : Italian (Italy)
|
|
|
|
This corpus represents Italian spontaneous speech events collected from 1965 onwards to support studies of Italian intonation. It is divided into two sub-corpora illustrating adult speech and early acquisition.
Language(s) : Italian (Italy)
|
|
|
|
NECTE is a corpus of dialect speech from Tyneside in North-East England (from 1969 to 1994). The speech data come with orthographic and phonetic transcriptions, and with part-of-speech tagging.
The NECTE corpus was released in 2005.
Language(s) : English (United Kingdom)
|
|
|
|
CoSIH is a spoken corpus of Israeli Hebrew that will amount to 5,000,000 words when completed (it is still under construction).
Language(s) : Hebrew (Israel)
|
|
|
|
This is a small parallel corpus of spoken texts taken from the EUROM-1 speech corpus. 40 short passages have been translated from English into Romanian, Slovene, Estonian, Hungarian, Czech and Bulgarian.
For four languages (Romanian, Slovene, Estonian and Hungarian) recordings of the texts are also provided (with links between texts and spoken passages).
Language(s) : English - Romanian - Slovene - Bulgarian - Czech - Estonian - Hungarian - Romanian (Romania) - Slovene (Slovenia) - Estonian (Estonia) - Hungarian (Hungary)
|
|
|
|
It gathers 100 hours of recorded speech in WAV format, with orthographic transcriptions in TXT format and phonetic annotations.
Language(s) : Italian (Italy)
|
|
|
|
The OTG corpus contains recordings of real conversations between one or several tourists and a receptionist in a French tourist office. The collected data (two hours) have been transcribed.
Language(s) : French (France)
|
|
|
|
The Ecole de Massy corpus contains recordings of directed conversations in French between 7-year-old children and their teacher.
Language(s) : French (France)
|
|
|
|
This corpus contains speech recordings with a total duration of one hour. Four speakers (2 male, 2 female) were asked to read out a set of around 740 isolated words. This set covers the most important features of standard Lithuanian speech.
The corpus is annotated; the phone-level and word-level transcriptions are aligned.
Language(s) : Lithuanian
|
|
|
|
The Bangla speech corpus contains approximately 70 hours of speech recordings.
Bangla, or Bengali, is a language spoken in East India and in Bangladesh.
Language(s) : Bengali
|
|
|
|
This resource is a large Assamese speech database.
Assamese is a language spoken in East India.
Language(s) : Assamese
|
|
|
|
This resource is a large Manipuri speech database.
Manipuri is a language spoken in East India.
Language(s) : Manipuri
|
|
|
|
The TCC300 is a microphone speech database for Mandarin; it contains recordings of 300 speakers (150 male and 150 female speakers).
Language(s) : Mandarin
|
|
|
|
This corpus contains task-oriented conversations between people who know each other. The total duration of the conversations is 5 hours (an average of 10 minutes per conversation).
With the help of Translist, 26 conversations (66,000 characters) have been completely transcribed and annotated.
Language(s) : Mandarin (Taiwan)
|
|
|
|