Language Resources
Displaying 1 to 20 (of 45 products)
The database consists of audio, video and annotated transcripts of about 36 hours of daily television news programmes.
Language(s) : Slovenian
|
|
|
|
It contains two TV interviews, with video, audio and transcribed data. Each interview lasted 51 minutes and covered a general topic, mainly the profession, past life and work of the interviewee. In each session, a native Slovenian male journalist interviewed a female non-native speaker. The transcriptions consist of 12.5k words, of which 2,516 are distinct.
Language(s) : Slovenian
|
|
|
|
This multimedia, multi-genre corpus contains videos of European Parliament plenary sessions and press conferences in English and Greek, evening TV news programmes in the same languages aired on the same day as the European Parliament events, and videos of travel documentaries and TV travel magazines in both languages. The data amounts to approximately 80 hours of politics in each language (one eighth of it press conferences), 10 hours of TV news in English and 20 hours in Greek, and 87 hours of travel videos in English and 53 hours in Greek. A small part of the data consists of the same audio stream in both languages, and another small part consists of an audio stream in one language with subtitles in the other.
Language(s) : English - Greek
|
|
|
|
It consists of approximately 52,000 words of contemporary spoken British English, mainly taken from radio broadcasts dating from the mid-1980s.
Language(s) : English (New Zealand)
|
|
|
|
This corpus draws on CD-ROM corpora of newspaper articles or material downloaded from the internet. From these sources, a set of 1,200 representative sentences was selected for use in concatenative speech synthesis.
Language(s) : Slovenian
|
|
|
|
Speech corpora representing spoken and written varieties of Modern Standard Arabic (MSA), selected from Egyptian media. The two corpora were used to formalise a tool for comparing syntactic structures.
Language(s) : Modern Colloquial Arabic
|
|
|
|
Real video broadcast by a leading Indian television network: 33 bulletins in Tamil and 20 bulletins in Telugu, with 4 hours of speech in total.
Language(s) : Tamil - Telugu
|
|
|
|
A collection of 450 human-human and human-machine spontaneous dialogues. Each dialogue is represented by a collection of audio files, one for each turn, and five files corresponding to POS, syntactic, semantic and pragmatic annotation.
Language(s) : Italian
|
|
|
|
This corpus consists of 10 hours of French radio interview archives with corresponding press-oriented transcripts.
Language(s) : French
|
|
|
|
The Dutch material consists of news broadcasts of about 430,000 words and the English material contains documentaries and talk shows of about 400,000 words.
Language(s) : Dutch - English
|
|
|
|
It consists of a parallel Dutch and Flemish broadcasting corpus aligned at sentence and chunk level.
Language(s) : Dutch - Flemish
|
|
|
|
This corpus consists of 66 hours of audio data obtained from the web.
Language(s) : Arabic
|
|
|
|
Radio broadcasting in Spain.
Language(s) : Spanish
|
|
|
|
Radio broadcasting for Arabic speakers in Australia.
Language(s) : Arabic
|
|
|
|
News and music radio broadcasting in Casablanca.
Language(s) : Arabic (Morocco)
|
|
|
|
Radio broadcasting in Spain.
Language(s) : Spanish
|
|
|
|
Radio broadcasting from Radio Montecarlo Moyen Orient.
Language(s) : Arabic
|
|
|
|
General news radio broadcasting in 33 different languages.
Language(s) : Portuguese (Brazil) - Spanish - Albanian - Macedonian - Romanian - Russian - Serbian - Turkish - Ukrainian - Arabic - French - Hausa - Kinyarwanda - Portuguese (Portugal) - Somali - Swahili - Pushto - Persian - - Azeri - Kirghiz - Uzbek - Bengali - Hindi - Nepali - Sinhalese - Tamil - Urdu - Burmese - Chinese - Indonesian
|
|
|
|
News and music broadcasting in Tunisia.
Language(s) : Arabic (Tunisia)
|
|
|
|
Television broadcasting archives. Also available for radio and newspaper.
Language(s) : Arabic - Chinese - English - French - Russian - Spanish
|
|
|
|