Universal Catalogue  
Broadcast Resources
Displaying 1 to 20 (of 45 products)

ELRA-MULT10
Slovenian broadcast news speech database (Available since 22/04/2008)


The database consists of audio, video and annotated transcripts covering about 36 hours of daily television news programmes.
Language(s) : Slovenian



ELRA-MULT16
SINOD: Slovenian Non-native Speech Database 


It contains two TV interviews, with video, audio and transcribed data. Each interview lasted 51 minutes and covered a general topic, mainly the profession, past life and work of the interviewee. In each session, a native Slovenian male journalist interviewed a female non-native speaker. The transcriptions comprise 12.5k words, of which 2,516 are distinct.
Language(s) : Slovenian



ELRA-MULT20
Reveal-This Corpus 


This multimedia, multi-genre corpus contains videos of the plenary sessions and press conferences of the European Parliament in English and Greek, evening TV news programmes in the same languages that were aired on the same day as the European Parliament events, and videos of travel documentaries and TV travel magazines in both languages. The data amounts to approximately 80 hours of politics in each language (1/8 of it press conferences), 10 hours of TV news in English and 20 hours in Greek, and 87 hours of travel videos in English and 53 hours in Greek (a small part of this data consists of the same audio stream in both languages, and another small part consists of an audio stream in one language with subtitles in the other).
Language(s) : English - Greek



ELRA-SBR10
Lancaster/IBM Spoken English Corpus (SEC) 


It consists of approximately 52,000 words of contemporary spoken British English, mainly taken from radio broadcasts dating from the mid 1980s.
Language(s) : English (United Kingdom)



ELRA-SBR2
Slovenian Speech Corpus 


This corpus draws on text corpora on CD-ROM containing newspaper articles and on material downloaded from the internet. From these sources, a set of 1200 representative sentences was selected for use in concatenative speech synthesis.
Language(s) : Slovenian



ELRA-SBR3
Written and Spoken MSA Corpora 


Speech corpora representing spoken and written varieties of Modern Standard Arabic (MSA), selected from the Egyptian media. The two corpora were used to formalise a tool for comparing syntactic structures.
Language(s) : Modern Standard Arabic



ELRA-SBR4
Doordarshan 


Real video broadcasts by a leading Indian television network: 33 bulletins in Tamil and 20 bulletins in Telugu, with 4 hours of speech in total.
Language(s) : Tamil - Telugu



ELRA-SBR5
Annotated Dialogues for Advanced Voice Interfaces 


A collection of 450 human-human and human-machine spontaneous dialogues. Each dialogue is represented by a collection of audio files, one for each turn, and five files corresponding to POS, syntactic, semantic and pragmatic annotation.
Language(s) : Italian



ELRA-SBR6
Radio interview archives and their transcripts 


This corpus consists of 10 hours of French radio interview archives with corresponding press-oriented transcripts.
Language(s) : French



ELRA-SBR7
Parallel Corpora of Programme Transcripts and Subtitles 


The Dutch material consists of news broadcasts of about 430,000 words and the English material contains documentaries and talk shows of about 400,000 words.
Language(s) : Dutch - English



ELRA-SBR8
Parallel Transcript/Subtitle Corpus 


It consists of a parallel Dutch and Flemish broadcast corpus aligned at sentence and chunk level.
Language(s) : Dutch - Flemish



ELRA-SBR9
Arabic Broadcast News 


This corpus consists of 66 hours of audio data obtained from the web.
Language(s) : Arabic



ELRA-SDiffr1
Cadena Ser 


Radio broadcasting in Spain.
Language(s) : Spanish



ELRA-SDiffr2
ABS Radio 


Radio broadcasting for Arabic speakers in Australia.
Language(s) : Arabic



ELRA-SDiffr3
Radio Casablanca 


News and music radio broadcasting in Casablanca.
Language(s) : Arabic (Morocco)



ELRA-SDiffr4
Radio Nacional 


Radio broadcasting in Spain.
Language(s) : Spanish



ELRA-SDiffr5
RMC Moyen-Orient 


Radio broadcasting from Radio Monte Carlo Moyen-Orient.
Language(s) : Arabic



ELRA-SDiffr6
BBC 


General news radio broadcasting in 33 different languages.
Language(s) : Portuguese (Brazil) - Spanish - Albanian - Macedonian - Romanian - Russian - Serbian - Turkish - Ukrainian - Arabic - French - Hausa - Kinyarwanda - Portuguese (Portugal) - Somali - Swahili - Pushto - Persian - - Azeri - Kirghiz - Uzbek - Bengali - Hindi - Nepali - Sinhalese - Tamil - Urdu - Burmese - Chinese - Indonesian



ELRA-SDiffr7
Radio Tunis 


News and music broadcasting in Tunisia.
Language(s) : Arabic (Tunisia)



ELRA-SDifft11
UN Multimedia 


Television broadcasting archives. Also available for radio and newspaper.
Language(s) : Arabic - Chinese - English - French - Russian - Spanish




Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4