You are here
»
Universal Catalogue
»
Spoken Resources
»
Desktop/microphone
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-SD43
Travel Conversation and Basic Expression Corpora
Broad-coverage bilingual corpus to study corpus-based speech translation technologies for the real world. Three important points to consider in designing and constructing a corpus for future speech translation research: to have a variety of speech samples, with a wide range of pronunciations and speakers; to have data for a variety of situations and to have a variety of expressions. TC (travel conversation) and BE (basic expressions) corpora are designed to be complementary. TC is a collection of transcriptions of bilingual spoken dialogues (between a foreign tourist and a front desk clerk at a hotel), while BE is a collection of Japanese sentences and their English translations. TC: 20,000 sentences; BE: 200,000 sentences.
Contents
Click on the arrow to display content.
speech corpus
Language(s) :
Japanese ; English
Source Channel :
microphone
Friday 22 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4