Universal Catalogue  
Catalog Reference: ELRA-SD43
Travel Conversation and Basic Expression Corpora
A broad-coverage bilingual corpus for studying corpus-based speech translation technologies intended for real-world use. Its design addresses three requirements for a corpus meant to support future speech translation research: a variety of speech samples, covering a wide range of pronunciations and speakers; data for a variety of situations; and a variety of expressions. The TC (travel conversation) and BE (basic expressions) corpora are designed to be complementary. TC is a collection of transcriptions of bilingual spoken dialogues (between a foreign tourist and a front desk clerk at a hotel), while BE is a collection of Japanese sentences and their English translations. TC: 20,000 sentences; BE: 200,000 sentences.
Contents: speech corpus
 

Joint Copyright © 2008 ELRA & ELDA