Universal Catalogue  
  You are here » Universal Catalogue
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-ST37
Multilingual Corpus for Language Identification
This multilingual corpus was designed to enable the development and testing of algorithms for automatic language identification. It contains speech from 250 natives speakers of each language calling a data collection system from their home country via a toll-free number, as well as 50 native speakers of each language calling from within France (or from Germany, Spain, or the United Kingdom). Types of data : general questions concerning the call and the caller, series of items containing pre-defined texts to read and fixed prompts, set of questions aimed at obtaining spontaneous speech. It contains over 300 calls for each language. 70 hours of data.
Contents Click on the arrow to display content.
 speech corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4