You are here
»
Universal Catalogue
»
Spoken Resources
»
Telephone
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : U-S0296
French Speechdat(II) FDB-1000
This French SpeechDat(II) FDB-1000 database contains the recordings of 1,017 French speakers recorded over the fixed telephone network. This speech database was sponsored by the European Commission (CEC DGXIII), under the project LE2-4001.
Speech samples are stored as sequence of 8-bit, 8kHz A-law and are uncompressed. Each prompt utterance is stored within a separate file (file extension FRA) and has an accompanying ASCII SAM label file (file extension FRO).
It contains 48 utterances (40 mandatory and 8 optional items) for 1,017 different speakers, 17 speakers have been added to the original 1,000 speakers to fit the requirements of the database. The main content of the database is speech and orthographic transcription files.
The database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.
It is designed for development and assessment of French speech recognizers.
Each speaker uttered the following items:
* 5 application words
* 1 sequence of 10 isolated digits
* 4 connected digits: 1 sheet number (5+ digits), 1 telephone number (9-11 digits), 1 credit card number (14-16 digits), 1 PIN code (6 digits)
* 3 dates: 1 spontaneous date (e.g. birthday), 1 prompted date (word style), 1 relative and general date exp.
* 2 word spotting phrases using an application word (embedded)
* 1 isolated digit
* 3 spelled words (letter sequences): 1 spontaneous, e.g. own forename, 1 spelling of direct. city name, 1 real/artificial for coverage
* 1 currency money amount
* 1 natural number
* 5 directory assistance names + 1 spelled name: 1 spontaneous, e.g. own forename, 1 city of birth / growing up (spont), 1 most frequent cities (set of 500), 1 most frequent company/agency (set of 500), 1 "forename surname", 1 spelled city of birth
* 2 questions, including "fuzzy" yes/no: 1 predominantly "yes" question, 1 predominantly "no" question
* 9 phonetically rich sentences
* 2 time phrases: 1 time of day (spontaneous), 1 time phrase (word style)
* 8 phonetically rich words
ISLRN : 404-131-596-481-9
Production
Project :
SpeechDat(II)
Creation date :
1998
Contents
Click on the arrow to display content.
speech corpus
Language(s) :
French
TEXT_CLIPPING_RATE_PERCENTAGE8 kHz
Source Channel :
Telephone
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4