You are here
»
Universal Catalogue
»
Spoken Resources
»
Speech Related
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : U-S0301
Luxembourgish-German SpeechDat(II) FDB-500
The Luxembourgish-German SpeechDat(II) FDB-500 database contains the recordings of 560 Luxembourgish-German speakers (247 Males, 313 Females) recorded over the Luxembourgish fixed telephone network.
Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file, which contains the relevant descriptive information.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.
Each speaker uttered the following items:
- 7 application words
- 4 isolated digits
- 1 sequence of 10 isolated digits
- 5 connected digits (1 area code, 1 spontaneous phone number, 1 credit card number –15/16 digits, etc.
- 3 dates (1 spontaneous date e.g. birthday, 1 prompted date, 1 general and relative date expression)
- 1 embedded application word
- 4 spelled words
- 1 currency money amount
- 1 natural number
- 6 directory assistance names (1 forename, 1 city of birth, 1 most frequent city, 1 city name, 1 company name, 1 "forename surname")
- 2 yes/no questions (1 predominantly "yes" question, 1 predominantly "no" question)
- 10 phonetically rich sentences
- 2 time phrases (1 spontaneous time of day, 1 time phrase)
- 6 phonetically rich words
The following age distribution has been obtained: 5 speakers are under 16, 113 speakers are between 16 and 30, 174 speakers are between 31 and 45, 184 speakers are between 46 and 60 and 84 are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
ISLRN : 423-799-403-506-4
Production
Project :
SpeechDat(II) LE2-4001
Applications
Applications existing :
Speech recognition
Contents
Click on the arrow to display content.
speech corpus
#110936
Language(s) :
German (Luxembourg)
Duration : Ca. 42h, for 30021 .LGA files
Signal Encoding : A-law
TEXT_CLIPPING_RATE_PERCENTAGE8 kHz
Source Channel :
Telephone
speech lexicon
#210936
Phoneme setSAMPA
Lexicon creation mode
Lexicon type
Lexicon entries
Sunday 24 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4