Catalog Reference : U-S0323
Telephone Speech Data Collection for Czech
This database contains speech collected in the Czech Republic during the summer of 1999. It comprises telephone recordings from 1227 speakers (590 males and 637 females), recorded directly over the fixed telephone network using an ISDN interface.
Speech files are stored as sequences of 8-bit, 8 kHz A-law uncompressed speech samples. Each prompted utterance is stored in a separate file. Each speech file is accompanied by an ASCII SAM label file that follows the specifications of the SpeechDat project.
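The storage format described above can be illustrated with a minimal sketch that expands A-law bytes into 16-bit linear samples, following the standard G.711 A-law expansion. It assumes the speech files are headerless (raw) streams with one byte per sample, which this description does not state explicitly; the function names are hypothetical and not part of any tooling shipped with the database.

```python
def alaw_byte_to_pcm16(code: int) -> int:
    """Expand one G.711 A-law byte into a signed 16-bit linear sample."""
    code ^= 0x55                       # undo the even-bit inversion used on the wire
    mantissa = (code & 0x0F) << 4      # 4-bit mantissa, moved to its linear position
    segment = (code & 0x70) >> 4       # 3-bit segment (exponent)
    if segment == 0:
        magnitude = mantissa + 8       # linear segment near zero
    else:
        magnitude = (mantissa + 0x108) << (segment - 1)
    # After the XOR, a set top bit marks a positive sample.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(raw: bytes, sample_rate: int = 8000):
    """Decode a raw A-law byte stream (assumed headerless, one byte per sample).

    Returns the list of linear samples and the duration in seconds.
    """
    samples = [alaw_byte_to_pcm16(b) for b in raw]
    duration = len(raw) / sample_rate
    return samples, duration
```

To try it on a file, read the bytes with `open(path, "rb").read()` and pass them to `decode_alaw`. At 8 kHz and one byte per sample, every 8000 bytes correspond to one second of speech.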
Corpus contents:
- connected digits (prompt sheet number, telephone number, credit card number),
- sequences of isolated digits (5 digits),
- answers to yes/no questions,
- common application words and phrases.
The speakers are distributed by age as follows: 36 speakers are under 16 years old, 537 are between 16 and 30, 306 are between 31 and 45, 259 are between 46 and 60, 88 are over 60, and the age of 1 speaker is unknown.
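As a quick consistency check, the age groups above can be summed and compared with the stated speaker total. The sketch below uses only the counts given in this description; the dictionary keys and variable names are illustrative.

```python
# Speaker counts per age group, as listed in the description.
age_groups = {
    "under 16": 36,
    "16-30": 537,
    "31-45": 306,
    "46-60": 259,
    "over 60": 88,
    "unknown": 1,
}

total = sum(age_groups.values())

# The age groups should account for every speaker, and so should the gender split.
assert total == 1227
assert 590 + 637 == total

# Share of each age group as a percentage of all speakers.
shares = {label: 100 * count / total for label, count in age_groups.items()}
for label, share in shares.items():
    print(f"{label:>8}: {age_groups[label]:4d} speakers ({share:.1f}%)")
```

Both checks pass: the six age groups and the male/female split each add up to 1227 speakers.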
The transcription included in this database is an orthographic, lexical transcription, with a few additional markers for audible acoustic events (speech and non-speech) present in the corresponding waveform files. It follows the SpeechDat conventions.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
ISLRN : 499-113-645-559-4
Contents
speech corpus
Language(s) :
Czech
Sampling Rate :
8 kHz
Source Channel :
Telephone
Joint Copyright © 2008 ELRA & ELDA