Universal Catalogue  
Catalog Reference: ELRA-SD157
Colloquial Gulf Arabic - Speech Database
35 minutes of audio per speaker per channel, 280 utterances per speaker.
The database comprises 150 speakers, 75 from the United Arab Emirates and 75 from Saudi Arabia. All are native speakers of Colloquial Gulf Arabic; half are male and half are female.

Technical details:
- Desktop recording
- Recorded at 16kHz, linear PCM
- 4 channels:
  - 1 close head-set microphone
  - 2 mid-distance microphones (different types)
  - 1 mid-distance array microphone
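For planning storage, the figures above can be turned into a rough size estimate. The sketch below is only an approximation: the sample rate, channel count, speaker count and 35 minutes per speaker per channel come from this entry, but the bit depth is not stated here, so 16-bit samples (2 bytes) are an assumption. File headers and annotation files are ignored.

```python
# Rough storage estimate for the raw audio described in this entry.
SAMPLE_RATE_HZ = 16_000        # from the entry: recorded at 16 kHz
CHANNELS = 4                   # from the entry: 4 channels
MINUTES_PER_SPEAKER = 35       # from the entry: per speaker, per channel
SPEAKERS = 150                 # from the entry
BYTES_PER_SAMPLE = 2           # ASSUMPTION: 16-bit linear PCM (bit depth not stated)


def samples_per_channel(minutes: int, sample_rate_hz: int) -> int:
    """Number of samples in one channel of the given duration."""
    return minutes * 60 * sample_rate_hz


def estimate_total_bytes() -> int:
    """Raw audio size for all speakers and channels, without file headers."""
    per_channel = samples_per_channel(MINUTES_PER_SPEAKER, SAMPLE_RATE_HZ)
    per_speaker = per_channel * CHANNELS * BYTES_PER_SAMPLE
    return per_speaker * SPEAKERS


if __name__ == "__main__":
    total = estimate_total_bytes()
    print(f"Estimated raw audio size: {total / 1e9:.2f} GB")
```

If the recordings turn out to use a different sample width, only `BYTES_PER_SAMPLE` needs to change.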

Each speaker read 280 utterances (approximately 30 minutes) organised as follows:
- 30 Person names (first name and family name) from a set of 150
- 10 isolated digits (0-9)
- 10 8-digit sequences (randomly generated)
- 200 Phonetically balanced sentences
- 30 10-word phonetically balanced word strings
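As a quick consistency check, the prompt categories listed above can be tallied to confirm that they add up to 280 utterances per speaker, and to derive the corpus-wide utterance count for 150 speakers. The category labels used as dictionary keys are just shorthand for this sketch.

```python
# Prompt counts per speaker, copied from the list in this entry.
PROMPTS_PER_SPEAKER = {
    "person names": 30,
    "isolated digits": 10,
    "8-digit sequences": 10,
    "phonetically balanced sentences": 200,
    "10-word phonetically balanced word strings": 30,
}
SPEAKERS = 150  # from the entry


def utterances_per_speaker() -> int:
    """Sum of all prompt categories for one speaker."""
    return sum(PROMPTS_PER_SPEAKER.values())


def utterances_in_corpus() -> int:
    """Total utterances across all speakers (counted once, not per channel)."""
    return utterances_per_speaker() * SPEAKERS


if __name__ == "__main__":
    print(utterances_per_speaker())   # utterances read by each speaker
    print(utterances_in_corpus())     # utterances across the whole database
```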

The audio material was transcribed (fully vowelised) and tagged using conventions derived from the SpeechDAT model.
The phonemic representation is in SAMPA and annotations are in SAM file format.
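SAM label files, as used in SpeechDat-style databases, are plain text with one field per line in the form of a short mnemonic, a colon, and a value. A minimal reader for that line layout might look like the sketch below. The sample label text is made up for illustration and is not taken from this database; the exact set of fields in the actual files is not documented in this entry and should be checked against the documentation shipped with the corpus.

```python
def parse_sam_label(text: str) -> dict[str, list[str]]:
    """Parse 'KEY: value' lines into a dict of key -> list of values.

    Values are kept as lists because a mnemonic may appear on several lines.
    Lines without a colon are skipped.
    """
    fields: dict[str, list[str]] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or ":" not in line:
            continue
        # Split on the first colon only, so values may contain colons.
        key, _, value = line.partition(":")
        fields.setdefault(key.strip(), []).append(value.strip())
    return fields


# HYPOTHETICAL label content, for illustration only.
EXAMPLE_LABEL = """\
LHD: SAM, 6.0
SAM: 16000
SNB: 2
LBO: 0,,,hypothetical transcription
"""

if __name__ == "__main__":
    label = parse_sam_label(EXAMPLE_LABEL)
    for key, values in label.items():
        print(key, values)
```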
Possible applications: automatic speech recognition

Joint Copyright © 2008 ELRA & ELDA