Catalog Reference : ELRA-U-S 0007
Spoken Arabic Corpora
This spoken Arabic corpus builds on long experience in corpus collection (interviews in Moroccan Arabic in the 1970s, recordings of Modern Arabic radio broadcasts in the 1980s).
It has the following characteristics:
- comparable material from the year 1990.
- radio newscasts.
- data from three countries in which language use seems to differ (Saudi Arabia, Egypt and Algeria).
All the material was transcribed and tagged. The corpus contains approximately 320,000 words (80,000 per country, plus a control corpus).
Applications
Application Area : Education ; Research
Contents
speech corpus
Language(s) : Modern Standard Arabic (Egypt) ; Modern Standard Arabic
Source Channel : Radio
Speech Acquisition Interface : Radio
Speech Acquisition Mode : Acoustic
Transcription Entries : Orthographic
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Morphological
Joint Copyright © 2008 ELRA & ELDA