The Alphadigit Corpus is a collection of 78,044 examples from 3,025 speakers saying six-character strings of letters and digits. Each file has an orthographic transcription and a time-aligned transcription.
The CALLHOME Mandarin Chinese corpus of telephone speech consists of 120 unscripted telephone conversations between native speakers of Mandarin Chinese. All calls, which lasted up to 30 minutes, originated in North America and were placed to locations overseas.
It contains fixed vocabulary utterances as well as fluent continuous speech. There are at least 300 callers in each language. Some of the calls in each language are transcribed orthographically.
Language(s) : Eastern Arabic - Cantonese - Czech - Farsi - French - German - Hindi - Hungarian - Japanese - Korean - Malay - Mandarin - Italian - Polish - Portuguese - Russian - Spanish - Swedish - Swahili - Tamil - Vietnamese - English
This speech database contains the recordings of 598 Tunisian speakers of Modern Standard Arabic recorded over the Tunisian fixed and mobile telephone network.
It contains about 69.5 hours of speech. Of the calls, 998 were collected on an analog system and 2010 on a digital system. Each caller repeated a list of command-and-control phrases as prompted.
It consists of American English utterances by non-native speakers. The corpus contains 4925 telephone-quality utterances from native speakers of 23 languages.
It consists of utterances gathered from 346 callers using cellular telephones, together with text transcriptions of those utterances. Each caller listened and responded to a series of pre-recorded prompts.
The corpus contains fixed vocabulary utterances as well as fluent continuous speech from 11 languages. The current release includes recorded utterances from about 2052 speakers, for a total of about 38.5 hours of speech.
Language(s) : English - Farsi - French - German - Hindi - Japanese - Korean - Mandarin - Spanish - Tamil - Vietnamese
The Names Corpus is a collection of 24,245 first and last name utterances from 20,184 speakers. All of the utterances have been phonetically transcribed.
It consists of cellular telephone speech from 2336 callers from locations throughout the United States. The data collection protocol contains requests for fixed vocabulary and continuous speech utterances. About one minute of speech was collected from each caller.
The Numbers Corpus is a collection of naturally produced numbers, including isolated digit strings, continuous digit strings, and ordinal/cardinal numbers. A total of 23902 files are included in this corpus.
The Speaker Recognition corpus (formerly known as Speaker Verification) consists of telephone speech from 91 participants. Each participant recorded speech in twelve sessions over a two-year period. Most of the utterances in this release of the corpus have corresponding non-time-aligned word-level transcriptions.
It consists of spelled and spoken words from over 4000 callers. 1000 callers also recited the English alphabet with pauses between the letters. In addition, a subset of the calls has been phonetically labeled.
The Stories Corpus is made up of extemporaneous speech collected from English speakers who were asked to speak on a topic of their choice for one minute.
This speech database contains the recordings of 750 Egyptian speakers of Modern Colloquial Arabic recorded over the Egyptian fixed and mobile telephone network.
It consists of 120 telephone conversations between native speakers of Egyptian Colloquial Arabic (ECA), the spoken variety of Arabic found in Egypt. The dialect of ECA that this corpus represents is Cairene Arabic.