Universal Catalogue

You are here » Universal Catalogue » Spoken Resources » Desktop/microphone

Language Resources

Search Catalogue

Send us information

Would you like to collaborate ?
Contact Us

Languages

Desktop/microphone

Displaying 41 to 60 (of 423 products)

Result Pages: [<< Prev] 1 2 3 4 5 ... [Next >>]

ELRA-SD131

CMULM Chaplain Spoken Dialog Database

4.15 hours of English-English dialogs were recorded, with US Army Chaplains playing both sides. The data were then hand transcribed.
Language(s) : English

Click here for
more information

ELRA-SD132

CMU FAF (Facts and Fables) Database

It consists of 107 paragraphs (15,000 words) of single male speaker monologues. It contains texts from Aesop's Fables and the CIA World Fact Book.
Language(s) : English

Click here for
more information

ELRA-SD133

CSTR US KED Timit

It contains 453 phonetically balanced utterances spoken by a US male speaker.
Language(s) : English (USA)

Click here for
more information

ELRA-SD134

CMU TIME AWB

This limited domain database for reading the time includes 24 utterances autolabelled and built into a clunits synthesizer.
Language(s) : English

Click here for
more information

ELRA-SD135

CMU Weather AWB

This database allows weather reprorts for the whole US. It consists of 100 reports automatically constructed to cover, date, time, outlook, temperature and wind direction.
Language(s) : English

Click here for
more information

ELRA-SD136

CMU Communicator KAL Database

The database consists of 500 utterances and was built for an automated telephone based dialog system for booking flight information.
Language(s) : English

Click here for
more information

ELRA-SD137

CMU US KAL Diphone

This database consists of a set of nonsense words containing all phone-phone transitions for US English. It includes waveforms, laryngograph (EGG) files, hand corrected labels, extracted pitchmarks, and various support files.
Language(s) : English (USA)

Click here for
more information

ELRA-SD138

CSTR UK RAB Diphone

This databases consists of a set of nonsense words containing all phone-phone transitions for UK English. It includes, waveforms, laryngograph files, hand corrected labels, extracted pitchmarks, and various support files.
Language(s) : English (United Kingdom)

Click here for
more information

ELRA-SD139

MBROLA binary and voices

It consists of short synthesis demos in 34 languages and two songs : the Mbrola Christmas song (fully synthestic choral music) and Melissa (an Mbrola-based virtual singer).
Language(s) : Afrikaans - English (USA) - Arabic - Portuguese (Brazil) - Breton - English (United Kingdom) - French (Canada) - Croatian - Czech - Dutch - Estonian - French - German - Greek - Korean - Hebrew - Hindi - Hungarian - Icelandic - Indonesian - Italian - Japanese - Latin - Lithuanian - Malay - Polish - Portuguese - Romanian - Spanish - Spanish (Mexico) - Swedish - Telugu - Turkish - Spanish (Venezuela)

Click here for
more information

ELRA-SD14

SPEECON Japanese

(Available since 24/10/2007)

The Japanese Speecon database comprises the recordings of 607 Japanese speakers:
1) 556 adult Japanese speakers (268 males, 288 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place).
2) 51 child Japanese speakers (25 boys, 26 girls), recorded over 4 microphone channels in 1 recording environment (children room).
Language(s) : Japanese

Click here for
more information

ELRA-SD140

TIDIGITS

It consists of more than 25 thousand digit sequences spoken by over 326 speakers, men, women, and children. In order to obtain a dialectically balanced database, the continental U.S. was divided into 21 dialectical regions.
Language(s) : English (USA)

Click here for
more information

ELRA-SD141

Boston University Speech Corpus

The corpus consists of professionally read radio news data from seven (4 male, 3 female) announcers, including speech and accompanying annotations, suitable for speech and language research.
Language(s) : English

Click here for
more information

ELRA-SD142

TIMIT Corpus

It contains 6300 sentences, 10 sentences spoken by each of 630 speakers from 8 major dialect regions of the United States.
Language(s) : English (USA)

Click here for
more information

ELRA-SD143

Charlotte Narratives

It contains 95 narratives, conversations and interviews representative of the residents of Mecklenburg County, North Carolina and surrounding North Carolina communities.
Language(s) : English (USA)

Click here for
more information

ELRA-SD144

Kids' Speech Corpus

Approximately 100 children at each grade level (from 1 to 10) have been recorded. The database contains words, phrases, and fluent speech in a manner that could be repeated for all of the children, regardless of age. All children read approximately 60 items.
Language(s) : English

Click here for
more information

ELRA-SD145

Jeida Common Speech Data Corpus

The Japan Electronic Industry Development Association's Common Speech Data Corpus is an isolated phrase corpus consisting of 150 native speakers of Japanese (75 males and 75 females) and almost 200,000 utterances, including city names, control words, monosyllabic words, isolated digits and strings of four digits.
Language(s) : Japanese

Click here for
more information

ELRA-SD146

SPINE Audio Corpus

It consists of 120 files, one conversation each, for a rough total of 9 hours and 22 minutes of audio data.
Language(s) : English

Click here for
more information

ELRA-SD147

SPINE2 Audio Corpus

The Speech in Noisy Environments 2 Evaluation Audio Corpus was used as part of the training set for the Second Speech in Noisy Environments Evaluation. The data comprises 2-speaker pairs (4 speakers as a whole) with 32 conversations per speaker pair (64 conversations total).
Language(s) : English

Click here for
more information

ELRA-SD148

Wall Street Journal Corpus

The WSJ database was generated from a machine-readable corpus of Wall Street Journal news text. Some spontaneous dictation is included in addition to the read speech.
Language(s) : English

Click here for
more information

ELRA-SD149

Isolet

Isolet is a corpus of letters of the English alphabet, spoken in isolation. The database consists of 7800 spoken letters, 2 productions of each letter by 150 speakers. It contains approximately 1.25 hours of speech.
Language(s) : English

Click here for
more information

Displaying 41 to 60 (of 423 products)

Result Pages: [<< Prev] 1 2 3 4 5 ... [Next >>]