|
Language Resources |
|
|
|
Search Catalogue |
|
|
|
Send us information |
|
|
|
Languages |
|
|
|
|
|
Displaying 41 to 60 (of 423 products) |
Result Pages: 3 |
4.15 hours of English-English dialogs were recorded, with US Army Chaplains playing both sides. The data were then hand transcribed.
Language(s) : English
|
|
|
|
It consists of 107 paragraphs (15,000 words) of single male speaker monologues. It contains texts from Aesop's Fables and the CIA World Fact Book.
Language(s) : English
|
|
|
|
It contains 453 phonetically balanced utterances spoken by a US male speaker.
Language(s) : English (USA)
|
|
|
|
This limited domain database for reading the time includes 24 utterances autolabelled and built into a clunits synthesizer.
Language(s) : English
|
|
|
|
This database allows weather reprorts for the whole US. It consists of 100 reports automatically constructed to cover, date, time, outlook, temperature and wind direction.
Language(s) : English
|
|
|
|
The database consists of 500 utterances and was built for an automated telephone based dialog system for booking flight information.
Language(s) : English
|
|
|
|
This database consists of a set of nonsense words containing all phone-phone transitions for US English. It includes waveforms, laryngograph (EGG) files, hand corrected labels, extracted pitchmarks, and various support files.
Language(s) : English (USA)
|
|
|
|
This databases consists of a set of nonsense words containing all phone-phone transitions for UK English. It includes, waveforms, laryngograph files, hand corrected labels, extracted pitchmarks, and various support files.
Language(s) : English (United Kingdom)
|
|
|
|
It consists of short synthesis demos in 34 languages and two songs : the Mbrola Christmas song (fully synthestic choral music) and Melissa (an Mbrola-based virtual singer).
Language(s) : Afrikaans - English (USA) - Arabic - Portuguese (Brazil) - Breton - English (United Kingdom) - French (Canada) - Croatian - Czech - Dutch - Estonian - French - German - Greek - Korean - Hebrew - Hindi - Hungarian - Icelandic - Indonesian - Italian - Japanese - Latin - Lithuanian - Malay - Polish - Portuguese - Romanian - Spanish - Spanish (Mexico) - Swedish - Telugu - Turkish - Spanish (Venezuela)
|
|
|
|
The Japanese Speecon database comprises the recordings of 607 Japanese speakers:
1) 556 adult Japanese speakers (268 males, 288 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place).
2) 51 child Japanese speakers (25 boys, 26 girls), recorded over 4 microphone channels in 1 recording environment (children room).
Language(s) : Japanese
|
|
|
|
It consists of more than 25 thousand digit sequences spoken by over 326 speakers, men, women, and children. In order to obtain a dialectically balanced database, the continental U.S. was divided into 21 dialectical regions.
Language(s) : English (USA)
|
|
|
|
The corpus consists of professionally read radio news data from seven (4 male, 3 female) announcers, including speech and accompanying annotations, suitable for speech and language research.
Language(s) : English
|
|
|
|
It contains 6300 sentences, 10 sentences spoken by each of 630 speakers from 8 major dialect regions of the United States.
Language(s) : English (USA)
|
|
|
|
It contains 95 narratives, conversations and interviews representative of the residents of Mecklenburg County, North Carolina and surrounding North Carolina communities.
Language(s) : English (USA)
|
|
|
|
Approximately 100 children at each grade level (from 1 to 10) have been recorded. The database contains words, phrases, and fluent speech in a manner that could be repeated for all of the children, regardless of age. All children read approximately 60 items.
Language(s) : English
|
|
|
|
The Japan Electronic Industry Development Association's Common Speech Data Corpus is an isolated phrase corpus consisting of 150 native speakers of Japanese (75 males and 75 females) and almost 200,000 utterances, including city names, control words, monosyllabic words, isolated digits and strings of four digits.
Language(s) : Japanese
|
|
|
|
It consists of 120 files, one conversation each, for a rough total of 9 hours and 22 minutes of audio data.
Language(s) : English
|
|
|
|
The Speech in Noisy Environments 2 Evaluation Audio Corpus was used as part of the training set for the Second Speech in Noisy Environments Evaluation. The data comprises 2-speaker pairs (4 speakers as a whole) with 32 conversations per speaker pair (64 conversations total).
Language(s) : English
|
|
|
|
The WSJ database was generated from a machine-readable corpus of Wall Street Journal news text. Some spontaneous dictation is included in addition to the read speech.
Language(s) : English
|
|
|
|
Isolet is a corpus of letters of the English alphabet, spoken in isolation. The database consists of 7800 spoken letters, 2 productions of each letter by 150 speakers. It contains approximately 1.25 hours of speech.
Language(s) : English
|
|
|
|
Displaying 41 to 60 (of 423 products) |
Result Pages: 3 |
|
|