|
Language Resources |
Displaying 101 to 120 (of 423 products) |
Result Pages: 6 |
This is a spontaneous-speech corpus. It contains information about flights, fares, airlines, cities, airports, and ground services. Simulated situations were recorded in 41 sessions containing 1,041 utterances. Each session lasted about one hour and comprised 25.4 queries on average.
Language(s) : English
|
|
|
|
It contains audio meetings with a significant textual component. The meeting scenarios consist of oral discussions and written text documents reflecting the results of these discussions. It also comprises four types of metadata encoded in XML: segmentation elements to establish text and speech units, time stamps to keep track of actions on text documents, detailed action descriptions, and keywords. The entire corpus contains 29 meetings lasting more than 17 hours in total, with 14,665 words, 5,015 text actions and 1,125 gesturing actions. Manual annotation is still in progress; it includes orthographic transcription of contents and tagging of dialogue acts.
Language(s) : English
|
|
|
|
It includes spontaneous spoken Italian materials and lists of read words, collected in Bari, Napoli and Pisa. It contains a speech sample produced by hearing-impaired and normal-hearing children.
Language(s) : Italian
|
|
|
|
This spoken Italian database contains an online edition of the 500,000-word LIP-Corpus.
Language(s) : Italian
|
|
|
|
The Portuguese Speecon database comprises the recordings of 553 adult Portuguese speakers and 52 child Portuguese speakers who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Portuguese
|
|
|
|
This corpus is a representative sample of the urban "orléanaise" community, including approximately 200 interviews and more than 300 hours of sound recordings.
Language(s) : French
|
|
|
|
This is a corpus of the phonology of contemporary French, covering the whole French-speaking world (francophonie) according to precise geographical, social and linguistic criteria. It includes recordings, annotations, socio-linguistic information and codings of certain phonological phenomena.
Language(s) : French
|
|
|
|
The corpus includes speech and transcripts from the European Parliament, and consists of original (source) and simultaneously interpreted (target) speeches in three languages: English, Italian and Spanish. It contains 177,295 words. It has been orthographically transcribed, tagged and lemmatised.
Language(s) : English - Italian - Spanish
|
|
|
|
This database is intended for the development and evaluation of noise-robust pitch marking (PMA) and/or pitch determination (PDA) algorithms. The audio data used to construct the database was selected as a subset of the Speecon Spanish database (ref. SD165). The reference database comprises 60 minutes of pitch-marked speech signal.
Language(s) : Spanish, Castilian
|
|
|
|
100 adult speakers were recorded, reading about 100 sentences each. The entire corpus covers 18 languages and is uniform across languages.
It is distributed through the ELRA catalogue (http://catalog.elra.info) under the reference ELRA-S0218.
Language(s) : Tamil
|
|
|
|
The Swiss German Speecon database comprises the recordings of 600 German speakers from Switzerland.
Language(s) : German (Switzerland)
|
|
|
|
The US English Speecon database comprises the recordings of 600 American English speakers.
Language(s) : English (USA)
|
|
|
|
The Cantonese Speecon database comprises the recordings of 600 Cantonese speakers.
Language(s) : Cantonese (China) - Cantonese (Hong Kong)
|
|
|
|
The Thai Speecon database comprises the recordings of 600 Thai speakers.
Language(s) : Thai
|
|
|
|
The Spanish Speecon database comprises the recordings of 550 adult Spanish speakers and 50 child Spanish speakers, all recorded in the US, who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Spanish (USA)
|
|
|
|
The Taiwan Mandarin Speecon database comprises the recordings of 550 adult Taiwanese speakers and 50 child Taiwanese speakers who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Mandarin (Taiwan)
|
|
|
|
The Turkish Speecon database comprises the recordings of 550 adult Turkish speakers and 50 child Turkish speakers who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Turkish
|
|
|
|
The Korean Speecon database comprises the recordings of 568 adult Korean speakers and 58 child Korean speakers who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Korean
|
|
|
|
The Egyptian Arabic Speecon database comprises the recordings of 550 adult Egyptian speakers and 50 child Egyptian speakers who uttered over 290 items and 210 items respectively (read and spontaneous).
Language(s) : Arabic (Egypt)
|
|
|
|
100 adult speakers were recorded, reading about 100 sentences each. The entire corpus covers 18 languages and is uniform across languages.
It is distributed through the ELRA catalogue (http://catalog.elra.info) under the reference ELRA-S0206.
Language(s) : Turkish
|
|
|
|