 |
Language Resources |
 |
|
 |
Search Catalogue |
 |
|
 |
Send us information |
 |
|
 |
Languages |
 |
|
|
Multimodal/Multimedia Resources |
|
|
 |
Displaying 61 to 80 (of 80 products) |
Result Pages: 4 |
This is a multimodal corpus which contains audio and video recordings from 50 male and 50 female speaker under five different driving conditions.
Language(s) : English -
|
|
|
|
This corpus contains 25 high quality x-ray films. It consists of recordings of 14 English and French speakers from Canada. Each speaker read about 30 phonetically contrastive sentences.
Language(s) : English (Canada) - French (Canada) -
|
|
|
|
MC-WSJ-AV contains audio and video recordings of about 45 speakers reading sentences in several scenarios including a single stationary speaker, a single moving speaker and multiple concurrent speakers.
Language(s) : English -
|
|
|
|
This is an audio-visual database. It contains recordings from 21 French Canadian (11 male and 10 female speakers). Each speaker were recorded reading broadcast news during about 5 hours.
Language(s) : French (Canada) -
|
|
|
|
The Finnish Broadcast Corpus contains speech recordings from the Finnish Broadcasting Company. The material is divided into four categories: radio monologues, radio dialogues, TV monologues and TV dialogues.
In addition to these primary data, the corpus contains annotations giving information on units in speech (fones, words and utterances, which are aligned with the speech and video signals).
Language(s) : Finnish (Finland) -
|
|
|
|
This audio/video corpus is based on the first Norwegian series of the reality show Big Brother, which was broadcasted in 2001. It gathers 150,000 words from 10 young adults staying in the BigBrother house.
Language(s) : Norwegian (Norway) -
|
|
|
|
This corpus is composed of orthographically transcribed speech with linked audio and video files. Speech was collected in Oslo in 2005 and gathers 1,032,198 words from 180 speakers.
Language(s) : Norwegian (Norway) -
|
|
|
|
This Swedish database contains 182 hours of speech, for a total of 1,416,248 words. Speech has been transcribed and automatically POS tagged.
Language(s) : Swedish -
|
|
|
|
These corpora of Chinese include: the Tone Acquisition and Voice Corpus of Hearing-Impaired Children, the Speech and Vocal Tract Video of Standard Chinese.
Language(s) : Chinese -
|
|
|
|
This meeting corpus has Singapore accent and bilingual speech.
Technical characteristics of the meeting room:
- 24 audio channels (sampling rate of 16kHz, 16 bits).
- 9 video channels (full D1 resolution, DV compression).
Language(s) :
|
|
|
|
This Japanese speech corpus contains camera and microphone recordings of multi-person business and social dialogues that were collected over 3 years.
The corpus is transcribed and annotated for discourse moves.
Language(s) : Japanese -
|
|
|
|
The RWCP-SP01 contains speech data of a played meeting between more than three participants speaking Japanese. The meeting focuses on the schedule of the participants. MPEG format video recording of the meeting was also taken from three directions.
Language(s) : Japanese -
|
|
|
|
The project aims to collect 1,000 hours of recordings of Mandarin Chinese spoken in China. 650 hours of audio and 150 hours of video recordings have already been collected.
The corpus is transcribed and annotated, with segmented audio/video chunks linked to the corresponding transcripts.
Language(s) : Chinese -
|
|
|
|
The BASE corpus consists of 160 lectures and 39 seminars recorded in different university departments, for a total of 1,644,942 tokens. It also comprises video data.
Recordings were transcribed and tagged according to the TEI guidelines.
Language(s) : English (United Kingdom) -
|
|
|
|
The ELISA corpus contains audio-video recordings of interviews with English native speakers talking about their professional career (in tourism, politics, the media or environmental education).
The corpus currently contains 28 interviews, for a total of about 60,000 words.
Language(s) : English -
|
|
|
|
The Nepali Spoken Corpus is a part of the Nepali National Corpus (NNC).
This is a spoken corpus in Nepali, designed on the basis of the Goteborg Spoken Language Corpus. It contains audio-video recordings.
Language(s) : Nepali -
|
|
|
|
This is a corpus of Norwegian spoken by young people from 13 to 23 years old. It is intended to be representative of the multi-ethnic youth in urban environments and contains video-recorded interviews and conversations collected at school, in families or among friends groups in Oslo.
Language(s) : Norwegian (Norway) -
|
|
|
|
This corpus contains spoken dialogues between children (8-15 years old) and virtual fairy-tale characters in a computer game scenario. It also contains dialogues between children and adults collected in a post-session interview.
Language(s) : Swedish -
|
|
|
|
This is a collection of written, spoken and multimodal corpora, which represents about 300 million tokens.
Language(s) : Russian - Russian >>>> English - Russian >>>> German -
|
|
|
|
The Chinese affective database (CHAD) is designed and established for seven emotion states: neutral, happy, sad, fear, angry, surprise and disgust.
Language(s) :
|
|
|
|
Displaying 61 to 80 (of 80 products) |
Result Pages: 4 |
|
|