Universal Catalogue

Multimodal/Multimedia Resources

Displaying 61 to 80 (of 80 products)

ELRA-U-MM0052

AVICAR corpus

This multimodal corpus contains audio and video recordings of 50 male and 50 female speakers under five different driving conditions.
Language(s) : English -

ELRA-U-MM0053

X-ray Film database for speech research (X-Ray)

This corpus contains 25 high-quality X-ray films. It consists of recordings of 14 English and French speakers from Canada. Each speaker read about 30 phonetically contrastive sentences.
Language(s) : English (Canada) - French (Canada) -

ELRA-U-MM0054

Multi-Channel Wall Street Journal Audio-Visual Corpus (MC-WSJ-AV corpus)

MC-WSJ-AV contains audio and video recordings of about 45 speakers reading sentences in several scenarios including a single stationary speaker, a single moving speaker and multiple concurrent speakers.
Language(s) : English -

ELRA-U-MM0055

Audio-visual French Canadian speech database

This audio-visual database contains recordings of 21 French Canadian speakers (11 male and 10 female). Each speaker was recorded reading broadcast news for about 5 hours.
Language(s) : French (Canada) -

ELRA-U-S 0012

Finnish Broadcast Corpus (FBC)

The Finnish Broadcast Corpus contains speech recordings from the Finnish Broadcasting Company. The material is divided into four categories: radio monologues, radio dialogues, TV monologues and TV dialogues.
In addition to these primary data, the corpus contains annotations giving information on units of speech (phones, words and utterances), which are aligned with the speech and video signals.
Language(s) : Finnish (Finland) -

ELRA-U-S 0027

Big Brother Corpus

This audio/video corpus is based on the first Norwegian series of the reality show Big Brother, which was broadcast in 2001. It gathers 150,000 words from 10 young adults staying in the Big Brother house.
Language(s) : Norwegian (Norway) -

ELRA-U-S 0028

Corpus of Spoken Norwegian Language (NoTa-Oslo)

This corpus is composed of orthographically transcribed speech with linked audio and video files. The speech was collected in Oslo in 2005 and comprises 1,032,198 words from 180 speakers.
Language(s) : Norwegian (Norway) -

ELRA-U-S 0029

Göteborg Spoken Language Corpus (GSLC)

This Swedish database contains 182 hours of speech, for a total of 1,416,248 words. The speech has been transcribed and automatically POS-tagged.
Language(s) : Swedish -

ELRA-U-S 0055

Speech Therapy and Physiological Research Corpora of Chinese

These corpora of Chinese include the Tone Acquisition and Voice Corpus of Hearing-Impaired Children and the Speech and Vocal Tract Video of Standard Chinese.
Language(s) : Chinese -

ELRA-U-S 0095

I2R Meeting Corpus

This meeting corpus contains Singapore-accented and bilingual speech.

Technical characteristics of the meeting room:
- 24 audio channels (sampling rate of 16kHz, 16 bits).
- 9 video channels (full D1 resolution, DV compression).
Language(s) :

ELRA-U-S 0098

Scope/Kaken Corpora

This Japanese speech corpus contains camera and microphone recordings of multi-person business and social dialogues that were collected over 3 years.
The corpus is transcribed and annotated for discourse moves.
Language(s) : Japanese -

ELRA-U-S 0102

RWCP-SP01 Meeting Speech Corpus (RWCP-SP01)

RWCP-SP01 contains speech data from a role-played meeting between more than three participants speaking Japanese. The meeting focuses on the participants' schedules. An MPEG-format video recording of the meeting was also made from three directions.
Language(s) : Japanese -

ELRA-U-S 0134

Spoken Chinese Corpus of Situated Discourse (SCCSD BJ-500)

The project aims to collect 1,000 hours of recordings of Mandarin Chinese spoken in China. 650 hours of audio and 150 hours of video recordings have already been collected.

The corpus is transcribed and annotated, with segmented audio/video chunks linked to the corresponding transcripts.
Language(s) : Chinese -

ELRA-U-S 0148

British Academic Spoken English Corpus (BASE)

The BASE corpus consists of 160 lectures and 39 seminars recorded in different university departments, for a total of 1,644,942 tokens. It also comprises video data.
Recordings were transcribed and tagged according to the TEI guidelines.
Language(s) : English (United Kingdom) -

ELRA-U-S 0152

ELISA Corpus

The ELISA corpus contains audio-video recordings of interviews with native English speakers talking about their professional careers (in tourism, politics, the media or environmental education).
The corpus currently contains 28 interviews, for a total of about 60,000 words.
Language(s) : English -

ELRA-U-S0205

Nepali Spoken Corpus

The Nepali Spoken Corpus is part of the Nepali National Corpus (NNC).

This is a spoken corpus in Nepali, designed on the basis of the Göteborg Spoken Language Corpus. It contains audio-video recordings.
Language(s) : Nepali -

ELRA-U-S0213

UPUS / Oslo corpus

This is a corpus of Norwegian spoken by young people aged 13 to 23. It is intended to be representative of multi-ethnic youth in urban environments and contains video-recorded interviews and conversations collected at school, in families or among groups of friends in Oslo.
Language(s) : Norwegian (Norway) -

ELRA-U-S0218

Swedish NICE Corpus

This corpus contains spoken dialogues between children (8-15 years old) and virtual fairy-tale characters in a computer game scenario. It also contains dialogues between children and adults collected in a post-session interview.
Language(s) : Swedish -

ELRA-U-W0311

Russian National Corpus (RNC)

This is a collection of written, spoken and multimodal corpora, totalling about 300 million tokens.
Language(s) : Russian - Russian >>>> English - Russian >>>> German -

U-MM0056

Chinese affective database (CHAD)

The Chinese affective database (CHAD) is designed to cover seven emotional states: neutral, happy, sad, fear, angry, surprise and disgust.
Language(s) :
