You are here
»
Universal Catalogue
»
Spoken Resources
»
Desktop/microphone
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-SD187
COWRAT Corpus
It contains audio meetings with a significant textual component. The meeting scenarios consist of oral discussions and written text documents reflecting the results of these discussions. It also comprises 4 types of metadata encoded in XML: segmentation elements to establish text and speech units, time stamps to keep track of actions on text documents, detailed action descriptions and keywords. The entire corpus contains 29 meetings which last in total more than 17 hours, 14,665 words, 5,015 text actions and 1,125 gesturing actions. A manual annotation is still in progress, which includes orthographic transcription of contents and tagging of dialogue acts. The participants were asked to perform 3 main kinds of tasks: reordering an existing text, organising a weekend break and discussing a research project. During a meeting, participants didn't see each other and were only communicating via a shared editor and audio conferecing tool.
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
English
speech corpus
Language(s) :
English
Source Channel :
Microphone
Friday 22 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4