Universal Catalogue  
Languages
English
Catalog Reference: ELRA-SD187
COWRAT Corpus
It contains audio meetings with a significant textual component. The meeting scenarios consist of oral discussions and written text documents reflecting the results of these discussions. It also comprises four types of metadata encoded in XML: segmentation elements that delimit text and speech units, time stamps that track actions on text documents, detailed action descriptions, and keywords. The entire corpus contains 29 meetings, lasting more than 17 hours in total, with 14,665 words, 5,015 text actions and 1,125 gesturing actions. Manual annotation, comprising orthographic transcription of the contents and tagging of dialogue acts, is still in progress. The participants were asked to perform three main kinds of tasks: reordering an existing text, organising a weekend break and discussing a research project. During a meeting, participants could not see each other and communicated only via a shared editor and an audio conferencing tool.
Contents
 written corpus 
 speech corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4