You are here
»
Universal Catalogue
»
Spoken Resources
»
Desktop/microphone
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-MULT21
Hans Christian Andersen (HCA) Conversation Corpus
It contains 5 subcorpora consisting of approximately 57 hours of English spoken user-system interaction recorded from 2002 to 2005, and their transcription. The corpus presents transcription-tagged data from conversations between 7-19 year-old speakers and 3D life-like animated fairytale author Hans Christian Andersen (HCA). Wizard-of-Oz simulation has been used.
In the first subcorpus, 60-70 users aged from 7 to 19 recorded 35-40 conversations during approximately 6 hours. It contains 2,047 user utterances.
In the second one, 500 users of all ages recorded 502 conversations during approximately 30 hours. It contains 6,870 user utterances. 50% have been semantically annotated and topics (such as childhood, youth and travels) have been annotated in 70% of the data.
In the third subcorpus, 18 users aged from 10 to 18 recorded 36 conversations during approximately 11 hours. It contains 1,206 user utterances. There was video and audio interaction. The visitors used 2D deictic gesture.
In the fourth subcorpus, 13 users aged from 11 to 16 recorded 26 conversations during approximately 8 hours. It contains 1,101 user utterances. There was video and audio interaction. The visitors used 2D deictic gesture.
In the last subcorpus, 4 users aged from 10 to 14 recorded 6 conversations during approximately 2 hours. It contains 276 user utterances. There was video and audio interaction. The visitors used 2D deictic gesture.
Contents
Click on the arrow to display content.
speech corpus
Language(s) :
English
Source Channel :
Microphone
Thursday 21 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4