You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0348
MULINCO corpus
This is a multilingual corpus which contains both parallel and comparable texts, including:
- EU texts such as debates of the European Parliament and news from the Acquis Communautaire corpus,
- a collection of New Year speeches (by the Danish Queen, the German Chancelor, the French President, etc).
- literary texts covering different periods: some Danish texts translated into all the other languages, like Hans Christian Andersen, contemporary Danish authors widely translated, novels by Jules Verne originally in French with translations into English and Danish, short stories by E.A. Poe originally in English with translations into Danish and French, stories by Pirandello, originally in Italian with Danish translation.
Annotation includes POS, lemmas and structure annotation (title, filename, title page, chapters, paragraphs and periods) with XML mark-up. Alignment of the parallel texts were automatically produced and manually validated.
Production
Project :
MULINCO
Creation date :
2005
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Multilingual
Language(s) :
Danish ; English ; French ; German ; Italian ; Spanish
Annotation Coverage : Full
Annotation Granularity : Sentence#Paragraph#Document
Annotation level : Morphological
Lexical Unit Information : Single word lemma
Annotation Mode : Semi automatic
Annotation language : XML
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4