You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0308
Mannheim German Reference Corpus
The Mannheim German Reference Corpus (DeReKo) is a collection of German corpora covering the period of 1956 to 2001. It contains more than 3.9 billion tokens and is continuously expanded.
It includes a wide range of texts (literary, scientific and popular texts, newspaper texts, ...). Each corpus is annotated with metadata (following TEI guidelines) and is Part-Of-Speech tagged.
DeReKo stands for Deutsches ReferenzKorpus. This is the largest German corpus collection. The interface COSMAS II allows complex search options.
Identification
Period of coverage :
Version :
Version history :
Last update: 2010
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
German
Number of tokens :
3.9 billion tokens
Annotation level : Morphological
Annotation Scheme : TEI
Annotation language : XML
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4