You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-WC057
Deutscher Wortschatz-Large monolingual corpora
This corpus contains more than 300 million words with approx. 6 million different word types in 13,4 million sentences, for German; 250 million words, 850.000 word types and 13 million sentences, for English, and 22 million words, 600.000 word types and 1,5 million sentences, for Dutch. Data acquisition for these corpora is based on the analysis of electronic text from various sources: general newspaper text, electronic dictionaries, electronic books and journals, web resources. Data has been marked up with HTML.
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Multilingual
Language(s) :
English ; German ; Dutch
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4