You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0127
Structured Corpus of Bulgarian printed editions
The Structured Corpus of Bulgarian printed editions contains electronic versions of printed documents. Those texts are distributed in two main genres: fiction and informative prose. The corpus is divided into two parts :
- the Structured corpus of Bulgarian printed editions, published in the period 2001 until 2010, which contains about 29.2 million words from 2948 texts. 62% of the texts are original Bulgarian texts (the other part are translations from another language to Bulgarian). It consists of 79% of informative texts and 21% of fiction texts.
- the Structured corpus of Bulgarian printed editions, published in the period 1945 until 2010. It contains more than 285 million words from over 6,662 documents, including electronic versions of books (original and translated) and periodicals (newspapers, magazines, year books, etc.). It consists of 56% of informative texts and 44% of fiction texts.
Identification
Period of coverage :
1945-2010
Version :
Version history :
Production
Creation date :
2005
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Bulgarian (Bulgaria)
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4