You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0380
Bulgarian National Corpus
The Bulgarian National Corpus consists of about 320,000,000 words from more than 10,000 texts. This corpus reflects the state of the Bulgarian language (mainly in its written form) from the middle of XX century (1945) until present. It contains large general corpora and smaller thematic corpora, including:
- the Brown Corpus of Bulgarian (see WC338 for a full description)
- the Structured Corpus of Bulgarian (see U-W 0127)
- some Domain-specific Corpora in Bulgarian (see U-W 0130)
It has been automatically annotated for parts of speech and other grammatical information.
Production
Creation date :
2010
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Bulgarian
Number of tokens :
320,000,000 words
Annotation Granularity : Word
Annotation level : Morphological
Annotation Mode : Automatic
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4