Catalog Reference: ELRA-U-W 0130
Domain-specific Corpora of Bulgarian
These corpora are representative of various specific domains:
- Parliament Proceedings: 1,735,718 words
- Law and Court: 848,986 words
- Politics: 489,071 words
- Economics: 753,734 words
- Medicine: 728,482 words
- Sports: 441,017 words
- Army: 202,143 words
- Lifestyle: 89,891 words
- Law and Economics: 369,699 words
- Law and Medicine: 219,016 words
- Law and Sports: 46,693 words
- Bulgarian Laws: 1,147,133 words
- “24 chasa” newspaper: 7,368,711 words
- Fiction: 20,000,000 words
All of the corpora are XML-compatible (sentence level).
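As a rough illustration of the overall size, the per-domain figures listed above can be added up. This is only a sketch: the dictionary name `SIZES` and the variable names are hypothetical, the Lifestyle figure is assumed to be in words like the others, and the Fiction figure appears to be a rounded value, so the total should be read as approximate.

```python
# Word counts copied from the domain list above.
# Assumption: the Lifestyle figure is in words, like the other entries.
SIZES = {
    "Parliament Proceedings": 1_735_718,
    "Law and Court": 848_986,
    "Politics": 489_071,
    "Economics": 753_734,
    "Medicine": 728_482,
    "Sports": 441_017,
    "Army": 202_143,
    "Lifestyle": 89_891,
    "Law and Economics": 369_699,
    "Law and Medicine": 219_016,
    "Law and Sports": 46_693,
    "Bulgarian Laws": 1_147_133,
    "24 chasa newspaper": 7_368_711,
    "Fiction": 20_000_000,  # looks like a rounded figure
}

# Sum of all listed sub-corpora.
total_words = sum(SIZES.values())

# Name of the largest sub-corpus by word count.
largest = max(SIZES, key=SIZES.get)

# Print each sub-corpus with its share of the total, largest first.
for name, words in sorted(SIZES.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {words:,} words ({words / total_words:.1%})")

print(f"Total: {total_words:,} words")
```

Under these assumptions the listed sub-corpora add up to roughly 34.4 million words, with Fiction and the "24 chasa" newspaper accounting for most of the material.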
Applications
Application Area: Research
Contents
Written corpus
Number of languages: Monolingual
Language(s): Bulgarian (Bulgaria)
Annotation Coverage: Full
Annotation Granularity: Document
Annotation language: XML
Saturday 23 November, 2024
Joint Copyright © 2008 ELRA & ELDA