You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-WC344
The Bulgarian Corpus
It is intended to yield 100 million running words whish are collected from different sources in HTML and RTF formats. It is representative of different genres: 15 % fiction, 78 % newspapers and 7 % legal texts, government bulletins and others. About 72 million words are converted into
XML documents, marked up in conformance with the TEI guidelines.
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Bulgarian
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4