Catalog Reference : ELRA-WC064
BulTreeBank Text Archive
A collection of Bulgarian texts gathered from the Internet (more than 90,000,000 running words), covering several genres, including fiction, newspapers, and legal texts.
Format of the archive: XML (TEI scheme).
Production
Project : The BulTreeBank Project
Applications
Application Area : Research
Contents
Written corpus
Number of languages : Monolingual
Language(s) : Bulgarian (Bulgaria)
Annotation Coverage : Full
Annotation Granularity : Paragraph, Document
Annotation Scheme : TEI
Annotation language : XML
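Since the archive is described as XML following the TEI scheme, with annotation at paragraph and document level, a file from it could plausibly be read as in the sketch below. This is only an illustration under stated assumptions: the record does not specify the TEI version, the element names used, or the file layout, so the use of `<p>` for paragraphs, the inline sample document, and the helper names `local_name` and `paragraph_word_counts` are all hypothetical.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an optional '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def paragraph_word_counts(xml_text):
    """Return the number of whitespace-separated tokens in each <p> element.

    Assumption: paragraphs are marked with <p>, as is usual in TEI.
    The tag is matched by local name so that both namespaced and
    non-namespaced documents are handled.
    """
    root = ET.fromstring(xml_text)
    counts = []
    for elem in root.iter():
        if local_name(elem.tag) == "p":
            # itertext() also collects text inside nested inline elements.
            text = "".join(elem.itertext())
            counts.append(len(text.split()))
    return counts


# Hypothetical sample document, not taken from the archive.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body>
    <p>Това е първият абзац.</p>
    <p>Втори <hi>абзац</hi> тук.</p>
  </body></text>
</TEI>"""

counts = paragraph_word_counts(SAMPLE)
print(counts, sum(counts))
```

Splitting on whitespace is a rough stand-in for counting "running words"; the tokenisation behind the figure quoted in this record is not stated and may differ.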
Saturday 23 November, 2024
Joint Copyright © 2008 ELRA & ELDA