You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0128
Tagged Corpus of Bulgarian
The Tagged Corpus of Bulgarian (BulPosCor) is composed of 300 word extracts from the Brown Corpus of Bulgarian, for a total of 200,000 words. Those extracts have been POS annotated by language experts with the help of the annotation tool Chooser.
Parts of this corpus were used as training and test corpora in the creation of TokTag (a programme for automatic POS disambiguation).
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Bulgarian
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Morphological
Annotation Mode : Semi automatic
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4