You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0129
Semantic Corpus of Bulgarian
The Semantic Corpus of Bulgarian (BulSemCor) consists of excerpts from the Brown Corpus of Bulgarian for a total of 75,000 sense-annotated words.
Words have been lemmatized and associated with the set of possible meanings in the Bulgarian wordnet. The correct grammatical or semantic meaning was then assigned manually by language experts.
The corpus has been used as a training and test corpora in the elaboration of a probability formalism for automatic word-sense disambiguation oriented towards machine translation.
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Bulgarian (Bulgaria)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Semantic
Annotation Mode : Manual
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4