You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0098
Croatian Dependency Treebank
The Croatian Dependency Treebank is a part of the Croatian National Corpus (weekly newspaper Croatia Weekly, CW2000) which is lemmatized and morphosyntactically tagged (manual disambiguation) in accordance with MulTextEast recommendations. It is planned to gather at least 100,000 tokens.
The annotation scheme is based on the Prague Dependency Treebank. The process is divided in two stages: annotation of the analytical tree structure first and then annotation of the tectogrammatical layer.
This project is part of the program "Development of Croatian Language Resources" supported by the Ministry of Science, Education and Sports of the Republic of Croatia.
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Croatian
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Annotation Scheme : TEI
Annotation language : XML
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4