Catalog Reference : ELRA-U-W 0018
Dependency Treebank for Russian
The Russian treebank SynTagRus is representative of modern written Russian. It contains texts of two different types:
- the primary source is the Uppsala Russian corpus (10,000 sentences), which is a balanced corpus of fiction and journalistic texts.
- the other part is drawn from Internet news portals (texts published in 2001-2002). About 88 more recent articles in popular science, economic, and political genres, published in Russian newspapers, journals, or magazines in 2007-2008, have since been added.
This resource is morpho-syntactically tagged, and a dependency-based syntactic annotation is also planned, in the same annotation style as the Prague Dependency Treebank.
Size in 2009: 41,187 tagged sentences.
Format : XML, compliant with TEI recommendations.
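As a rough illustration of how dependency annotation stored in XML might be read, here is a minimal Python sketch. The element and attribute names used (S for a sentence, W for a word, ID, DOM for the head word, LEMMA, LINK for the relation label), the relation labels, and the sample sentence are all hypothetical and are not taken from this catalogue entry; the distributed files may be organised differently.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample. The names S, W, ID, DOM, LEMMA and LINK are
# illustrative assumptions, not the documented schema of this resource.
SAMPLE = """<text>
  <S ID="1">
    <W ID="1" DOM="2" LEMMA="мальчик" LINK="subj">Мальчик</W>
    <W ID="2" DOM="_root" LEMMA="читать">читает</W>
    <W ID="3" DOM="2" LEMMA="книга" LINK="obj">книгу</W>
  </S>
</text>"""


def read_dependencies(xml_text):
    """Return, per sentence, a list of (form, lemma, head_form, relation)."""
    root = ET.fromstring(xml_text)
    sentences = []
    for sent in root.iter("S"):
        # Index the words of the sentence by ID so heads can be looked up.
        words = {w.get("ID"): w for w in sent.iter("W")}
        rows = []
        for w in words.values():
            head = words.get(w.get("DOM"))  # None when DOM names no word (root)
            rows.append((
                (w.text or "").strip(),
                w.get("LEMMA"),
                (head.text or "").strip() if head is not None else None,
                w.get("LINK"),
            ))
        sentences.append(rows)
    return sentences


if __name__ == "__main__":
    for form, lemma, head, rel in read_dependencies(SAMPLE)[0]:
        print(form, lemma, head, rel)
```

Each word carries a pointer to its head, so a sentence can be rebuilt as a tree by following the head references; the root word is the one whose head reference matches no word in the sentence.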
Production
Creation date : 2000
Applications
Application area : Research
Contents
written corpus
Number of languages : Monolingual
Language(s) : Russian (Russia)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Annotation Scheme : TEI
Annotation language : XML
Joint Copyright © 2008 ELRA & ELDA