Universal Catalogue  
Catalog Reference : ELRA-U-W 0018
Dependency Treebank for Russian
The Russian treebank SynTagRus is representative of modern written Russian. It contains texts of two different types:
- the primary source is the Uppsala Russian corpus (10,000 sentences), a balanced corpus of fiction and journalistic texts.
- the other part is drawn from Internet news portals (texts published in 2001-2002). More recently, about 88 articles in the popular science, economic, and political genres, published in Russian newspapers, journals, and magazines in 2007-2008, have been added.

This resource is morpho-syntactically tagged, and syntactic annotation using dependencies is also planned (in the same annotation style as the Prague Dependency Treebank).

Size in 2009: 41,187 tagged sentences.
Format : XML, compliant with TEI recommendations.
Production
Creation date : 2000
Applications
Application Area : Research
Contents
 written corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4