Catalog Reference : ELRA-U-W0342
The comparable corpus of English and Russian news texts
This is a comparable corpus of English and Russian news texts.
The English part contains newswire texts from 1996 to 1997 (83,491,119 words), covering political events, crime, entertainment, etc. It is a subset of the Reuters news corpus and has been POS tagged and lemmatized.
The Russian part contains articles from Izvestia, a national newspaper in Russia, collected between 2000 and 2001 (14,564,884 words). These texts have been POS tagged and lemmatized. This part also includes texts from the Russian Reference Corpus (50,512,584 words), which covers a variety of genres.
Contents
written corpus
Number of languages : Bilingual
Language(s) : English ; Russian
Alignment : Comparable
Joint Copyright © 2008 ELRA & ELDA