You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0136
Slovak National Corpus
The Slovak National Corpus (SNK) is a database of contemporary Slovak language texts. It contains 526,082,640 tokens and covers a broad range of textual genres: journalist texts, fiction, specialized texts, scientific texts, etc.
It comprises a balanced subcorpus of 254,236,903 tokens (33.3% journalistic texts, 33.3% fiction, 33.3% specialized texts).
A part of the corpus has also been manually annotated with morphological information. It is called "r-mak-3.0" and contains 1,207,939 tokens.
The SNK has been enlarged with multilingual resources (French-Slovak parallel corpus and Russian-Slovak parallel corpus).
Identification
Period of coverage :
Version :
4.0
Version history :
3.0 (2006)
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Slovak
Annotation Coverage : Partial
Annotation Granularity : Word
Annotation level : Morphological
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4