You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0113
Finnish Language Text Collection
The Finnish Language Text Collection (Suomen kielen tekstikokoelma) contains 180 million running tokens of written Finnish from the 1990's. Several genres are represented: newspapers, journals, novels, scientific articles. A part of the corpus (about a half) is annotated with morpho-syntactic information using Textmorfo and Swesg tools.
Texts were first marked up with SGML, according to the TEI recommendations, but the collection is now available only in XML.
Note: the Finnish Parole Corpus is included in the Finnish Text Collection.
Identification
Period of coverage :
1990's
Version :
SKTP-A
Version history :
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Finnish (Finland)
Annotation Coverage : Full
Annotation Granularity : Document
Annotation Scheme : TEI
Annotation language : XML
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4