You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-WC325
Talbanken76 Swedish Treebank
This is a Swedish POS tagged and syntactically annotated corpus which contains a written part (professional prose and high school students’ essays) and a spoken part (interviews, and conversations and debates). The professional prose of the written language part consists of data from textbooks, brochures, newspapers, etc. The written and the spoken parts are of roughly equal size. The whole corpus consists of 300,000 tokens. The part-of-speech tagging includes morphological features, and the syntactic analysis shows grammatical functions.
This Swedish treebank has recently been modernized (see Talbanken 05).
Production
Creation date :
1976
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Swedish
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4