Universal Catalogue  
  You are here » Universal Catalogue » Written Resources » Written Corpora
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-WC325
Talbanken76 Swedish Treebank
This is a Swedish POS tagged and syntactically annotated corpus which contains a written part (professional prose and high school students’ essays) and a spoken part (interviews, and conversations and debates). The professional prose of the written language part consists of data from textbooks, brochures, newspapers, etc. The written and the spoken parts are of roughly equal size. The whole corpus consists of 300,000 tokens. The part-of-speech tagging includes morphological features, and the syntactic analysis shows grammatical functions.

This Swedish treebank has recently been modernized (see Talbanken 05).
Production
Creation date : 1976
Applications
application Area : Research
Contents Click on the arrow to display content.
 written corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4