You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-WC324
Talbanken05 Swedish Treebank
The Talbanken05 is a modernized version of Talbanken76, a syntactically annotated Swedish corpus.
It is available in three formats besides the original one, two versions of phrase structure annotation and one dependency-based version:
- MAMBA: Original syntactic and lexical annotation, (original text, encoding in ISO-8859-1),
- FPS: Flat phrase structure annotation (TIGER-XML, encoding in ISO-8859-1),
DPS: Deepened phrase structure annotation (TIGER-XML, encoding in ISO-8859-1),
- Dep: Dependency structure annotation (Malt-XML, encoding in ISO-8859-1 and CoNLL-X shared task format in UTF-8).
Identification
Period of coverage :
Version :
v1.1
Version history :
Production
Creation date :
2005
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Swedish
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4