Catalog Reference : ELRA-U-W 0007
The Bosque Treebank
Bosque was developed in 2005 in the framework of the Floresta Sintá(c)tica project, a collaboration between Linguateca and the VISL project.
Bosque is a subset of Floresta; it comprises 215,003 tokens from CETEMPúblico and CETENFolha (corresponding to 9,431 trees).
Each tree is available in three representations: Constraint Grammar in text format, phrase tree in text format, and phrase tree in graphical format.
It was fully revised by linguists.
Identification
Period of coverage :
Version : v7.4 (Dec 2005)
Version history :
Production
Project : Floresta Sintá(c)tica
Creation date : 2005
Applications
Application area : Research
Contents
written corpus
Number of languages : Monolingual
Language(s) : Portuguese (Portugal) ; Portuguese (Brazil)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Joint Copyright © 2008 ELRA & ELDA