You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0314
Quechua-Spanish Parallel Treebank
This is a corpus-based parallel treebank of 200 sentences in both languages Quechua and Spanish.
The corpus contains two main bilingual texts:
- the declaration of human rights (about 100 sentences),
- information texts and the FAQ from a Peruvian website about the rights of citizens (about 100 sentences).
The Quechua treebank is annotated on words and morphemes, following the guidelines of the RRG (Role and Reference Grammar) whereas the Spanish treebank is annotated on words only following the AnCora tagsets (simplified). Both treebanks were then aligned at the sentence level.
This resource is still under developement and should be extended to provide a larger parallel treebank.
Production
Creation date :
2008
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Parallel
Language(s) :
Spanish (Peru) <<< >>> Quechua (Peru)
Alignment :
Sentence
Annotation Coverage : Full
Annotation Granularity : Morpheme#Word
Annotation level : Syntactic
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4