You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0037
AnCora-CAT Catalan Corpus
AnCora-CAT is a Catalan corpus of 395,379 words(still under development, aim: 500,000 words). It was built from the previous 3LB-CAT corpus (100,000 words), which has been enlarged with more materials from the Catalan ACN news agency and from the El Periodico newspaper.
The AnCora-CAT corpus has been annotated from POS to semantic information (the 3LB-CAT had already been annotated from POS to syntactic information).
Production
Creation date :
2007
Applications
Applications possible :
Discourse analysis#Information retrieval
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Catalan (Spain)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Semantic
Annotation Mode : Semi automatic#Manual
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4