Catalog Reference : ELRA-U-W0337
Galician Technical Corpus
The Galician Technical Corpus (CTG) is a monolingual corpus of contemporary specialized Galician covering several fields (law, computing, economics, environmental science, sociology and medicine). It contains about 12 million words in XML format.
A lemmatized and POS-tagged version of the CTG is also provided for 2 million words (mostly from the environmental science texts). Annotation of this part is still in progress. It is referred to as the Galician Technical Corpus Annotated (CTAG).
Production
Creation date : 2006
Contents
written corpus
Number of languages : Monolingual
Language(s) : Galician
Annotation Granularity : Sentence
Annotation level : Morphological
Annotation language : XML
Saturday 23 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4