Catalog Reference : ELRA-U-W0327
Tagged corpus for Galician language
This is a POS-tagged corpus of Galician containing 309,505 grammatical elements extracted from newspapers and journals. The texts were collected from the Reference Corpus of Present-day Galician Language / CORGA (see U-W 0242) and deal mostly with economic issues.
The corpus was automatically POS-tagged and lemmatized with the Galician tagger XIADA, then entirely revised by hand.
It can be used as a training corpus for various statistical linguistic tools.
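As a rough illustration of the training use mentioned above, the sketch below builds a most-frequent-tag baseline from (form, tag) pairs. It is not part of the resource: the tab-separated "form, tag" file layout assumed in `read_tagged_lines`, the function names, and the tag labels in the sample are all hypothetical placeholders, not the actual distribution format or the XIADA tagset. Only the ISO-8859-1 encoding is taken from this record.

```python
from collections import Counter, defaultdict


def read_tagged_lines(path):
    """Yield (form, tag) pairs from a file.

    Assumption: one token per line, tab-separated as "form<TAB>tag".
    The real layout of the corpus files may differ.
    """
    # The catalogue record lists ISO-8859-1 as the character set.
    with open(path, encoding="iso-8859-1") as handle:
        for line in handle:
            fields = line.rstrip("\n").split("\t")
            if len(fields) >= 2 and fields[0]:
                yield fields[0], fields[1]


def train_unigram_tagger(tagged_tokens):
    """Map each lowercased word form to its most frequent tag."""
    counts = defaultdict(Counter)
    for form, tag in tagged_tokens:
        counts[form.lower()][tag] += 1
    return {form: tags.most_common(1)[0][0] for form, tags in counts.items()}


def tag(model, tokens, default="UNK"):
    """Tag tokens with the model; unseen forms get the default label."""
    return [(tok, model.get(tok.lower(), default)) for tok in tokens]


# Hypothetical in-memory sample; the tag labels are placeholders.
sample = [
    ("o", "DET"), ("prezo", "NOUN"), ("sobe", "VERB"),
    ("o", "DET"), ("banco", "NOUN"), ("o", "PRON"),
]
model = train_unigram_tagger(sample)
print(tag(model, ["O", "banco", "pecha"]))
```

With a real file, `train_unigram_tagger(read_tagged_lines(path))` would replace the in-memory sample, provided the file matches the assumed layout.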
Identification
Period of coverage :
Version : 2.3
Version history :
Applications
Application Area : Research
Contents
written corpus
Number of languages : Monolingual
Language(s) : Galician
Character set : ISO-8859-1
Annotation Coverage : Full
Annotation level : Morphological
Annotation Mode : Automatic
Friday 01 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4