You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0065
The UCLA Chinese corpus
It is a modern Chinese written corpus of one million tokens from texts collected between 2000 and 2005. It is segmented and POS tagged.
The UCLA Chinese Corpus is designed as a Chinese counterpart of the FLOB and FROWN corpora of British and American English. It can also be considered as a recent update of the Lancaster Corpus of Mandarin Chinese (LCMC), which is available from the ELRA catalogue (
http://catalog.elra.info
) under the reference ELRA-W0039.
Production
Creation date :
2007
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Chinese (China)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Morphological
Annotation language : XML
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4