Catalog Reference : ELRA-U-W 0211
Czech Academic Corpus
The Czech Academic Corpus (CAC), originally called the Corpus of the Pragmatic Style, is a Czech corpus of approximately 650,000 words with manual morphological annotation. It draws on a wide range of media: newspaper and magazine articles, and transcripts of spoken language from radio and TV programs.
When annotation of the Prague Dependency Treebank (PDT) was launched, it was decided to convert the internal format and annotation schemes of the CAC so that they would be compatible with those of the PDT. The CAC v2.0 contains these conversions of syntactic annotations.
Identification
Period of coverage :
Version : 2.0
Version history : 1.0
Applications
Application Area : Research
Contents
Written corpus
Number of languages : Monolingual
Language(s) : Czech
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Syntactic
Annotation Mode : Manual
Friday 01 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4