You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0370
SUBTLEX-CH
This is a subtitle corpus for Chinese. It contains 33.5 million words (46.8 million characters) from 6,243 different contexts (7,148 files) coming from movies (50%) and from television series (50%).
It was built in order to create an improved frequency norm for Dutch Word frequencies. This new word frequency measure is called the SUBTL Frequency Norm.
Production
Creation date :
2010
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Chinese
Number of tokens :
33.5 million words
Saturday 05 April, 2025
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4