Catalog Reference: ELRA-U-W 0189
Cambridge Learner Corpus
The Cambridge Learner Corpus (CLC) is a large collection (about 25 million words) of exam scripts written by learners of English taking Cambridge ESOL exams. It currently contains scripts from 85,000 students in 180 countries, representing 100 different first languages. It forms part of the Cambridge International Corpus (CIC).
Each script contains information about the student's first language, nationality, level of English, age, etc.
Approximately 13 million words (about 45,000 scripts) have been annotated with a Learner Error Coding system. This makes it possible to extract and analyze the words or structures that produce the most errors in learner English, and to search for particular errors and retrieve many examples of each.
The corpus is continually expanding.
Applications
Application area: Education, Research
Contents
written corpus
Number of languages: Monolingual
Language(s): English
Saturday 23 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4