You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0382
British English 2006
The British English 2006 (BE06) contains 1,010,996 words of written British English published between 2003 and 2008 on the Web. A large part of the texts (82%) were published between 2005 and 2007.
The corpus is balanced in the same way as the LOB and FLOB corpora, using the Brown model. It results in 500 files of 2000 word samples taken from 15 genres of writing:
- Press: Reportage
- Press: Editorial
- Press: Reviews
- Religion
- Skills, Trades and Hobbies
- Popular Lore
- Belles Lettres, Biographies, Essays
- Miscellaneous: Government documents, industrial reports, etc
- Academic prose
- General Fiction
- Mystery and Detective Fiction
- Science Fiction
- Adventure and Western
- Romance and Love story
- Humour
Identification
Period of coverage :
2003-2008
Version :
Version history :
Production
Creation date :
2008
Applications
Applications existing :
Discourse analysis
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
English (United Kingdom)
Number of tokens :
1,010,996 words
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4