Universal Catalogue  
Catalog Reference: ELRA-WC0136
ILOLEX-PEKING corpus
The Spanish and English documents used in the experiments were downloaded from ILOLEX, a database of classified documents. The corpus documents have been mono-classified (each assigned to a single category) into 12 categories, and the number of documents varies from category to category. The English part consists of 2,165 documents (4.2 million words), with document lengths between 39 and 38,646 words. The Spanish part consists of 1,590 documents (4.7 million words), with document lengths between 117 and 7,500 words.
Contents: written corpus
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4