You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0045
Italian-English Europarl Corpus
This Italian-English parallel corpus is extracted from the proceedings of the European Parliament (04/1996-10/2009). It contains 1,635,140 aligned sentences, 46,380,851 words in L1 and 47,236,441 words in L2.
It is not tokenised but a tokeniser is provided.
Identification
Period of coverage :
1996-2009
Version :
v5, 2010
Version history :
v3, 2007
Production
Creation date :
2007
Applications
application Area :
Research
Technical Informations
Bytesize :
170 MB
Compression :
Zip
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Bilingual
Language(s) :
Italian (Italy)English (United Kingdom)
Alignment :
Sentence
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4