You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0047
Portuguese-English Europarl Corpus
This Portuguese-English parallel corpus is extracted from the proceedings of the European Parliament (04/1996-10/2009). It contains 1,681,991 aligned sentences, 47,621,552 words in L1 and 47,000,805 words in L2.
It is not tokenised but a tokeniser is provided.
Identification
Period of coverage :
1996-2009
Version :
v5, 2010
Version history :
v3, 2007
Production
Creation date :
2007
Technical Informations
Bytesize :
172 MB
Compression :
Zip
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Bilingual
Language(s) :
Portuguese (Portugal)English (United Kingdom)
Alignment :
Sentence
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4