You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0049
German-English Europarl Corpus
This German-English parallel corpus is extracted from the proceedings of the European Parliament (04/1996-10/2009). It contains 1,581,107 aligned sentences, 41,587,670 words in L1 and 43,848,958 words in L2.
It is not tokenised but a tokeniser is provided.
Identification
Period of coverage :
1996-2009
Version :
v5, 2010
Version history :
v3, 2007
Production
Creation date :
2007
Applications
application Area :
Research
Technical Informations
Bytesize :
164 MB
Compression :
Zip
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Bilingual
Language(s) :
German (Germany)English (United Kingdom)
Alignment :
Sentence
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4