Universal Catalogue  
Language Resources
Catalog Reference: ELRA-U-W 0040
Danish-English Europarl Corpus
This Danish-English parallel corpus is extracted from the proceedings of the European Parliament (04/1996-10/2009). It contains 1,684,664 aligned sentences, with 43,692,760 words on the Danish side (L1) and 46,282,519 words on the English side (L2).

The corpus is not tokenised, but a tokeniser is provided.
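
A minimal sketch of how the sentence and word figures above could be checked on a local copy. It assumes the two language sides are available as line-aligned text, one sentence per line, which this record does not confirm. The names `aligned_pairs` and `corpus_stats` are hypothetical helpers, and the sample sentences are toy data. Because the corpus is untokenised, words are counted here by whitespace splitting, which may not reproduce the catalogue's figures exactly.

```python
from itertools import zip_longest
from typing import Iterable, Iterator, Tuple


def aligned_pairs(source_lines: Iterable[str],
                  target_lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (source, target) sentence pairs from two line-aligned streams.

    Assumption: line i on one side is the translation of line i on the other.
    """
    missing = object()  # sentinel marking the shorter side running out
    for src, tgt in zip_longest(source_lines, target_lines, fillvalue=missing):
        if src is missing or tgt is missing:
            raise ValueError("the two sides have different numbers of lines")
        yield src.rstrip("\n"), tgt.rstrip("\n")


def corpus_stats(pairs: Iterable[Tuple[str, str]]) -> Tuple[int, int, int]:
    """Return (sentence pairs, source words, target words).

    Words are whitespace-separated chunks, not tokeniser output.
    """
    sentences = src_words = tgt_words = 0
    for src, tgt in pairs:
        sentences += 1
        src_words += len(src.split())
        tgt_words += len(tgt.split())
    return sentences, src_words, tgt_words


if __name__ == "__main__":
    # Toy stand-ins for the Danish and English sides of the corpus.
    danish = ["Genoptagelse af sessionen\n", "Tak, hr. formand.\n"]
    english = ["Resumption of the session\n", "Thank you, Mr President.\n"]
    print(corpus_stats(aligned_pairs(danish, english)))
```

With real data, the two lists would be replaced by open file handles for the Danish and English files. Any line-count mismatch raises an error rather than silently truncating the longer side.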
Identification
Period of coverage: 1996-2009
Version: v5, 2010
Version history: v3, 2007
Applications
Application Area: Research
Technical Information
Byte size: 163 MB
Compression: Zip
Contents: written corpus

Joint Copyright © 2008 ELRA & ELDA