Universal Catalogue  
  You are here » Universal Catalogue » Written Resources » Written Corpora
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-U-W 0139
Evrokorpus Parallel Corpus
It is composed of five aligned corpora of legal texts from the Acquis Communautaire corpus:
- an English-Slovene corpus (compiled in 2002) which contains 1,650,000 translation units (about 67 million words),
- a German-Slovene corpus (compiled in 2006) which contains more than 340,000 translation units (about 13 million words),
- a French-Slovene corpus (compiled in 2007) which contains more than 560,000 translation units (about 25 million words),
- a Spanish-Slovene corpus (compiled in 2008) which contains 230,000 translation units (about 10 million words),
- an Italian-Slovene corpus (compiled in 2008) which contains 270,000 translation units (about 11 million words).

It also contains :
- EU Commission data (added in 2008) including 610,000 multilingual translation units (98 million words),
- 16,000 Slovene-English translation units (700,000 words) from the Trans corpus (added in 2010) covering five domains (medicine, geology, tourism, nuclear engineering, and public administration),
- 200,000 English-Slovene translation units (7 million words) from the EMEA corpus (added in 2010) covering the medicine area (documents from the European Medicines Agency).

The Evrokorpus is also the basis of Evroterm (a multilingual terminology database).
Identification
Period of coverage :
Version : 2002
Version history : 2010: Trans corpus and EMEA corpus added; 2008: Italian-Slovene, Spanish-Slovene and multilingual data added; 2007: French-Slovene added; 2006: German-Slovene added; 2002: English-Slovene corpus.
Production
Creation date : 2002
Applications
application Area : Research
Contents Click on the arrow to display content.
 written corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4