Catalog Reference: ELRA-WC316
Hunglish Corpus Written Resources
The Hunglish Corpus is a sentence-aligned English–Hungarian parallel corpus. It contains 23.7 million English words and 29.4 million Hungarian words in 2.07 million sentence pairs, drawn from five genres of text: legal texts (the EU Constitution; 951,491 sentence pairs), literature (652,142 sentence pairs), software documentation (135,472 sentence pairs), movie subtitles (324,174 sentence pairs), and magazines and news (10,276 sentence pairs). A sixth genre, the business domain (financial reports of Hungarian companies), is being processed.
written corpus
Number of languages: Bilingual
Language(s): Hungarian; English
Alignment: Sentence
Joint Copyright © 2008 ELRA & ELDA