Catalog Reference : ELRA-U-W0381
INTERREG corpus
This parallel corpus consists of general-domain web documents. It contains 4,190,000 tokens in Greek, 3,430,000 tokens in Bulgarian, and 3,900,000 tokens in English.
It was created in the framework of the project "Transfer and Adaptation of Text-to-Speech Technology to the Bulgarian Language and its Application in Cultural Tools with Emphasis on Accessibility".
Contents
written corpus
Number of languages : Multilingual
Language(s) : Greek ; Bulgarian ; English
Document source : Internet
Alignment : Parallel
Saturday 23 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4