Universal Catalogue  
  You are here » Universal Catalogue » Written Resources » Written Corpora
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-WC0246
Trilingual Parallel Computer Corpus
This trilingual parallel computer corpus of 2,000,000 words has been compiled in order to study the syntactic and semantic structure of the preposition and particle "up/on" and its prototypical equivalents in French ("sur") and Dutch ("op"). All texts are aligned at paragraph level. Linked with this source corpus, a trilingual database TRIPTIC (TRIlingual Parallel Text Information Corpus) has been developed to facilitate the linguistic analysis of selected sample sentences.
The corpus is divided into two subcorpora (fiction and non-fiction), each containing approximately 1,000,000 words. The non-fiction subcorpus consists of extracts from the UNESCO Courier (1993) and from the verbatim transcription of the plenary sessions of the Debates of the European Parliament (1993) (further called the Debates subcorpus). The fiction subcorpus consists of extracts from 12 original fiction texts, four per language. From each of these works, all published between 1982 and 1992, the first chunk of approximately 25,000 words (up to the end of a sentence) was selected. Twelve original extracts translated into two languages gives a total of 36 extracts.
Contents Click on the arrow to display content.
 written corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4