Language Resources
Displaying 481 to 500 (of 730 products)
Result Pages: 25
LOCNESS is a corpus of native English essays containing 324,304 words: British pupils' A-level essays (60,209 words), British university students' essays (95,695 words) and American university students' essays (168,400 words).
Language(s) : English
It contains over 23 million words in five language combinations related to Galician: English-Galician, Galician-Spanish, French-Galician, English-Galician-French-Spanish and Spanish-Galician-Catalan-Basque.
The parallel texts are aligned in an XML adaptation of the TMX (Translation Memory eXchange) format.
Language(s) : Galician - Spanish - English - French - Portuguese - Basque - Catalan
This is a corpus of web pages and email messages (440 documents), each labelled with one of four categories: conferences, jobs, resources and trash.
Language(s) : English
Syntactically annotated corpus of spoken and written Tibetan from different regions and time periods.
Language(s) : Tibetan
This corpus consists of 10 million morphologically annotated Korean words.
Language(s) : Korean
This corpus consists of 5.5 million semantically annotated Korean words. It is TEI-compliant.
Language(s) : Korean
This corpus contains syntactically parsed sentences (150,000 in 2003).
Language(s) : Korean
It consists of a collection of documents annotated for Cross-document Structure Theory (CST) relationships.
Language(s) : English
It contains 88,000 pairs of aligned sentences and a hundred Web newspaper articles.
Language(s) : Japanese
This is a corpus of 13,665 organization names.
Language(s) : Chinese
This is a part-of-speech tagged corpus in the biomedical domain.
Language(s) : English
The first corpus contains general-information articles about cancer and about specific cancers; it is about 430,000 words in size. The other corpus (CHEM) contains about 350,000 words of introductory chemistry articles.
Language(s) :
It consists of one million words of spoken and written English from India and contains 500 texts of approximately 2,000 words each.
Language(s) : English
It contains 1683 sentences.
Language(s) : Turkish
It contains 7000 sentences.
Language(s) : Turkish
This is a 2 million word corpus from newspapers.
Language(s) : Turkish
It is a component of the American National Corpus First Release and consists of over 4000 articles from the New York Times newswire, drawn from each of the odd-numbered days in July 2002.
Language(s) : English
It contains 4694 articles from the Slate archives published between 1996 and 2000, on topics such as News and Politics, Arts, Business, Sports, Technology, Travel, Food, etc.
Language(s) : English
The European Corpus Initiative Multilingual Corpus contains over 98 million words, covering most of the major European languages. The primary focus in this effort is on textual material of all kinds, including transcriptions of spoken material.
Language(s) : Albanian - Bulgarian - Chinese - Czech - Danish - Dutch - English - Estonian - French - Gaelic - German - Greek - Italian - Japanese - Latin - Lithuanian - Malay - Norwegian - Portuguese - Russian - Serbian - Spanish - Swedish - Turkish - Uzbek