You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W 0158
Mongolian Written Corpus
This is a Mongolian corpus of 5 million words, covering various domains: laws, literature and newspaper. It is POS tagged and syntactically annotated.
It contains texts collected from 144 texts from law, 278 stories, 8 novelettes, 4 novels, 597 news, 505 interviews, 302 reports, 578 essays, 469 stories, and 1258 editorials from newspapers.
The aim is to support the development of NLP tools for Mongolian (spell ckecker, POS tagger, sentence parser).
Applications
application Area :
Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
Mongolian
Saturday 23 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4