|
Language Resources |
|
|
|
Search Catalogue |
|
|
|
Send us information |
|
|
|
Languages |
|
|
|
|
|
Displaying 81 to 97 (of 97 products) |
Result Pages: 5 |
This is a lexicon in Galician which contains 723,998 entries. It provides part-of-speech tag and lemma for each word.
Language(s) : Galician
|
|
|
|
This is a lexicon for English verbs valency which contains 6,397 verbs, including subcategorization (SCF) combinations and frequency informations, for a total of 212,741 entries (around 33 entries per verb).
Language(s) : English
|
|
|
|
This is a lexicon of Czech verbs with informations about valency. It contains 2,730 verb entries and 6,460 lexical units (valency frames).
Language(s) : Czech
|
|
|
|
This is a XML formatted and syntactically annotated Vietnamese lexicon. It contains about 40,000 entries.
Language(s) : Vietnamese
|
|
|
|
This is a XML formatted lexicon in Nepali. It contains 11,000 entries.
Language(s) : Nepali
|
|
|
|
This is a semantically structured lexicon of around 45,800 single word entries and 18,700 multi-word expression entries in English.
Language(s) : English
|
|
|
|
This lexicon contains nearly 35,000 entries. Sinhala (or Sinhalese) is a language spoken in Sri Lanka.
Language(s) : Sinhalese
|
|
|
|
This is a lexicon for Russian. It contains 13,153 single-words, 4,444 proper names and 713 multi-word expressions (MWE) templates.
Language(s) : Russian
|
|
|
|
This is a French subcategorisation lexicon extracted from a syntactically annotated treebank. It contains about 2,000 French verbs with their valence frames.
Language(s) : French
|
|
|
|
This lexicon of pronunciation contains over 1.4 million lexical entries. Each entry includes words, corresponding pronunciations, lemmas and morphosyntactic descriptors of lexical entries.
Language(s) : Slovenian
|
|
|
|
The Chinese Character dictionary (CCDICT) contains about 52,000 entries. It consists of pronunciations for Mandarin, Cantonese, Hakka (Kejia) and other Chinese languages (in the mostly used phonetic systems).
Language(s) : Chinese
|
|
|
|
The VfrLPL is a syntactical lexicon of French verbs. It contains about 8,800 entries (6,700 distinctive lemmas). Each verb is provided with a set of conjugate forms, phonetic counterpart, frequency of usage and syntactical informations (like the associated auxiliary, pronominal characterics, transitive nature, etc).
Language(s) : French (France)
|
|
|
|
This is a lexical representation of affective knowledge. It consists of affective concepts correlated with affective words.
Language(s) : English
|
|
|
|
This is a database which contains 250,000 toponymic entries covering 40 regions in Finland. For the moment, about 10% of them have been converted to a computer format. It is still under development.
Language(s) : Finnish
|
|
|
|
Maltilex is a computational lexicon in Maltese. It is still under development.
Each entry contains: headword, lemma, gender, number, person, valency/argument structure information, root or stem information, variants, ...
Language(s) : Maltese
|
|
|
|
This resource is a speech database containing 17 hours of records from radio broadcast news (from 2003-2004) read by 31 speakers (17 females and 14 males). News cover political, economical, cultural, sport areas of local and foreign affairs.
Language(s) : Lithuanian
|
|
|
|
The Formosan (Taiwan Austronesian) archive is aimed to be a multimodal archive for endangered Formosan languages.
The Chinese archive is divided into 5 sub-groups:
- "Early Mandarin Chinese Lexicon",
- "Lexicon of Pre-Qin Bronze Inscriptions and Bamboo Scripts (LBB)",
- "Modern Chinese Corpus and Treebank" (Sinica corpus and treebank),
- "New Age Corpus: Linguistic Representations and Archives of Multimedia Data",
- "Southern-Min Archive: A Database of Historical Change in Language Distribution".
Language(s) : Chinese
|
|
|
|
Displaying 81 to 97 (of 97 products) |
Result Pages: 5 |
|
|