Catalog Reference : ELRA-U-M0045
IndoWordnet
The IndoWordnet (IWN) is a lexical-semantic network that is structured along the same lines as the Princeton WordNet (lexical reference system).
In a wordnet, which is essentially a semantic network, the words of each lexical category (nouns, verbs, etc.) are organised into 'synsets' (sets of synonyms). Each synset represents a lexical concept, and synsets can be linked by different types of relation (hyperonymy, antonymy, etc.). So that concepts can be matched across languages, all the wordnets use the same system of synset identification.
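The structure described above can be sketched in a few lines of Python. This is only an illustration, not the IndoWordnet distribution format: the synset IDs (100, 101), the transliterated words, the relation name "hypernym" and the helper functions are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Synset:
    synset_id: int                 # identifier shared by every language
    pos: str                       # lexical category, e.g. "noun"
    words: List[str]               # the synonyms that make up the synset
    relations: Dict[str, List[int]] = field(default_factory=dict)


# Hypothetical synset IDs, chosen only for this sketch.
LIQUID, WATER = 100, 101

# One wordnet per language, each keyed by the same synset IDs.
# The transliterated words are illustrative, not taken from the resource.
wordnets: Dict[str, Dict[int, Synset]] = {
    "hindi": {
        LIQUID: Synset(LIQUID, "noun", ["drav", "taral"]),
        WATER: Synset(WATER, "noun", ["paani", "jal"], {"hypernym": [LIQUID]}),
    },
    "marathi": {
        LIQUID: Synset(LIQUID, "noun", ["drav"]),
        WATER: Synset(WATER, "noun", ["paani", "jal"], {"hypernym": [LIQUID]}),
    },
}


def translate(word: str, source: str, target: str) -> List[str]:
    """Collect target-language words from every synset that contains `word`."""
    results: List[str] = []
    for synset_id, synset in wordnets[source].items():
        if word in synset.words and synset_id in wordnets[target]:
            results.extend(wordnets[target][synset_id].words)
    return results


def hypernyms(synset_id: int, language: str) -> List[Synset]:
    """Follow the 'hypernym' relation of one synset within a language."""
    synset = wordnets[language][synset_id]
    return [wordnets[language][h] for h in synset.relations.get("hypernym", [])]


print(translate("jal", "hindi", "marathi"))
print([s.words for s in hypernyms(WATER, "hindi")])
```

Because both languages key their synsets by the same identifier, moving from one language to another is a plain dictionary lookup and needs no word-level alignment.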
The IndoWordnet includes several existing wordnets and covers 17 Indian languages:
- Assamese: 3,530 synsets, 19,609 unique-words,
- Bengali: 8,679 synsets, 18,563 unique-words,
- Bodo: 3,837 synsets, 13,357 unique-words,
- Gujarati: 970 synsets, 2,125 unique-words,
- Hindi: 33,900 synsets, 82,000 unique-words,
- Kannada: 5,920 synsets, 7,344 unique-words,
- Kashmiri: 6,569 synsets, 8,674 unique-words,
- Malayalam: 6,154 synsets, 8,622 unique-words,
- Manipuri: 2,744 synsets, 5,231 unique-words,
- Marathi: 9,739 synsets, 21,223 unique-words,
- Nepali: 5,802 synsets, 10,278 unique-words,
- Oriya: no indication,
- Punjabi: no indication,
- Sanskrit: 3,340 synsets, 17,820 unique-words,
- Tamil: 4,750 synsets, 9,821 unique-words,
- Telugu: 10,639 synsets, 18,250 unique-words,
- Urdu: 123 synsets, 9,641 unique-words.
Work is still in progress to enlarge the IndoWordnet. The Hindi WordNet, which serves as the pivot, is also being linked to the English WordNet; 13,693 synsets have been linked so far.
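As a rough sketch of what a partial pivot link implies, the snippet below treats the Hindi-to-English links as a lookup table that may have no entry for a given synset, and computes the share of Hindi synsets linked so far. The table contents, the ID 101, the English identifier string and the function name are hypothetical. The coverage figure assumes that the 13,693 linked synsets and the 33,900 Hindi synsets quoted here refer to the same version of the resource.

```python
from typing import Dict, Optional

# Hypothetical link table: Hindi (pivot) synset ID -> English WordNet synset.
# Both the ID and the English identifier are placeholders for this sketch.
hindi_to_english: Dict[int, str] = {101: "en-water-n"}


def english_synset(hindi_id: int) -> Optional[str]:
    """Return the linked English synset, or None if no link exists yet."""
    return hindi_to_english.get(hindi_id)


# Figures quoted in this catalogue entry.
HINDI_SYNSETS = 33_900
LINKED_SYNSETS = 13_693

coverage = LINKED_SYNSETS / HINDI_SYNSETS
print(f"Linked share of Hindi synsets: {coverage:.1%}")  # 40.4%
```

Any code that goes through the pivot therefore has to handle the unlinked case, since under these figures more than half of the Hindi synsets have no English counterpart yet.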
A wordnet is a basic resource for computational linguistics and language engineering applications (machine translation, information extraction, word sense disambiguation, knowledge representation, etc.).
Identification
Period of coverage :
Version : 2010
Version history :
Contents
written lexicon
Number of languages : Multilingual
Language(s) : Assamese ; Bengali ; Gujarati ; Hindi ; Kannada ; Kashmiri ; Malayalam ; Marathi ; Nepali ; Oriya ; Panjabi, Punjabi ; Sanskrit ; Tamil ; Telugu ; Urdu
Joint Copyright © 2008 ELRA & ELDA