|
Language Resources |
|
|
|
Search Catalogue |
|
|
|
Send us information |
|
|
|
Languages |
|
|
|
|
|
Displaying 21 to 40 (of 97 products) |
Result Pages: 2 |
The Simple lexicon for Spanish contains 10,000 semantic units from the categories of nouns, verbs and adjectives. These units correspond to a subset of the Parole lexicon, which includes morphological and syntactic information for a total amount of 20,000 lemmas.
Language(s) : Spanish
|
|
|
|
The GLDB is described as a sense-oriented full scale lexical database. It contains 61,050 entries and each lemma comprises the following data: technical stem, spelling variation, part of speech, inflection(s), pronunciation(s), stress, morpheme division, compound boundary and element, abbreviated form, verbal nouns (for verbs).
It expresses semantic relations between words and is in this way comparable to WordNet.
Language(s) : Swedish
|
|
|
|
In the framework of the European projects PAROLE and SIMPLE, electronic lexicons with morphological, syntactic and semantic informations were developed for 12 European languages.
Languages: Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Italian, Portuguese, Spanish, Swedish.
Language(s) : Swedish - Spanish - German - French - Catalan - English - Dutch - Danish - Finnish - Greek - Italian - Portuguese
|
|
|
|
The Swedish Academy Glossary is a comprehensive historical dictionary for Swedish.
Language(s) : Swedish
|
|
|
|
The Groene Boekje is a Dutch-language database for modern Dutch spelling.
Language(s) : Dutch
|
|
|
|
The Romanian word-forms lexicon was compiled from a 35,000 lemma lexicon thanks to the EGLU natural language processing platform.
It is in SGML format and TEI compliant.
Language(s) : Romanian
|
|
|
|
Sahelia is a data bank on Sahelo-Saharan lexicon.
Language(s) : Arabic dialects
|
|
|
|
This resource for Korean comprises two parts: one containing lemmas and the other containing lexical transducers (morphological rules).
Language(s) : Korean
|
|
|
|
The latest version of the semantic lexicon contains over 12,000 handcrafted entries. It includes semantic information but also supports morphological and syntactic analysis. The semantic description relates to concepts and to properties of concepts defined in the Ontosem ontology (5,500 concepts).
Language(s) : English
|
|
|
|
This lexicon for Turkish contains 30,000 words.
Language(s) : Turkish
|
|
|
|
It contains pronunciation information for 90,988 lexical entries.
Language(s) : English (USA)
|
|
|
|
This is a large-coverage lexicon of Modern Hebrew, consisting of over 20,000 entries, presented in XML. Every lexicon item has a unique identifier, three representations (dotted, undotted and transliterated) and a script encoding deviations from the standard script. Part-of-speech category and morpho-syntactic features are also specified.
Language(s) : Hebrew
|
|
|
|
This frequency dictionary was built from the Hungarian Webcorpus. It contains the words and their frequency in the corpus. A list of the first 100,000 most frequent word forms was also created.
Language(s) : Hungarian
|
|
|
|
It contains elementary syntactic structures, based on distributional and transformational criteria.
Language(s) : French
|
|
|
|
This is a dialectological multimedia database for the Occitan language.
It is basically a collection of all existing dialectological material for Occitan. Each entry includes rich information: IPA transcriptions, lemmas, etymons, sounds and localisation on geographical maps.
Language(s) : Occitan
|
|
|
|
This is a French distributional database which has been automatically built from a corpus containing the articles of Le Monde over a 10-year period (1991-2000).
Language(s) : French
|
|
|
|
This verb lexicon was created to annotate semantically the Sensem Corpus. It describes the syntactic and semantic behavior of the 250 more frequent Spanish verbs.
Language(s) : Spanish (Spain)
|
|
|
|
This resource is based on Wordnet 3.0. Each synset is attributed a numerical score regarding to three parameters (objectivity, negativity and positivity). These scores determine how objective, positive and negative the words are.
Language(s) : English
|
|
|
|
The BOMP is a machine-readable pronunciation dictionary for German containing more than 50,000 entries which were all checked.
Language(s) : German
|
|
|
|
This database contains 75,000 entries (lemmas and affixes) with their associated linguistic features (category, subcategory, case, number, etc.). It reflects the general lexicon of standard Basque.
Language(s) : Basque
|
|
|
|
Displaying 21 to 40 (of 97 products) |
Result Pages: 2 |
|
|