Catalog Reference : ELRA-U-W 0074
Modern Hebrew Treebank
The Modern Hebrew Treebank (MHT) contains 6,500 sentences of news items from the Ha'aretz daily newspaper. It is segmented and analysed at the morpho-syntactic level (parsed automatically, then manually corrected). Its tagset is an adaptation for Hebrew of the one used in the English Penn Treebank.
In the Modern Hebrew Treebank the annotated words are segmented into morphemes. Special annotation features have also been added to mark cases where the morpho-syntactic features of a node are inherited from one or more of its children ('father-child dependencies').
A version of the Treebank with only the morphological level is also available.
Identification
Period of coverage :
Version : v2.0
Version history :
Applications
Application Area : Education, Research
Contents
written corpus
Number of languages : Monolingual
Language(s) : Hebrew (Israel)
Annotation Coverage : Full
Annotation Granularity : Morpheme
Annotation level : Syntactic
Annotation Mode : Automatic
Joint Copyright © 2008 ELRA & ELDA