ELRA - ELRA-U-W 0099 : Hinoki Treebank / Sensebank

You are here » Universal Catalogue » Written Resources » Written Corpora

Language Resources

Search Catalogue

Send us information

Would you like to collaborate ?
Contact Us

Languages

Catalog Reference : ELRA-U-W 0099

Hinoki Treebank / Sensebank

The Hinoki is built from dictionary definition sentences (38,900, 3 million words). It contains syntactic annotation based on HPSG and word sense tagging (lexical and structural semantic information). First, the corpus is automatically parsed and then the correct analysis is selected by the annotator. It is inspired by the Lingo Redwoods initiative.

It has been designed for natural language understanding and has already been used to automate the acquisition of thesauruses from computational dictionaries.

Definition sentences of the Hinoki are taken from the Lexeed Semantic Database of Japanese.

Applications


application Area : Research

Contents

Click on the arrow to display content.

written corpus
Number of languages : Monolingual
Language(s) : Japanese (Japan)
Annotation Coverage : Full
Annotation Granularity : Word
Annotation level : Semantic
Annotation Mode : Semi automatic