Catalog Reference: ELRA-U-W0331
TR-CoNLL
The TR-CoNLL corpus contains 946 news articles (204,566 tokens) from the CoNLL shared task, in which 6,980 toponym instances have been annotated in TRML (Toponym Resolution Markup Language), an XML-based markup language.
Toponym Resolution (TR) is the task of mapping a set of potentially ambiguous place names to the latitude/longitude coordinates of the places they are intended to refer to, taking the textual context into account.
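To make the task concrete, here is a minimal sketch of context-based toponym resolution. It is not the method described by Leidner and does not use the TR-CoNLL data: the tiny gazetteer, the cue words, the `Candidate` class, and the `resolve` function are all hypothetical and exist only for illustration. The heuristic picks the candidate whose cue words overlap most with the surrounding text and falls back to the first listed candidate when nothing matches.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    """One possible referent of a place name (hypothetical structure)."""
    name: str
    lat: float
    lon: float
    cues: frozenset  # context words that hint at this referent


# Toy gazetteer, assumed for illustration only. Coordinates are approximate.
GAZETTEER = {
    "paris": [
        Candidate("Paris, France", 48.8566, 2.3522,
                  frozenset({"france", "french", "seine"})),
        Candidate("Paris, Texas, USA", 33.6609, -95.5555,
                  frozenset({"texas", "usa", "dallas"})),
    ],
    "london": [
        Candidate("London, UK", 51.5074, -0.1278,
                  frozenset({"england", "uk", "british", "thames"})),
        Candidate("London, Ontario, Canada", 42.9849, -81.2453,
                  frozenset({"ontario", "canada", "canadian"})),
    ],
}


def resolve(toponym: str, context: str) -> Optional[Candidate]:
    """Return the candidate whose cue words best match the context."""
    candidates = GAZETTEER.get(toponym.lower())
    if not candidates:
        return None  # place name not in the gazetteer
    # Lowercase the context and strip surrounding punctuation from each word.
    words = {w.strip(".,;:!?\"'()").lower() for w in context.split()}
    # max() keeps the first maximal element, so ties (including zero
    # overlap) fall back to the first listed candidate.
    return max(candidates, key=lambda c: len(c.cues & words))


if __name__ == "__main__":
    hit = resolve("Paris", "The mayor of Paris, Texas spoke on Monday.")
    print(hit.name, hit.lat, hit.lon)
```

A real system would draw candidates from a full gazetteer and use much richer context than a bag of cue words; the sketch only shows the shape of the input (an ambiguous name plus its context) and of the output (one coordinate pair).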
This data set was described by Leidner (2006, 2008). It serves as a gold standard for evaluating automatic toponym resolution systems.
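Evaluation against gold-standard coordinates can be sketched as follows. The metric here, the share of toponyms resolved to within a distance threshold of the gold coordinates, is an assumption chosen for illustration and not necessarily the measure used by Leidner. The function names `haversine_km` and `accuracy_within`, the threshold, and the sample coordinates are likewise hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def accuracy_within(gold, predicted, threshold_km=1.0):
    """Fraction of toponyms resolved within threshold_km of the gold point.

    gold and predicted are parallel lists of (lat, lon) pairs; a prediction
    of None (system gave no answer) counts as a miss.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        return 0.0
    hits = sum(
        1
        for g, p in zip(gold, predicted)
        if p is not None and haversine_km(g, p) <= threshold_km
    )
    return hits / len(gold)


if __name__ == "__main__":
    gold = [(48.8566, 2.3522), (51.5074, -0.1278)]       # illustrative values
    predicted = [(48.8566, 2.3522), (42.9849, -81.2453)]  # second one is wrong
    print(accuracy_within(gold, predicted))
```

Counting unanswered toponyms as misses keeps the score comparable across systems that decline to resolve some names.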
Contents
written corpus
Number of languages: Monolingual
Language(s): English
Number of tokens: 204,566
Friday 01 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4