ELRA - ELRA-U-W0323 : BioInfer Corpus

You are here » Universal Catalogue » Written Resources » Written Corpora

Language Resources

Search Catalogue

Send us information

Would you like to collaborate ?
Contact Us

Languages

Catalog Reference : ELRA-U-W0323

BioInfer Corpus

This is an annotated corpus of biomedical English containing 1100 sentences. It consists of biomedical research articles' abstracts annotated for relationships, named entities, and syntactic dependencies (in both the Stanford and Link Grammar schemes). Syntactic dependencies have been fully manually corrected.

In addition to this annotated corpus, the Biomedical Information Extraction Resource (BioInfer) provides others related resources:
- a binarised version of the corpus (1.2.0b version) with relations between proteins represented as binary relationships,
- ontologies defining types of entities and relationships annotated in the corpus,
- supporting softwares to explore the corpus (parser, extractor, vizualiser).

Identification

Period of coverage :

Version : 1.1.1
Version history : Version 1.1.0: 2007 Version 1.0.1: 2006

Production

Creation date : 2006

Contents

Click on the arrow to display content.

written corpus
Number of languages : Monolingual
Language(s) : English
Annotation Coverage : Full
Annotation Granularity : Word#Sentence
Annotation language : XML