You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0277
Problem Report Corpus
The Problem Report Corpus contains problem report summaries from various open source projects collected around January 18th, 2006 :
- Linux Kernel (5,916 summaries, 1.1 MB)
- Apache (1,234 summaries, 3.5 MB)
- Firefox (37,952 summaries, 10.6 MB)
- OpenOffice (38,325 summaries 6.9 MB)
- Eclipse (90,424 summaries 20.3 MB)
Problem report files are pos-tagged text (Stanford Log-linear POS Tagger) and follow the same structure.
Identification
Period of coverage :
January 2006
Version :
Version history :
Production
Creation date :
2006
Applications
application Area :
Training#Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
English
Number of tokens :
173851 summaries
Annotation Coverage : Partial
Annotation Granularity : Word
Annotation level : Morphological
Lexical Unit Information : Single word lemma
Annotation Mode : Automatic
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4