You are here
»
Universal Catalogue
»
Written Resources
»
Written Corpora
Language Resources
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Catalog Reference : ELRA-U-W0290
BLOGS08
BLOGS08 is a TREC test collection containing samples of the blogosphere collected once a week between January 2008 and February 2009.
It consists in a crawl of Feeds, associated Permalink and homepage documents. It represents 28,488,767 blog posts from 1,303,520 blog feeds.
Production
Creation date :
2009
Applications
Applications existing :
Discourse analysis
application Area :
Training#Research
Contents
Click on the arrow to display content.
written corpus
Number of languages
: Monolingual
Language(s) :
English
Annotation language : HTML
Friday 01 November, 2024
Joint Copyright © 2008
ELRA
&
ELDA
Universal Catalogue 1.0.4