Universal Catalogue  
  You are here » Universal Catalogue » Written Resources » Written Corpora
Language Resources
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Send us information
Would you like to collaborate ?
Contact Us
Languages
Anglais
Catalog Reference : ELRA-WC0001
BulTreeBank
BulTreeBank is a treebank for Bulgarian annotated with detailed syntactic information within the framework of HPSG.

The Dependency Part of BulTreeBank (BulTreeBank-DP) represents only the dependency information. It contains about 196,000 tokens.

The Morphologically Annotated Part of BulTreeBank (BulTreeBank-Morph) represents only the morphological information. It contains about 214,000 tokens.

The text archive is also available. It is a collection of Bulgarian texts from the Internet (more than 90,000,000 running words) covering different genres: fiction, newspapers, legal texts, etc.
Format of the archive: XML (TEI scheme).
Production
Project : The BulTreeBank Project
Applications
application Area : Research
Contents Click on the arrow to display content.
 written corpus 
 

Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4