Catalog Reference: ELRA-U-W 0261
frWaC French Web Corpus
frWaC is a 1.6-billion-word French corpus constructed from the Web, restricted to the .fr domain. The crawl was seeded with terms drawn from frequency word lists and basic French vocabulary lists.
The corpus is POS-tagged and lemmatized. A plain-text version without annotation is also available.
Applications
Application area: Research
Contents
written corpus
Number of languages: Monolingual
Language(s): French
Document source: Internet
Annotation coverage: Partial
Annotation granularity: Word
Annotation level: Morphological
Annotation mode: Automatic
Friday 01 November, 2024
Joint Copyright © 2008 ELRA & ELDA
Universal Catalogue 1.0.4