Wikipedia Corpus

Taxonomy :

This corpus contains the full text of Wikipedia, and it contains 1.9 billion words in more than 4.4 million articles.


- Other info -

Language(s) :

English

Types : monolingual corpus
Domain : Wikipedia
Size : 1.9 billion
Developer : Mark Davies
Availability : Free
Update: 01/2015