EUR-Lex Corpus

Taxonomy :

THe EUR-Lex Corpus is a multilingual corpus in all the official languages of the European Union. The corpus has been built from HTML files available in EUR-Lex database. Thanks to the coverage of a vast area of subjects, the corpus is an excellent general purpose resource for anyone looking for translation examples in many languages.


- Other info -

Language(s) :

Bulgarian
Spanish
Czech
Danish
German
Estonian
Greek
English
French
Irish
Croatian
Italian
Latvian
Lithuanian
Hungarian
Maltese
Dutch
Polish
Portuguese
Romanian
Slovak
Slovenian
Finnish
Swedish

Types : multilingual corpus
Domain : EU legislation (includes many domains)
Size : 0.4M~25M per pair of language
Developer : European Commission
Availability : Registration required
Update: 2016