European Parliament Proceedings Parallel Corpus 1996-2011

Taxonomy :

The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 21 European languages.


- Other info -

Language(s) :

Bulgarian
Czech
Danish
German
Greek
English
Spanish
Estonian
Finnish
French
Hungarian
Italian
Lithuanian
Latvian
Dutch
Polish
Portuguese
Romanian
Slovak
Slovene
Swedish

Types : parallel corpus
Domain : European Parliament Proceedings
Size : 596,694,486 words
Developer : Philipp Koehn
Availability : Free
Update: 2012