Another United Nations parallel corpus: WIPO Corpus Of Parallel Patent Applications

WIPO (World Intellectual Property Organization) has released a free-for-research parallel corpus of patent applications in English and French (COPPA). It is significantly larger than my corpus, but it covers only two of the six official languages.

It should be useful for machine learning techniques and – I suspect – would [...]

Being useful to translators

The UN Corpora site offers a version of the corpus in TMX format, ready for import into a computer-assisted translation (CAT) tool such as Trados.
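
For readers who would rather inspect a TMX file in a script than in a CAT tool, here is a minimal sketch using only the Python standard library. The function name `read_tmx_pairs`, the sample segment, and the default language codes are hypothetical and chosen for illustration; the actual files may use different language codes or extra markup inside segments.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="en", tgt_lang="fr"):
    """Yield (source, target) segment pairs from a TMX file path or file object."""
    tree = ET.parse(source)
    for tu in tree.getroot().iter("tu"):
        segs = {}
        for tuv in tu.iter("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain "lang" attribute.
            lang = tuv.get(XML_LANG) or tuv.get("lang") or ""
            seg = tuv.find("seg")
            if seg is not None:
                # Normalise "EN-US" to "en" and flatten any inline markup.
                key = lang.lower().split("-")[0]
                segs[key] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            yield segs[src_lang], segs[tgt_lang]


# Hypothetical one-unit sample, not taken from the corpus itself.
SAMPLE_TMX = """<tmx version="1.4"><header/><body>
<tu>
  <tuv xml:lang="EN"><seg>General Assembly</seg></tuv>
  <tuv xml:lang="FR"><seg>Assemblée générale</seg></tuv>
</tu>
</body></tmx>"""

pairs = list(read_tmx_pairs(io.StringIO(SAMPLE_TMX)))
for en, fr in pairs:
    print(en, "->", fr)
```

Because `ET.parse` loads the whole file into memory, a very large TMX file would probably be better handled with `ET.iterparse`, clearing each translation unit after it has been read.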

Jost Zetzsche mentioned the corpus in the 190th issue of The Tool Kit, the technology newsletter for translators, and also in a tweet (as Jeromobot) as part of a collection of [...]