logoalt Hacker News

Grosvenoryesterday at 4:27 PM1 replyview on HN

And we have those French/English text corpora in the form of Canadian law. All laws in Canada at the federal level are written in English and French.

This was used to build the first modern language translation systems, testing them going from English->french->english. And in reverse.

You could do similar here , understanding that your language is quite stilted legalese.

Edit: there might be other countries with similar rules in place that you could source test data from as well.


Replies

adamzwassermanyesterday at 5:13 PM

Incredibly, I had not thought to use that data set.

Now I will. Thanks.

show 1 reply