Call me when it can do Russian Cursive.

__alexs • today at 9:44 AM • 2 replies • view on HN

Replies

Seems to do an OK job:

This is a random image from Twitter with no transcript or English translation provided, so it's not going to be in the training data.

➕ show 3 replies

myth_drannon • today at 1:51 PM

Right, it can do modern writing but anything older than a century ( church records and census)and it produces garbage. Yandex Archives figured that out and have CER in a single digit but they have the resources to collect immense data for training. I'm slowly building a dataset for finetuning TROCR model and the best it can do is CER 18% ... which is sort of readable.

➕ show 1 reply

alt Hacker News

Replies