Is there a SOTA OCR model that prioritises failing in a debuggable way? What I want is an output t...

ChrisKnott • today at 1:42 PM • 0 replies • view on HN

Is there a SOTA OCR model that prioritises failing in a debuggable way?

What I want is an output that records which sections of the image have contributed to each word/letter, preferably with per word confidence levels and user correctable identification information.

I should be able to build a UI to say: no, this section is red-on-green vertically aligned Cyrillic characters; try again.

alt Hacker News