logoalt Hacker News

ChrisKnotttoday at 1:42 PM0 repliesview on HN

Is there a SOTA OCR model that prioritises failing in a debuggable way?

What I want is an output that records which sections of the image have contributed to each word/letter, preferably with per word confidence levels and user correctable identification information.

I should be able to build a UI to say: no, this section is red-on-green vertically aligned Cyrillic characters; try again.