logoalt Hacker News

Orastoday at 12:23 PM3 repliesview on HN

Traditional OCR is faster, cheaper, and much more reliable than LLMs


Replies

j16sdiztoday at 12:40 PM

If you consider non-English script, traditional OCR is not more reliable.

CJK have lots of character and high confusion rate.

Arabic scripts are complex and have lots of morphs.

Vietnamese have easily confused diacritics.

Thai have lots of non-standard fonts.

ta988today at 12:28 PM

I don't think that's a universal statement that aplies to every kind of documents and languages. Mistral OCR is able to do things no "traditional" OCR was ever able to.

JodieBeniteztoday at 12:56 PM

I wish it were. Alas...