> Why LLMs Suck at OCR I paste screenshots into claude code everyday and it's incredible. ...

mritchie712 • last Thursday at 5:40 PM • 2 replies • view on HN

> Why LLMs Suck at OCR

I paste screenshots into claude code everyday and it's incredible. As in, I can't believe how good it is. I send a screenshot of console logs, a UI and some HTML elements and it just "gets it".

So saying they "Suck" makes me not take your opinion seriously.

Replies

ritvikpandey21 • last Thursday at 6:20 PM

yeah models are definitely improving, but we've found even the latest ones still hallucinate and infer text rather than doing pure transcription. we carry out very rigorous benchmarks against all of the frontier models. we think the differentiation is in accuracy on truly messy docs (nested tables, degraded scans, handwriting) and being able to deploy on-prem/vpc for regulated industries.

mikert89 • last Thursday at 5:57 PM

they need to convince customers its what they need

alt Hacker News

Replies