we disagree! we've found llms by themselves aren't enough and suffer from pretty big failure modes like hallucination and inferring text instead of transcribing it verbatim. we wrote a blog post about this [1]. the right approach so far seems to be a hybrid workflow that only uses very specific parts of the language model architecture.
[1] https://www.runpulse.com/blog/why-llms-suck-at-ocr
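to make the distinction concrete, here's a rough sketch of one way a hybrid pipeline can be wired up (not our production system, which the post goes into): a traditional OCR engine does the actual transcription, and the language model is scoped to repairing OCR noise rather than reading the page itself. the helper names, prompt, and model choice are just illustrative assumptions.

    import pytesseract              # classical OCR engine wrapper
    from PIL import Image
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set; swap in whatever model client you use

    def ocr_page(path: str) -> str:
        # deterministic transcription step: the OCR engine, not the llm,
        # is the source of truth for what characters are on the page
        return pytesseract.image_to_string(Image.open(path))

    def llm_cleanup(raw_text: str) -> str:
        # the model only repairs character-level OCR noise; the prompt
        # forbids adding, inferring, or summarizing content
        prompt = (
            "fix spacing and obvious character-level OCR errors in the text below. "
            "do not add, infer, or summarize anything.\n\n" + raw_text
        )
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
            temperature=0,
        )
        return resp.choices[0].message.content

    print(llm_cleanup(ocr_page("scanned_page.png")))

the point of the split is that hallucination can't introduce content that isn't in the OCR output, because the model never sees the image, only the transcribed text it's asked to clean up.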