Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages

10 points • by mrkn1 • today at 11:16 AM • 6 comments

Comments

Curious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...

garrett2558 • today at 12:03 PM

Very cool. I'm building my own local-first product as well.

BIGFOOT_EXISTS • today at 2:05 PM

Now this is legit cool, keep up the great work.
