Hacker News

Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages

10 points by mrkn1, today at 11:16 AM, 6 comments

Comments

abstract257, today at 1:20 PM

Curious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...

garrett2558, today at 12:03 PM

Very cool, I'm building my own local-first product as well

BIGFOOT_EXISTS, today at 2:05 PM

Now this is legit cool, keep up the great work.
