logoalt Hacker News

deivid01/22/20251 replyview on HN

I would be very interested if someone is aware of any small/tiny models to perform OCR, so the app can translate pictures as well


Replies

Eisenstein01/22/2025

MiniCPM-V 2.6 isn't that small (8b) but it can do this.

Here is a demo.

* https://i.imgur.com/pAuTeAf.jpeg

Using this script:

* https://github.com/jabberjabberjabber/LLMOCR/