I would be very interested if someone is aware of any small/tiny models to perform OCR, so the app can translate pictures as well
MiniCPM-V 2.6 isn't that small (8b) but it can do this.
Here is a demo.
* https://i.imgur.com/pAuTeAf.jpeg
Using this script:
* https://github.com/jabberjabberjabber/LLMOCR/
MiniCPM-V 2.6 isn't that small (8b) but it can do this.
Here is a demo.
* https://i.imgur.com/pAuTeAf.jpeg
Using this script:
* https://github.com/jabberjabberjabber/LLMOCR/