Hacker News

speff yesterday at 12:11 AM

This might be a good place to check the options available for OCR in-place translations. I took a look at OCR3, but it doesn't seem to support my use case. It looks more tailored towards data extraction for further processing.

I've got some foreign artbooks that I would like to get translated. The translations would need to be in place, since the placement of the text relative to the pictures around it is fairly important. I took a look at some paid options online, but they seemed to choke, mostly because of the non-standard text placement.

The best solution I could come up with is using Google Lens to overlay a translation while I go through the books, but holding a camera/tablet up to my screen isn't very comfortable. Chrome has Lens built in, but (IIRC) I still need to manually select sections for it to translate, so it's not as easy to use as just holding my phone up.

Anyone know of any progress towards in-place OCR/translations?


Replies

claar yesterday at 12:34 AM

If you don't mind a paid solution, try DeepL. I also use Word's built-in document translation to good effect.

haraldooo yesterday at 5:24 AM

I’m fairly confident this is solvable quite well with “just two api calls”. Are examples of those books available online?
