logoalt Hacker News

AlotOfReadingtoday at 12:06 AM2 repliesview on HN

Translating anything that renders on my screen is the same two clicks to open an LLM with the screen contents. I expect that will become an increasingly universal experience as LLM features get shoved into every nook and cranny of tech.


Replies

refactor_mastertoday at 1:28 AM

There's been a translate button for years which hooks deep into every nook and cranny of the website's HTML. It works great, it's built in and many restaurants even advertise it for tourists, because it's a zero-effort translation of their existing menu. Plus, it's low-data when you're inside a 1-bar basement restaurant.

Using an LLM to translate the visible part of a PDF on a mobile... seems like the worst possible solution to the problem.

show 1 reply
jmyeettoday at 1:29 AM

Translating PDFs is more complicated than that because the strcture of a PDF document doesn't lend itself well to this kind of thing.

For example: if there's a dish name with a 2 line description below it and some allergy symbols below that, in HTML you can imagine the document structure that produces that. In PDF terms that might be 4 separate objects and, in particular, the eyes can see the two lines are adjacent so they fit together but the document structure doesn't really represent it taht way, necessarily.

This might also not work with translation because the lines are set for the size of the text they contain. Same for resizing the font.

Put another waay, PDF should be viewed as a typeset and layout format, not a document format.

show 1 reply