logoalt Hacker News

leketoday at 6:27 PM1 replyview on HN

Do you happen to know if LLMs have issues reading PDFs? Would they prefer EPUB format for example?


Replies

rovr138today at 7:11 PM

Everything has issues reading the content of PDFs natively. It's a format for displaying/rendering. Not for storing format in a way that's easy to parse for the text/content inside.

Is this one storing text or storing coordinates for where to draw a line for the letter 'l'? Is that an 'l' or a line?

The best way to do this is rendering it to an image and using the image. Either through models that can directly work with the image or OCR'ing the image.

show 1 reply