Hacker News

dvrp, yesterday at 7:42 AM

Yes. Many were also PPM images (or encoded as such) inside PDFs, and I then used cheap, lightweight multimodal LLMs to classify the documents from photos. It was surprisingly cheap: under $1 for a few thousand PDFs/images.
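For anyone wanting to check whether an extracted image stream really is PPM before sending it anywhere, here is a minimal sketch. It only assumes the standard PPM header layout (magic number `P3` for ASCII or `P6` for binary, followed by width, height, and max value, with optional `#` comments). The function name `read_ppm_header` is hypothetical, and the PDF extraction and LLM classification steps are not shown.

```python
from typing import Optional, Tuple


def read_ppm_header(data: bytes) -> Optional[Tuple[str, int, int, int]]:
    """Return (magic, width, height, maxval) for PPM bytes, or None if not PPM."""
    # PPM files start with "P3" (ASCII pixels) or "P6" (binary pixels).
    if data[:2] not in (b"P3", b"P6"):
        return None

    values = []
    pos = 2
    # Collect the three header integers, skipping whitespace and '#' comments.
    while len(values) < 3 and pos < len(data):
        ch = data[pos:pos + 1]
        if ch.isspace():
            pos += 1
        elif ch == b"#":
            newline = data.find(b"\n", pos)
            pos = len(data) if newline == -1 else newline + 1
        else:
            end = pos
            while end < len(data) and data[end:end + 1].isdigit():
                end += 1
            if end == pos:  # non-numeric junk where a number was expected
                return None
            values.append(int(data[pos:end]))
            pos = end

    if len(values) < 3:
        return None
    width, height, maxval = values
    return data[:2].decode("ascii"), width, height, maxval
```

Anything that returns `None` here would need a different decoder before classification.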