Did you take any steps to decrease the dimension size of images, if this increases the performance? ...

originalvichy • today at 3:57 PM • 1 reply

Did you take any steps to reduce the image dimensions, in case that improves performance? I have not tried this myself, as I have not performed an OCR task like this with an LLM. I would be interested to know at what size the VLM can no longer reliably make out the details in the text.

Replies

embedding-shape • today at 3:59 PM

The performance is OK; it takes a couple of seconds at most per image on my GPU. It's just the number of documents to get through that takes time, even with parallelism. The dimensions seem fine as they are, as far as I can tell.
