logoalt Hacker News

ge96today at 2:57 PM3 repliesview on HN

1000 pages for $4? damn how does it compare to llama parse I wonder


Replies

aliljettoday at 3:26 PM

I was just using infinity parser 2 (flash, to be fair) for pennies self-hosted to run through thousands of pages of documents with remarkable confidence. I decided to use https://huggingface.co/datasets/allenai/olmOCR-bench to determine what was the best OCR tool, yesterday, but I've got no idea what the best is now. What is the dominant OCR eval right now? Between Baidu and Mistral this morning, I wonder if there's a new tool to switch to..

freezed8today at 3:35 PM

(jerry from llamaindex here) we're gonna benchmark on ParseBench and report the results!

thenthenthentoday at 3:08 PM

Or Apples local OCR/Vision models?