logoalt Hacker News

ljouhettoday at 1:04 PM2 repliesview on HN

Real question: what tool do you use? (for long/complex documents with tables, code, maths)

- marker (with --force-ocr) gives me the best results

- Mistral OCR (seems really great, but I never managed to get it work)

- Mathpix (tried a long time ago)

- docling (gives me garbage, I must use it wrong)

- Unlimited OCR (will try it)

- ???


Replies

Orastoday at 1:22 PM

- Azure Document Intelligence (has an option to return markdown too including headers and footers).

- AWS Textract

show 1 reply
ai_fry_ur_braintoday at 1:49 PM

poma-ai has really great chunking techniques that chunk the document based on the document structure/heirarchy.

We use it on 200 page IEEE standards that are notoriously complex, filled with tables and diagram. Highly reccomend.