logoalt Hacker News

GZGavinZhaolast Friday at 10:35 PM3 repliesview on HN

Does it handle math expressions (those rendered from LaTeX) well? I've been looking for a good OCR model to transcribe my math textbooks into markdown (obviously ignoring the images and figures) with LaTeX as math expressions, and none of the current OCR models work reliably enough.

EDIT: you can try it yourself for free at https://console.mistral.ai/build/document-ai/ocr-playground once you create a developer account! Fingers crossed to see how well it works for my use case.


Replies

loaf_apilast Friday at 11:32 PM

I've just finished processing thousands of documents using the Gemini Pro 3 vision model and it outperformed every OCR and image model I've tested by a long shot, perfect markdown with latex for the math every time.

show 2 replies
nerbertyesterday at 8:45 AM

Just need to open the link to answer that question.

RagnarDlast Friday at 10:56 PM

Please post an update on how well it works for you.