It’s a shame all these models target markdown and not something with more structure and a specificat...

ks2048 • 06/16/2025 • 2 replies • view on HN

It’s a shame all these models target markdown and not something with more structure and a specification. There are different flavors of Markdown and limited support for footnotes, references, figures, etc.

Replies

souvik3333 • 06/16/2025

Actually, we have trained the model to convert to markdown and do semantic tagging at the same time. Eg, the equations will be extracted as LaTeX equations, and images (plots, figures, and so on) will be described within the `<img>` tags. Same with `<signature>`, `<watermark>`, <page_number>.

Also, we extract the tables as HTML tables instead of markdown for complex tables.

➕ show 2 replies

starkparker • 06/16/2025

I was more excited to hear about "structured Markdown" than the LLM OCR model, but the extent of it just seems to be tagging certain elements. It's useful in the LLM context but not as much outside of it.

➕ show 1 reply

alt Hacker News

Replies