logoalt Hacker News

ontouchstarttoday at 3:18 AM0 repliesview on HN

When I discovered that some local llama.cpp can OCR PDF images generated by TeX, I started to revisit literate programming defined by Donald Knuth and explore using PDF as the source of truth artifact (instead of Markdown or program source code itself) for LLM to consume.

I only got to the point of having code and data as \verbatim in \LaTeX. Next step is CWEB.

Here is an example (with C and Rust code in \verbatim)

https://ontouchstart.github.io/rabbit-holes/llm_rabbit_hole_...

The ultimate goal is machine and human readable proofs on algorithms.