logoalt Hacker News

predkambrijlast Saturday at 6:41 PM1 replyview on HN

CSV is a lot lighter on tokens, compared to json, so it can go further, before a LLM gets exhausted.


Replies

finnborgelast Saturday at 7:36 PM

If you haven't already seen the DeepSeek OCR paper [1], images can be profoundly more token-efficient encodings of information than even CSVs!

[1]: https://github.com/deepseek-ai/DeepSeek-OCR/blob/main/DeepSe...