logoalt Hacker News

zeeshana07xtoday at 9:46 AM2 repliesview on HN

The gap between how this is described in the paper vs the blog post is pretty wide. Would be nice to see more accessible writing from research teams — not everyone reading is a ML engineer


Replies

om8today at 9:58 AM

These are very different media types with very different goals.

dev_tools_labtoday at 10:10 AM

Agreed. The practical implications are often more interesting than the math anyway — smaller models running locally means you can afford to run multiple models in parallel for cross-validation, which changes how you approach tasks like code analysis or bug detection.