The gap between how this is described in the paper and in the blog post is pretty wide. Would be nice to see more accessible writing from research teams — not everyone reading is an ML engineer.
Agreed. The practical implications are often more interesting than the math anyway — smaller models running locally means you can afford to run multiple models in parallel for cross-validation, which changes how you approach tasks like code analysis or bug detection.
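To make the cross-validation idea concrete, here's a minimal sketch: fan the same snippet out to several models in parallel and keep only the findings a majority agree on. The lambdas standing in for models are placeholders — in practice each would call a local inference server.

```python
from concurrent.futures import ThreadPoolExecutor

def cross_validate(models, snippet, threshold=0.5):
    """Run every model on the same snippet in parallel and keep only
    findings that more than `threshold` of the models report."""
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        results = list(pool.map(lambda m: m(snippet), models))
    # Count how many models reported each finding.
    counts = {}
    for findings in results:
        for f in set(findings):
            counts[f] = counts.get(f, 0) + 1
    needed = threshold * len(models)
    return sorted(f for f, n in counts.items() if n > needed)

# Stand-in "models" — hypothetical, purely for illustration.
model_a = lambda code: ["off-by-one in loop", "unused import"]
model_b = lambda code: ["off-by-one in loop"]
model_c = lambda code: ["off-by-one in loop", "shadowed variable"]

print(cross_validate([model_a, model_b, model_c], "for i in range(n+1): ..."))
# → ['off-by-one in loop']
```

Only the finding that 2 of 3 models agree on survives; the single-model findings are treated as noise, which is the payoff of running several small models instead of trusting one.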
These are very different media types with very different goals.