zozbot234 • today at 5:08 PM

The datacenter setting has huge economies of scale for low-latency, just-in-time inference using extremely large models, but that's not the only viable use of AI. Batched, unattended inference of possibly smaller and weaker models, while theoretically viable in a datacenter setting, is far from the best use of that hardware. This is where local AI is at its best.
