layer8 • yesterday at 6:29 PM • 0 replies

The large SOTA models have hit sharply diminishing returns on further scaling, I think. So you’d rather double the number of models you can run in parallel.
