My hopes are on harness engineering allowing cheaper (but still large) models to shine. I'm evaluating DeepSeek because it would allow insane agent armies. Although DeepSeek charges for thinking tokens, something easy to overlook.
DeepSeek has the tendency to think... a lot!. Without a good harness I can't evaluate it well; time will tell.
OpenAI doesn't; it's embedded into the price, I think.
Cheap = we can run 10x the workloads, bigger imagination = innovation. Maybe 10 dumb agents in a loop can beat 1 Opus? Haha.