Indeed, I feel like we are in the early computer equivalent phase of AI, where giant expensive hardware is still required for frontier models. In 5 years I bet there will be fully open models we'll be able to run on a few $1000 of consumer hardware with equivalent performance to opus 4.7/4.6.
You'll never have the power of what they have though. Cloud capital is insane.
So you can run 1 agent locally on $1k to $3k hardware
They can run a fleet of thousands