My hot take: The way of the future is local LLMs.
The AI equivalent of the PC revolution isn’t quite here yet, but it’s the only way forward.
It will always be worse compared to a centralized approach where hardware utilization can remain high. Except in case which demand low latency which most development things do not need. It's okay if it takes an extra 100ms for a code review to take place.
I agree. We don't need a nondeterministic 10quadrillion vector model. We need an deterministic expert on our narrow business. Something small, that can be run on the 2026 version of the spare PC under the CTOs desk.