We're still in the early days. It's gonna get a lot worse, if the LLM scaling laws are to be believed.
https://metr.org/blog/2025-03-19-measuring-ai-ability-to-com...