LLM performance has already plateaued. I don't know, nor do I care, what the benchmarks say, because they have not once translated to the real world for me.
The only thing that has seen a massive boost is the harnesses around AI. And AI companies are behind here compared to OSS.
> And AI companies are behind here compared to OSS.
How so?