Just to make sure I understand your argument. Are you saying that today's open source models are on par with frontier closed models of two weeks ago? By what criteria?
HN thread from 2 days ago:
https://news.ycombinator.com/item?id=48567759
Commenters there were saying GLM 5.2 was roughly equivalent to Opus 4.8 in coding prowess, based on personal experience of the people commenting. Opus 4.8 came out on May 28 this year (so more like 3 weeks ago), GLM 5.2 came out 2 days ago.
Sorry definitely mis-remembered on my part, its about 3 months
https://x.com/yaroslavvb/status/2067367657272422584 https://x.com/voratiq/status/2067667800643268928 https://arena.ai/leaderboard/agent