> 1) reasoning capabilities in latest models are rapidly approaching superhuman levels and continue to scale with compute.
What would you say is the strongest evidence for this statement?
Well the contrived benchmarks the industry selling the models made up seem to be improving.
Well the contrived benchmarks the industry selling the models made up seem to be improving.