In my tests, GPT-OSS-120B Q8 was close to DeepSeek R1 671B Q16 in solving graduate-level math but much faster with way fewer thinking tokens.
Supporting TFA'd thesis that it's trained to be good at benchmarks.
Supporting TFA'd thesis that it's trained to be good at benchmarks.