In my tests, GPT-OSS-120B Q8 was close to DeepSeek R1 671B Q16 in solving graduate-level math but mu...

storus • last Sunday at 6:07 PM • 1 reply • view on HN

In my tests, GPT-OSS-120B Q8 was close to DeepSeek R1 671B Q16 in solving graduate-level math but much faster with way fewer thinking tokens.

overfeed • last Sunday at 8:09 PM

Supporting TFA'd thesis that it's trained to be good at benchmarks.

➕ show 1 reply

alt Hacker News