logoalt Hacker News

storuslast Sunday at 6:07 PM1 replyview on HN

In my tests, GPT-OSS-120B Q8 was close to DeepSeek R1 671B Q16 in solving graduate-level math but much faster with way fewer thinking tokens.


Replies

overfeedlast Sunday at 8:09 PM

Supporting TFA'd thesis that it's trained to be good at benchmarks.

show 1 reply