locknitpicker today at 4:40 AM

> I am saying this probably is "silly behavior by a government" and it is a milestone that points towards what the future may look like. Why can't it be both?

Here is why it's unlikely this is anything other than "silly behavior by a government":

- some benchmarks show GPT-5.5, Gemini 3.1, and even Claude Opus outperforming Claude Fable, and yet it's Fable which is restricted.

- some benchmarks still show the likes of Kimi 2.5 outperforming any Claude model, and DeepSeek is getting equivalent scores (a few tenths of a percent difference)

> Do you think that Chinese labs will continue to release open models forever (...)

That's immaterial to the discussion. Even if China forced Chinese labs to restrict access to all models, the truth of the matter is that the Trump administration's move to restrict access to US-based models does not prevent others from having access to models that are as capable or even better.

So what exactly is the point of this?


Replies

dagss today at 7:35 AM

I got to try using Fable for a day... it was a clear and definite shift in quality and how independent it is.

It was almost like having another human using and shepherding Opus for me, instead of herding Opus directly myself.

rileyphone today at 5:01 AM

All that says is some benchmarks aren’t worth the tokens it takes to evaluate them. Mythos is clearly capable of finding zero days other models can’t, and Fable is close enough to be lumped with it.

kolinko today at 5:55 AM

Did you use the models yourself?

solumunus today at 5:25 AM

You’re completely overrating these benchmarks and it’s landing you at a nonsense opinion. Just actually use the models and you will see that the gap is significant.
