logoalt Hacker News

wgdyesterday at 11:02 PM5 repliesview on HN

Stockfish is a machine learning system, it seems quite plausible you might be getting slapped with the silent performance degradation (https://news.ycombinator.com/item?id=48467896).


Replies

redox99today at 1:55 AM

Them silently nerfing the model without telling you, and still fully charging for it, is a new low and should probably be illegal.

show 1 reply
taurathtoday at 1:25 AM

Doesn't this "silent degredation" prevent any actual evaluation of the model? If the model fails at something, this allows anyone to claim that it failed due to degradation.

show 2 replies
anematodeyesterday at 11:04 PM

Yup, I suspect that's what's going on

show 1 reply
janalsncmtoday at 1:10 AM

It’s possible this is happening at a technical level, but I have a hard time believing this is in the spirit of what Anthropic intends to throttle. It isn’t chip design or building out a competitor to Claude.

Stockfish does use neural nets but they are tiny, on the order of 10M params. Frontier LLMs are probably 100k or 1M times larger than that.

show 1 reply
komali2today at 2:56 AM

No, since it's a silent failure, it's not plausible. We have to assume all results we get are the actual model performance, because, it's the actual model performance as we understand it.

Someone trying to solve similar problems will have similar results if the "silent failure" applies consistently in aggregate. So, this is the model's performance.