Stockfish is a machine learning system, it seems quite plausible you might be getting slapped with t...

wgd • yesterday at 11:02 PM • 5 replies • view on HN

Stockfish is a machine learning system, it seems quite plausible you might be getting slapped with the silent performance degradation (https://news.ycombinator.com/item?id=48467896).

Replies

redox99 • today at 1:55 AM

Them silently nerfing the model without telling you, and still fully charging for it, is a new low and should probably be illegal.

➕ show 1 reply

taurath • today at 1:25 AM

Doesn't this "silent degredation" prevent any actual evaluation of the model? If the model fails at something, this allows anyone to claim that it failed due to degradation.

➕ show 2 replies

anematode • yesterday at 11:04 PM

Yup, I suspect that's what's going on

➕ show 1 reply

janalsncm • today at 1:10 AM

It’s possible this is happening at a technical level, but I have a hard time believing this is in the spirit of what Anthropic intends to throttle. It isn’t chip design or building out a competitor to Claude.

Stockfish does use neural nets but they are tiny, on the order of 10M params. Frontier LLMs are probably 100k or 1M times larger than that.

➕ show 1 reply

komali2 • today at 2:56 AM

No, since it's a silent failure, it's not plausible. We have to assume all results we get are the actual model performance, because, it's the actual model performance as we understand it.

Someone trying to solve similar problems will have similar results if the "silent failure" applies consistently in aggregate. So, this is the model's performance.

alt Hacker News

Replies