As an example, 2026 GPT doesn't even agree with its 2025 self. Last year I asked it to make a h...

mgrunwald_ • today at 2:37 PM • 1 reply • view on HN

As an example, 2026 GPT doesn't even agree with its 2025 self. Last year I asked it to make a hardware comparison and it correctly identified the objectively better option. Recently I asked again and this time and it got everything completely backwards.

Replies

aspenmartin • today at 2:39 PM

Models are stochastic. Did you look at pass@k? I wouldn’t be surprised if you saw a regression because these models are extremely complex and impact of various decision making downstream is complex.

➕ show 1 reply

alt Hacker News

Replies