logoalt Hacker News

emp17344yesterday at 6:57 PM4 repliesview on HN

I’m extremely skeptical because of all those articles claiming OpenAI was freaking out about Gemini - now it turns out they just casually had a better model ready to go? I don’t buy it.


Replies

Workaccount2yesterday at 8:41 PM

I (and others) have a strong suspicion that they can modulate models intelligence in almost real time by adjusting quantization and thinking time.

It seems if anyone wants, they can really gas a model up in the moment and back it off after the hype wave.

show 2 replies
tempaccount420yesterday at 6:58 PM

They had to rush it out, I'm sure the internal safety folks are not happy about it.

robots0onlytoday at 3:01 AM

how do you know this is a better model? I wouldn't take any of the numbers at face value especially when all they have done is more/better post-training and thus the base pre-trained model capabilities is still the same. The model may just elicit some of the benchmark capabilities better. You really need to spend time using the model to come to any reliable conclusions.

bamboozledtoday at 12:58 AM

It's very inline with their PR strategy, or lack of.