
7moritz7 | last Sunday at 4:15 PM

That would be interesting. I've been a bit sceptical of the entire strategy from the beginning. If oss were actually as good as o3-mini, and in some cases o4-mini, outside of benchmarks, that would undermine OpenAI's API offering for GPT-5 nano and maybe mini too.

Edit: found this analysis, it's on the HN frontpage right now

> this thing is clearly trained via RL to think and solve tasks for specific reasoning benchmarks. nothing else.

https://x.com/jxmnop/status/1953899426075816164


Replies

CuriouslyC | last Sunday at 4:26 PM

The strategy of Phi isn't bad, it's just not general. It's really a model that's meant to be fine-tuned, but unfortunately fine-tuning tends to shit on RL'd behavior, so it ended up not being that useful. If someone made a Phi-style model with an architecture designed to take knowledge adapters/experts (i.e. a small MoE model designed to have separately trained networks plugged into it, with routing updated via a special LoRA), it'd actually be super useful.
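
The comment describes an architecture rather than pointing at existing code, so here's a rough sketch of what "pluggable knowledge experts with LoRA-updated routing" could look like. This is a minimal PyTorch illustration under my own assumptions: `PluggableMoE`, `LoRARouter`, and `add_expert` are hypothetical names, the mixture is dense rather than top-k for simplicity, and none of this reflects how Phi or any released model actually works.

```python
# Sketch: a tiny MoE block whose experts are separately trained adapters that can
# be plugged in after the fact; only a low-rank (LoRA-style) delta on the router
# is trained to make use of them. Names/shapes are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LoRARouter(nn.Module):
    """Gate over expert slots: a frozen base gate plus a trainable low-rank
    update, so integrating a new expert only trains a small delta."""
    def __init__(self, d_model: int, max_experts: int, rank: int = 8):
        super().__init__()
        self.base = nn.Linear(d_model, max_experts, bias=False)
        self.base.weight.requires_grad_(False)           # frozen base routing
        self.lora_a = nn.Parameter(torch.randn(rank, d_model) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(max_experts, rank))

    def forward(self, x, n_active):                       # x: (batch, d_model)
        logits = self.base(x) + x @ self.lora_a.T @ self.lora_b.T
        return F.softmax(logits[:, :n_active], dim=-1)    # only populated slots

class PluggableMoE(nn.Module):
    """Backbone block that starts with an identity 'expert' and accepts
    separately trained adapters via add_expert()."""
    def __init__(self, d_model: int, max_experts: int = 8, rank: int = 8):
        super().__init__()
        self.experts = nn.ModuleList([nn.Identity()])     # slot 0: passthrough
        self.router = LoRARouter(d_model, max_experts, rank)
        self.max_experts = max_experts

    def add_expert(self, module: nn.Module):
        """Plug in a separately trained knowledge adapter; afterwards only the
        router's LoRA parameters need further training to route to it."""
        assert len(self.experts) < self.max_experts
        self.experts.append(module)

    def forward(self, x):
        weights = self.router(x, len(self.experts))       # (batch, n_active)
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):         # dense mix over slots
            out = out + weights[:, i:i + 1] * expert(x)
        return out

# Usage: plug in a domain adapter trained elsewhere, then fine-tune only the
# router's lora_a/lora_b on a small amount of mixed data.
d_model = 64
block = PluggableMoE(d_model)
domain_adapter = nn.Sequential(nn.Linear(d_model, d_model), nn.GELU(),
                               nn.Linear(d_model, d_model))
block.add_expert(domain_adapter)
y = block(torch.randn(4, d_model))
print(y.shape)  # torch.Size([4, 64])
```

The point of freezing the base gate and only training the low-rank delta is that plugging in a new expert wouldn't require touching the RL'd backbone weights, which is the failure mode the comment attributes to ordinary fine-tuning.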

johnisgood | last Sunday at 11:21 PM

Is there a URL to the post itself somewhere else?