This is probably less likely with this model, as it’s almost certainly a further RL training continu...

jjcm • today at 3:42 PM • 1 reply • view on HN

This is probably less likely with this model, as it’s almost certainly a further RL training continuation of 3.5 27b. The bugs with this architecture were worked out when that dropped.

Replies

originalvichy • today at 3:51 PM

Valuable note!

alt Hacker News

Replies