redox99 • today at 2:04 AM

I think it's more likely to be the old base model checkpoint further trained on additional data.

jumploops • today at 8:34 AM

Is that technically not a new pretrained model?

(Also not sure how that would work, but maybe I’ve missed a paper or two!)

alt Hacker News