Hacker News

redox99, today at 2:04 AM

I think it's more likely to be the old base model checkpoint further trained on additional data.


Replies

jumploops, today at 8:34 AM

Is that technically not a new pretrained model?

(Also not sure how that would work, but maybe I’ve missed a paper or two!)
