logoalt Hacker News

daemonologisttoday at 5:18 PM1 replyview on HN

The allegation here is that it's not actually a fine-tune of Qwen, but instead an undisclosed mashup (merge) of someone else's fine-tune of Qwen and the original model. Rio subsequently said that the model was in fact a merge, that they did additional fine-tuning after the merge, and that they accidentally uploaded the base merge instead of the version with additional fine-tuning. But this seems like quite an oversight...


Replies

yieldcrvtoday at 6:55 PM

> But this seems like quite an oversight...

Not to me, what would people like to happen? Who are those people? And why do they care?