logoalt Hacker News

zinodaurtoday at 4:39 PM7 repliesview on HN

Oh no, someone is profiting off of their work without proper attribution!?!?


Replies

Aurornistoday at 5:32 PM

This is an open weights model based on other open weights models.

The dispute is that they released it with claims about having done some post training that improved the outputs. It was discovered that the model was not post trained like they claimed.

The HF page now says it’s a merge of models, which wasn’t there before. They’re trying to claim they accidentally uploaded the wrong model to HF and that they’ll upload the real one soon.

Basically, they thought they could splice two open weights models together and claim their team had accomplished some amazing post training, but they weren’t smart enough to realize that other researchers would discover that there wasn’t any post training.

show 2 replies
internet2000today at 4:41 PM

Attribution isn't the relevant part. Lying about your lab's capabilities is.

show 6 replies
clear-octopustoday at 5:02 PM

[dead]

carlosjobimtoday at 5:06 PM

This is a pure scam on tax payer money. But what else would be expected?

show 2 replies
bachmeiertoday at 5:01 PM

"Their work"? First you had the original content creators that did 99.99% of the work. Then you had the US companies bundle it up into a frontier LLM. Then "they" did the "work" of using the US model as a foundation for their own. So in the sense of doing 0.00001% of the actual work that went into their product, sure.

I'd say it's more like someone forking a Linux distro, adding a few themes and fonts, and then complaining when someone else forks their distro and adds another theme.

show 5 replies