logoalt Hacker News

mtct88today at 2:07 PM5 repliesview on HN

Nice release from the Qwen team.

Small openweight coding models are, imho, the way to go for custom agents tailored to the specific needs of dev shops that are restricted from accessing public models.

I'm thinking about banking and healthcare sector development agencies, for example.

It's a shame this remains a market largely overlooked by Western players, Mistral being the only one moving in that direction.


Replies

lelanthrantoday at 2:36 PM

> It's a shame this remains a market largely overlooked by Western players, Mistral being the only one moving in that direction.

I've said in a recent comment that Mistral is the only one of the current players who appear to be moving towards a sustainable business - all the other AI companies are simply looking for a big payday, not to operate sustainably.

show 1 reply
Aurornistoday at 3:36 PM

I play with the small open weight models and I disagree. They are fun, but they are not in the same class as hosted models running on big hardware.

If some organization forbade external models they should invest in the hardware to run bigger open models. The small models are a waste of time for serious work when there are more capable models available.

NitpickLawyertoday at 2:14 PM

I agree with the sentiment, but these models aren't suited for that. You can run much bigger models on prem with ~100k of hardware, and those can actually be useful in real-world tasks. These small models are fun to play with, but are nowhere close to solving the needs of a dev shop working in healthcare or banking, sadly.

kennethopstoday at 2:10 PM

I love the idea of building competitor to open weight models but damn is this an expensive game to play

smrtinserttoday at 2:26 PM

How true is this? How does a regulated industry confirm the model itself wasn't trained with malicious intent?

show 1 reply