Sovereign weights models are a good thing, for a variety of reasons, not least just encapsulating hu...

vessenes • today at 2:37 PM • 3 replies • view on HN

Sovereign weights models are a good thing, for a variety of reasons, not least just encapsulating human diversity around the globe.

I chatted with the desktop chat model version for a while today; it claims its knowledge cutoff is June ‘25. It refused to say what size I was chatting with. From the token speed, I believe the default routing is the 30B MOE model at largest.

That model is not currently good. Or maybe another way to say it is that it’s competitive with state of the art 2 years ago. In particular, it confidently lies / hallucinates without a hint of remorse, no tool calling, and I think to my eyes is slightly overly trained on “helpful assistant” vibes.

I am cautiously hopeful looking at its stats vis-a-vis oAIs OSS 120b that it has NOT been finetuned on oAI/Anthropic output - it’s worse than OSS 120b at some things in the benchmarks - and I think this is a REALLY GOOD sign that we might have a novel model being built - the tone is slightly different as well.

Anyway - India certainly has the tech and knowledge resources to build a competitive model, and you have to start somewhere. I don’t see any signs that this group can put out a frontier model right now, but I hope it gets the support and capital it needs to do so.

Replies

dartharva • today at 4:02 PM

> India certainly has the tech and knowledge resources to build a competitive model

In what universe? India has near-absolutely none of the expensive infra and chip stockpile needed to build frontier models that its American and Chinese counterparts have, even if it did have the necessary expertise (which I also doubt it does).

➕ show 2 replies

Sporktacular • today at 3:00 PM

I'd guess making this a national pride thing will just make it less diverse. Answer would be training models on broader sources, not more nationalistic models.

➕ show 1 reply

segmondy • today at 3:04 PM

You have no idea what you are talking about if you are asking the model what size it is or claiming that a model lies.

➕ show 1 reply

alt Hacker News

Replies