logoalt Hacker News

stavrostoday at 12:45 AM3 repliesview on HN

The Chinese models are distilled from GPT and Claude, so it's not like China would pull ahead if those companies went away for six months. They really are at the forefront of innovation right now, as much as I hate to think of the consequences of this (a single company owning a superintelligence is basically a nightmare scenario for me).


Replies

largbaetoday at 12:53 AM

Don't worry, if someone truly achieves superintelligence it won't be controlled by anyone for long.

show 2 replies
electroglyphtoday at 5:20 AM

i don't buy this. distilled how? you don't get access to logprobs, and the thinking traces are fake and compressed. it's an expensive way to get potentially substandard training data.

isodevtoday at 12:54 AM

I think that’s the realm of conspiracy theories. There are also not only Chinese alternatives- Mistral in Europe is doing pretty good in several categories they’ve opted to focus on.

This kind of reiterates the parent’s question I think - people are maybe too focused on the gpt/claude model and forget about all the other ways of using the tech.

show 1 reply