Hacker News

SubiculumCode today at 4:12 PM

Has anyone taken these open-weight models from China and stripped the CCP out of them? I do not mean that snarkily; I mean reviewing them thoroughly with weight-introspection techniques (concept activations), checking how they respond to inputs that one might expect to trigger deceptive or malicious behavior if the CCP had actually tried to implant context-specific behaviors (e.g. the accusation that they generate vulnerable code when used in American government applications, which I don't know was ever proven).

Just in case there are those who'd reflexively downvote this post, I'd just like to say that in a time of great geopolitical rivalries, this kind of question is not an unreasonable one to ask. Indeed, it's an applicable question whichever nation you live in.


Replies

dev_l1x_be today at 4:16 PM

> Has anyone taken these open weight models from China and stripped the CCP out of them?

The CCP is not influencing my Rust code quality that much. Though I did notice all my lifetimes are now 'static because nothing is ever allowed to leave the party's ownership, and unsafe blocks require approval from a central committee.

Honestly the scariest part is that shared mutable state is forbidden unless the state is doing the sharing.

Otherwise it is pretty ok.

justinclift today at 4:16 PM

Sounds like something that heretic or similar might be useful for?

https://github.com/p-e-w/heretic

threethirtytwo today at 4:15 PM

Eh, even corporate-created LLMs are subject to corporate biases. Nothing is safe.
