logoalt Hacker News

slt202101/22/20252 repliesview on HN

you dont know which prompt activates the backdoor, how can you firewall it if you run the model in production?


Replies

foolfoolz01/22/2025

3d asset generation is a use case that for most doesn’t need to run in production

show 1 reply
dkjaudyeqooe01/22/2025

Simply sanatieze the model outputs, which is the only thing that would escape running it in complete isolation.