logoalt Hacker News

drawnwrentoday at 6:06 AM1 replyview on HN

Yeah, if you're arguing that "this, according to anthropic, existentially dangerous model has only had its safeguards partially circumvented so we shouldn't step in" ... it's hard for me to take you seriously?

Put another way, the thing we are all concerned with is the complete circumvention of safeguards that is normally possible with llms. If you _aren't_ arguing that this isn't possible, you're not engaging in discussing the the thing that is concerning to regulators or those discussing the regulation.


Replies

linkregistertoday at 6:44 AM

A disappointing trend is to frame the opposing argument in extreme terms rather than engaging with the substance of the assertion.

The latter portion is grand standing about how incredulous the commenter is that someone might trust an LLM company about the strength of their harnesses' if-then-else statements for request routing.

Why bother with an unsubstantial comment?