logoalt Hacker News

titzertoday at 9:37 PM0 repliesview on HN

> Anthropic’s guidelines. This section discusses how Anthropic might give supplementary instructions to Claude about how to handle specific issues, such as medical advice, cybersecurity requests, jailbreaking strategies, and tool integrations. These guidelines often reflect detailed knowledge or context that Claude doesn’t have by default, and we want Claude to prioritize complying with them over more general forms of helpfulness. But we want Claude to recognize that Anthropic’s deeper intention is for Claude to behave safely and ethically, and that these guidelines should never conflict with the constitution as a whole.

Welcome to Directive 4! (https://getyarn.io/yarn-clip/5788faf2-074c-4c4a-9798-5822c20...)