More recent models are better at reading and obeying constraints in AGENTS.md/CLAUDE.md.
GPT-5.2-Codex did a bad job of obeying my more detailed AGENTS.md files but GPT-5.3-Codex very evidently follows it well.
Opus 4.5 successfully ignored the first line of my CLAUDE.md file last week
Perhaps I’m not using the latest and greatest in terms of models. I tend to avoid using tools that require excessive customization like this.
I find it infinitely frustrating to attempt to make these piece of shit “agents” do basic things like running the unit/integrations tests after making changes.