Right. But "prompt" also covers a lot of ground, e.g. planning and task tracking. The Codex-style frameworks do a good amount of that for you, but it can still make a big difference to structure what you're asking the model to do and let it execute step by step.
A lot of the failures people talk about seem to involve expecting the models to one-shot fairly complex requirements.