I’ll try 5.4 again, but when I spent a couple of hours with it, I found it much weaker than Claude Code + Opus 4.6 for working on large projects.
Are you delegating substantial work like planning and executing refactors, or working more at the single-line and function level?
I think the gap is smaller than it has been in the past, but I largely agree with you: larger work is generally done much better with Claude Code.