>> I no longer review every line, but I also have not yet gotten to the point where I can just "trust" the LLM.
Same here. This is also why I haven't been able to switch to Claude Code, despite trying to multiple times. I feel like its mode of operation is much more "just trust the generated code" than Cursor, which lets you review and accept/reject diffs with a very obvious and easy-to-use UX.
Most of the folks I work with who uninstalled Cursor in favor of Claude Code switched back to VSCode for reviewing stuff before pushing PRs. Which... doesn't actually feel like a big change from just using Cursor, personally. I tried Claude Code recently but, like you, preferred the Cursor integration.
I don't have the bandwidth to juggle four independent things being worked on by agents in parallel, so the single-IDE "bottleneck" is not slowing me down. The parallel-agent approach seems to work a lot better for heavy-boilerplate or heavy-greenfield stuff.
I am curious whether, if we refactored our codebase the right way, more small/isolatable subtasks would be parallelizable with lower cognitive load. But I haven't found that "right way" yet.