A lot of people are using Claude Code which many consider to be a noticeably better for coding than the other models.
I think also they tend to be generating non-C++ code where there are more guardrails and less footguns for LLMs to run into. Eg they're generating Javascript or Python or Rust where type systems and garbage collection eliminates entire classes of mistakes that LLMs can run into. I know you said you don't use it for Python because you know the language but even experienced Python devs still see value in LLM-generating Python code.
That’s funny bc I linked my post to a server I’m on and I also was told to use an agent.
My worry about an agent is I’m trying to translate the math with full fidelity and an agent might take liberties with the math rather than full accuracy. I’m already having issues with 0 to 1 indexing screwing up some of the algorithm.
But I will try an agent - can’t hurt to try