He asked the models to fix the problem without commentary and then… praised the models that returned commentary. GPT-5 did exactly what he asked. It doesn’t matter if it’s right or not. It’s the essence of garbage in and garbage out.
If they are supposed to replace actual devs we would expect them to behave like actual devs and push back against impossible requests.
If they are supposed to replace actual devs we would expect them to behave like actual devs and push back against impossible requests.