Here is an example of a project I worked on using Codex: https://github.com/newbeelearn/whisper.cpp. It took 10 iterations just to get the GitHub Actions right; you can see the commits made by Codex. The project was quite simple: it needed to modify whisper to add support for transcribing voice with start/stop keys and copying the transcription to the clipboard when stopped. That's it. Codex performs poorly compared to CC, which gets it right in one shot.