logoalt Hacker News

zenapolloyesterday at 10:59 PM3 repliesview on HN

I’ve definitely experienced step jumps down in quality on an almost daily basis. I usually used xhigh. The experience of relying on codex’s outstandingly thorough coding earlier in the year has evaporated for me. I’m seeing incredibly stupid implementations intermittently, and have simply switched to Claude until openai takes the issue seriously. As far as i could tell they haven’t taken it seriously for the several months I’ve been personally seeing it.


Replies

siva7yesterday at 11:03 PM

I've switched 3 months ago to Codex because Claude got incredibly stupid. 6 months ago vice versa. It doesn't matter if you use Codex or Claude. Both will fuck with you at some point. Though Codex probably less.

show 2 replies
matco11today at 3:40 AM

I have noticed this degradation of 5.5 reliability to what, in my experience, I consider Claude-level of reliability since early June.

My journey dealing with this has been transitioning from 5.5 high to 5.5 xhigh to 5.4 high.

5.4 high has been perfectly reliable for me for the last 3 weeks, and I am happy there.

Occasionally, I run some tasks on 5.5 xhigh to check if it has gone back to being 100% perfectly reliable, but, at this point, I am assuming they are just counting on releasing 5.6 rather than dealing with this reliability issue.

cyanydeezyesterday at 11:18 PM

i don't ever believe these issues are technical. They're business decisions to downgrade performance because to fix it means $$$$ and you arn't paying them enough.