Hacker News

andai, today at 12:21 AM

Claude Code has a big system prompt, most of which isn't necessary for the more recent models. (Codex too.)

I've been running Claude and GPT in my own agent harness. The main difference I notice is that tasks take about 7x longer to complete if they're run in the official Claude or Codex harness (and cost me 7x more).

You would think this would lead to increased correctness, but that doesn't seem to be the case. Today I tested both side by side. They both resulted in data loss. (I had a backup, obviously.)

GPT running in the official harness did a bunch of extra tests and double checking, and ended up with the same result regardless (it permanently deleted a bunch of documentation).

All else being equal, I like getting my data loss 7x faster and cheaper ;)


Replies

jack_pp, today at 12:23 AM

So you are using the API directly in your own harness without the subscription?
