Working on my codebase (~100KLoC across multiple Python modules) I felt that Fable was head and shou...

abraxas • today at 4:22 AM • 3 replies • view on HN

Working on my codebase (~100KLoC across multiple Python modules) I felt that Fable was head and shoulders above 4.x series. It was just relentless and always hell bent on testing and proving its own work. It just tore through problems like an animal. I never seen that behaviour in 4.5-4.8. I can't speak for OpenAI models as I don't use them but Fable was in a different league. Especially when tasked with long horizon goals that involved reasoning at a high and low level to solve the task.

Replies

andxor • today at 4:38 AM

I have had the same experience. I can't believe that people couldn't tell the difference.

➕ show 1 reply

mewpmewp2 • today at 4:33 AM

Yeah, and its browser usage on tough web apps/sites was also amazing. This is one of the cases where it is easy to tell a difference. It was figuring out very effectively how to find right elements whereas with previous LLMs I had to constantly babysit and unblock them with browser usage.

earth2mars • today at 4:58 AM

I used codex 5.5 and Claude. I pay for Claude from my pocket. I use Codex at work. I can confidently say Codex 5.5 high is much better in going through long code bases (couple of millions of lines of code) vs Claude Fable/Opus which does only what is been told. while codex covers all sorts of edge cases. Frankly, I am not going to miss a thing if they stopped Fable.

➕ show 1 reply

alt Hacker News

Replies