0000000000100 • today at 2:13 AM • 4 replies

Are you kidding man? Have you tried the new model for coding? It's absolutely incredible. After using it, I really see why they were so concerned. The jump in my workflows feels as large as the jump from 3.5 to 4o (OpenAI). It's just that good.

Issues I'd been kinda circling around for weeks, long-standing errors in some long-running sync operations for a project I'm working on, all solved the same day the model dropped. Just incredible. And I find it's effectively a lot more token efficient as well (less so with sub-agents). The areas where Opus 4.8 would occasionally get confused or venture down the wrong direction just don't come up nearly as much with Fable 5.

Like what is everyone who is dissing this model / Anthropic using day to day? For me it's just an incredible jump in intelligence. So much so, and so quickly after the modest bump from 4.8, that I can really understand why they are starting to shout warnings.

Replies

internet101010 • today at 2:32 AM

It's a huge jump across the board. I was really impressed with its ability to test usability in Claude for Chrome. Very opinionated but in a good way. It was good while it lasted.

cyberax • today at 2:27 AM

I did not see that?

It's way more _proactive_ than the old models, sometimes in ways it shouldn't really be proactive. But it produces _more_ slop than 4.8, and I have not seen any real breakthroughs from it.

Edit: to give an example, I'm working on integrating a self-hosting auth provider into our app. So I gave it a prompt to create a "bootstrap" script that would create pre-configured settings for the local installation.

Fable did it. And then proceeded (unprompted) to test it by killing the running server, removing the database, re-initializing, and trying to verify that the bootstrap produced identical results.

Well, yeah. Great. I can see how this "bias for action" works for security research and one-shot projects, not so sure about regular development.

I just tried that with Opus, and it produced a similar bootstrap script but did not start the test by itself.

imadierich • today at 3:27 AM

[dead]
