Someone1234 • yesterday at 6:01 PM • 4 replies

That's a lame attitude. There are local models that are last year's SOTA, but that's not good enough because this year's SOTA is even better yet still...

I've said it before and I'll say it again, local models are "there" in terms of true productive usage for complex coding tasks. Like, for real, there.

The issue right now is that buying the compute to run the top-end local models is absurdly unaffordable, both in general and because you're outbidding LLM companies for limited hardware resources.

With a $10K budget, you can legit run last year's SOTA agentic models locally and do hard things well. But most people don't or won't, and it doesn't make cost-effective sense vs. currently subsidized API costs.

Replies

gbro3n • yesterday at 6:10 PM

I completely see your point, but when my time as a developer is worth what it is compared to the cost of a frontier model subscription, I'm wary of choosing anything but the best model I can. I would love to be able to say I have X technique for compensating for the model shortfall, but my experience so far has been that bigger, later models outperform older, smaller ones. I genuinely hope this changes, though. I understand the investment that it has taken to get us to this point, but intelligence doesn't seem like something that should be gated.

aliljet • yesterday at 6:46 PM

First, making sure to offer an upvote here. I happen to be VERY enthusiastic about local models, but I've found them to be incredibly hard to host, incredibly hard to harness, and, despite everything, remarkably powerful if you are willing to suffer really poor tokens-per-second performance...

HWR_14 • yesterday at 7:34 PM

$10k is a lot of tokens.
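
To put rough numbers on that, here is a back-of-the-envelope sketch. The function names, the flat per-million-token price, and the monthly spend are all hypothetical placeholders rather than real quotes from any provider, and the break-even figure ignores electricity, depreciation, and any quality gap between models.

```python
def tokens_for_budget(budget_usd: float, usd_per_million_tokens: float) -> float:
    """Tokens a budget buys at a flat (assumed) per-million-token API price."""
    if usd_per_million_tokens <= 0:
        raise ValueError("price must be positive")
    return budget_usd / usd_per_million_tokens * 1_000_000


def months_to_break_even(hardware_usd: float, monthly_api_spend_usd: float) -> float:
    """Months of API spend needed to equal an up-front hardware cost.

    Simplification: ignores power, maintenance, and resale value.
    """
    if monthly_api_spend_usd <= 0:
        raise ValueError("monthly spend must be positive")
    return hardware_usd / monthly_api_spend_usd


if __name__ == "__main__":
    # Hypothetical inputs: swap in the prices and usage you actually see.
    print(tokens_for_budget(10_000, usd_per_million_tokens=10.0))
    print(months_to_break_even(10_000, monthly_api_spend_usd=200.0))
```

Plugging in your own price and monthly usage shows how long the $10K rig would take to pay for itself, which is the comparison the parent comment is making.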

wellthisisgreat • yesterday at 7:01 PM

> that are last year's SOTA

Early last year or late last year?

Opus 4.5 was quite a leap.
