logoalt Hacker News

echelontoday at 5:17 PM1 replyview on HN

NO!

This is the wrong approach that will turn us into serfs. We need big honking models that do what the leading foundation hyperscaler models do to within a few percentage points of measured performance.

The small-scale models are not productive, and the duct tape solutions built on top of them are hobbyist-tier "year of Linux on desktop" toys.

I imagine fedora-wearing, crypto-shilling, coupon-cutting boffins every time I see small weights thing lauded as the future. This is the Pine Phone F-Droid of AI.

"SMS works most of the time on my phone, I swear! I don't really need my banking app!"

That is not big model energy.

Nothing outside of the top ten is worth spending any time on, and we need to focus on models that bridge the gap.

You're talking about impractical toys for highly technical people wasting their own time. That doesn't move the needle or have any economic impact on the competitive landscape.

We need sharp teeth that bite at the legs of the top-tier foundation labs and hold them back from running away with the prize.

We've been through this time and time again over the last thirty years. It's the same shaped problem as before. We don't need toys - we need real infra for real people paying money to do work. Not freeware for freeloaders who don't spend and invest in the problem space.

Large models fit that precisely, because it forces investment into a wide variety of open infra, routers, inference engines, etc. Not to mention the weights ecosystem itself.


Replies

ktalletttoday at 6:43 PM

Firstly, unless you are the leader of any of the faangs, you are a serf on the whole, if you believe in that philosophy as being relevant.

We need the right tool for the job. Certain models have minimum energy expense no matter what the task is and that's often wasted, both on the scale of some tasks and also repetition.

There is a place and a need for large models, local models, and single purpose models. The same way there is a need for HPC and single board.

show 1 reply