Hacker News

kinnth today at 8:02 AM

I think we're also ignoring a potential innovative move in how models work.

If someone could splinter or fragment the models into more specific tasks, e.g. a "spellchecker AI", and get these working as well as Sonnet 4.6-4.8 on those tasks on a personal laptop, you would then question the $100 a month fee.

Bear in mind these laptops are likely to cost $5000 or so because of the memory, HDD and M7 chip they would need.

It feels to me like the beginning of the inflection point, but software updates, not hardware updates, will be the accelerant.


Replies

marci today at 9:57 AM

  "That’s where EMO comes in.

  We show that EMO – a 1B-active, 14B-total-parameter (8-expert active, 128-expert total) MoE trained on 1 trillion tokens – supports selective expert use: for a given task or domain, we can use only a small subset of experts (just 12.5% of total experts) while retaining near full-model performance."
https://allenai.org/blog/emo
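
For anyone who wants the numbers in the quote made concrete, here is a rough sketch of top-k routing restricted to a task-specific expert subset. Only the figures 128, 8 and 12.5% come from the quote. The `route` function, the random router scores and the choice of "first 16 experts" as the task subset are made up for illustration, and this is not how EMO actually selects or trains its experts.

```python
import math
import random

NUM_EXPERTS = 128        # total experts, from the quoted post
TOP_K = 8                # experts active per token, from the quoted post
SUBSET_FRACTION = 0.125  # share of experts kept for one task, from the quoted post


def route(logits, allowed, k=TOP_K):
    """Pick the k highest-scoring experts among `allowed` and softmax their scores."""
    ranked = sorted(allowed, key=lambda e: logits[e], reverse=True)[:k]
    peak = max(logits[e] for e in ranked)  # subtract the max for numerical stability
    weights = [math.exp(logits[e] - peak) for e in ranked]
    total = sum(weights)
    return {e: w / total for e, w in zip(ranked, weights)}


rng = random.Random(0)
# Hypothetical router scores for one token; a real model derives these from the hidden state.
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

subset_size = int(NUM_EXPERTS * SUBSET_FRACTION)  # 16 experts
# Hypothetical task subset: simply the first 16 expert ids.
task_experts = list(range(subset_size))

full = route(logits, range(NUM_EXPERTS))  # routing over all 128 experts
pruned = route(logits, task_experts)      # routing over the 16-expert subset only

print(len(full), len(pruned), subset_size)  # prints: 8 8 16
```

In the pruned case each token still uses 8 experts, but they are drawn from a pool of 16 instead of 128, so in principle only those 16 would need to be resident in memory for that task.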