Hacker News

adityashankar yesterday at 11:40 AM, 2 replies

This depends on how much better the models get from now on. If Claude Opus 4.6 were turned into one of these chips and ran at a hypothetical 17k tokens/second, I'm sure that would be astounding. So it comes down to how much better Claude Opus 5 would be compared to the current generation.


Replies

aurareturn yesterday at 11:54 AM

I’m pretty sure they’d need a small data center to run a model the size of Opus.

empath75 yesterday at 2:03 PM

Even an o3-quality model at that speed would be incredible for a great many tasks. Not everything needs to be Claude Code. Imagine Apple fine-tuning a mid-tier reasoning model on personal assistant/macOS/iOS sorts of tasks and burning a chip onto the Mac Studio motherboard. Could you run Claude Code on it? Probably not. Would it be 1000x better than Siri? Absolutely.
