Hacker News

zozbot234 yesterday at 11:39 PM

When it comes to the very largest models, the ANE seems to be only marginally useful for prefill. The M5 Neural Accelerators (NAX) help a lot, but at a real cost in power and thermals.


Replies

robotresearcher yesterday at 11:42 PM

Yep, but Apple products don’t spend most of their time running huge models. They are running lots of little ones all the time, using hardware designed for that.
