Hacker News

robotresearcher · yesterday at 11:25 PM

The M5 has 16 dedicated ‘Neural Engine’ cores and a ‘Neural accelerator’ in each of its conventional GPU cores. It’s been pretty special-purpose juiced for inference.


Replies

zozbot234 · yesterday at 11:39 PM

When it comes to the very largest models, the ANE (Apple Neural Engine) seems to be only marginally useful for prefill. The M5 Neural Accelerators (NAX) help a lot, but at a real cost in power and thermals.
