logoalt Hacker News

heavyset_golast Friday at 2:04 PM0 repliesview on HN

At least with the embedded platforms I'm familiar with, dedicated silicon to NPU is both faster and more power efficient than offloading to GPU cores.

If you're going to be doing ML at the edge, NPUs still seem like the most efficient use of die space to me.