At least with the embedded platforms I'm familiar with, dedicated silicon to NPU is both faster...

heavyset_go • last Friday at 2:04 PM • 0 replies • view on HN

At least with the embedded platforms I'm familiar with, dedicated silicon to NPU is both faster and more power efficient than offloading to GPU cores.

If you're going to be doing ML at the edge, NPUs still seem like the most efficient use of die space to me.

alt Hacker News