Best inference silicon in the world generally, or specialized for smaller models/edge?

whereismyacc • yesterday at 4:26 PM

I'm not even an Apple fan, but from what I've tested for my dev use case (only up to 14B), it absolutely rocks for general models.

alt Hacker News