best inference silicon in the world generally or specialized to smaller models/edge?
Not even an Apple fan, but from what I've been testing with for my dev use case (only up to 14b) it absolutely rocks for general models.
Not even an Apple fan, but from what I've been testing with for my dev use case (only up to 14b) it absolutely rocks for general models.