smcleod, yesterday at 9:21 PM

Apple Silicon before the M4 does not have matmul instructions, which makes prompt processing very slow. It's quite different on the M5, much like using an NVIDIA GPU.