Hacker News

bigyabai yesterday at 11:01 PM

The M7 Max and M7 Ultra will likely be prefill-bottlenecked for inference at the 100GB+ scale. Layered 6090s would not be.


Replies

aurareturn today at 2:15 AM

The Neural Accelerators in the M5 are already 4x faster than the M4 at prefill. The M7, especially if Apple focuses on AI as this article claims, will likely have excellent prefill compute.