bigyabai • yesterday at 11:01 PM • 1 reply

The M7 Max and M7 Ultra will likely be prefill-bottlenecked at 100GB+ scale inference. Layered 6090s would not be.

Replies

The Neural Accelerators in M5 are already 4x faster than M4 at prefill. With M7, especially if they focus on AI like this article claims, prefill compute will likely be excellent.
