> So the size of GPT-5.3-Codex-Spark isn't limited by the memory of a single Cerebras chip, but by the number of such chips that you can chain together and still hit the 1000 tokens per second target.
Chaining chips does not decrease token throughput. In theory, you could run models of any size on Cerebras chips. See, for example, Groq's (not to be confused with Grok) chips, which have only 230 MB of SRAM each, yet manage to run Kimi K2.
Only if chip-to-chip communication is as fast as on-chip communication. Which it isn’t.
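A rough back-of-the-envelope sketch of why both replies can be partly right. For a single autoregressive decode stream, each new token must traverse every pipeline stage in order, so every chip-to-chip hop lands on the critical path. All the numbers below (forward-pass compute time, hop latency, chip count) are hypothetical placeholders, not measured Cerebras or Groq figures.

```python
# Toy model: one decode stream, model layers pipelined across n_chips.
# Numbers are illustrative only.

def tokens_per_second(total_compute_us: float,
                      n_chips: int,
                      hop_latency_us: float) -> float:
    """Per-token latency = the model's full forward-pass compute (split across
    the chain) plus one inter-chip hop per stage boundary. A token can't start
    until the previous one finishes, so single-stream rate is 1 / latency."""
    latency_us = total_compute_us + (n_chips - 1) * hop_latency_us
    return 1e6 / latency_us

# If hops were as cheap as on-chip movement, chaining would cost nothing:
print(tokens_per_second(total_compute_us=800, n_chips=4, hop_latency_us=0))    # 1250 tok/s
# With a hypothetical 100 us hop, the same 4-chip chain falls below 1000 tok/s:
print(tokens_per_second(total_compute_us=800, n_chips=4, hop_latency_us=100))  # ~909 tok/s
```

In this framing, the parent is right that model size itself isn't capped (just add chips), while the reply is right that every added hop taxes per-token latency unless the interconnect keeps up; with many concurrent streams the stages can be kept busy, which is presumably the sense in which aggregate throughput doesn't drop.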