Hacker News

simianwords today at 12:33 AM

It’s interesting that they kept the price the same, given that inference on Cerebras is much more expensive.



diwank today at 12:56 AM

I don't think this is Cerebras. Running on Cerebras would change model behavior a bit, could potentially give a ~10x speedup, and would be more expensive. So most likely this is them writing new, more optimized kernels for the Blackwell series, maybe?

chillee today at 12:54 AM

This is almost certainly not being done on Cerebras.