I dont think this is Cerebras. Running on cerebras would change model behavior a bit and it could po...

diwank • today at 12:56 AM • 1 reply • view on HN

I dont think this is Cerebras. Running on cerebras would change model behavior a bit and it could potentially get a ~10x speedup and it'd be more expensive. So most likely this is them writing new more optimized kernels for Blackwell series maybe?

Replies

simianwords • today at 1:03 AM

Fair point but it remains to answer - why isn’t this speed up available in ChatGPT and only in the api?

alt Hacker News

Replies