logoalt Hacker News

OtherShrezzingtoday at 9:22 AM1 replyview on HN

The tok/s stat is interesting. Since the dominant constraint on inference speed is hardware, it suggests X purchased far more compute than was really needed to serve the demand for their models.

Expensive miscalculation.


Replies

flirtoday at 10:12 AM

Didn't a bunch of hardware that was destined for Tesla get redirected to xAI? I'm sure I remember something like that.

show 1 reply