logoalt Hacker News

dragonwriteryesterday at 2:53 PM1 replyview on HN

I can't imagine that model lifetimes will ever justify using model-specific ASICS for public serving (maybe something like serving fixed certified AI models in a vehicle or robot) over more generic GPUs/NPUs until after the AI bubble pops.


Replies

aleph_minus_oneyesterday at 4:16 PM

Be aware that currently the hardware costs and electric bill are two huge problems of modern LLMs.

If such AI models will deliver on their qualitative promises, and just the huge cost is the burden to overcome, custom ASIC might be a part of the solution.

If, on the other hand, AI models will still be unsuitable for many applications because of their qualitative issues, it is a much harder and different problem to solve - in this case, the AI bubble will plausibly burst.