logoalt Hacker News

ainchtoday at 3:13 AM1 replyview on HN

This understates the possible headroom as technical challenges are addressed - text diffusion is significantly less developed than autoregression with transformers, and Inception are breaking new ground.


Replies

nylonstrungtoday at 3:55 AM

Very good point- if as much energy/money that's gone into ChatGPT style transformer LLMs were put into diffusion there's a good chance it would outperform in every dimension