Hacker News

sheepscreek, yesterday at 7:04 PM

Wow. I always forget that, unlike autoregressive models, diffusion models are heavier on resources (for the same number of parameters).