Hacker News

cubefox, today at 1:41 PM

Linear architectures are heavily used in image diffusion models, at least. More so, in fact, than in language models.