
nl, today at 1:30 PM

I mean, I guess, but the diffusion objective and the ability to decode simultaneously both dictate pretty different architectures in practice.


Replies