Hacker News

_0ffh today at 12:33 AM

I see you already mention diffusion - iirc there was a result not too long ago that diffusion models keep improving with more epochs for longer than AR models do.


Replies

sdpmas today at 12:37 AM

diffusion is promising, but it's still an open question how much more data-efficient it is compared to AR. in practice, you can also keep training AR for many epochs with high enough regularization, so let's see.
