Hacker News

sdpmas, yesterday at 10:33 PM

yeah, we do incorporate some of the findings from the paper in our repo, like aggressive regularization and ensembling.


Replies

_0ffh, today at 12:33 AM

I see you already mention diffusion - iirc there was a result not too long ago that diffusion models keep improving with more epochs for longer than AR models do.
