logoalt Hacker News

iamflimflam1yesterday at 6:55 PM1 replyview on HN

It's working for me - it does max out my 64GB though.


Replies

sheepscreekyesterday at 7:04 PM

Wow. I always forget how unlike autoregressive models, diffusion models are heavier on resources (for the same number of parameters).