logoalt Hacker News

mips_avatartoday at 6:06 AM1 replyview on HN

You can fully train a 1.6b model on a single 3090. That’s a reasonably big model.


Replies

electroglyphtoday at 6:41 AM

you can train it, but not fully