logoalt Hacker News

mslayesterday at 10:02 PM1 replyview on HN

About how many training steps are required to get good output?


Replies

b44yesterday at 10:10 PM

not many. diminishing returns start before 1000 and past that you should just add a second/third layer