logoalt Hacker News

zozbot234yesterday at 10:55 AM1 replyview on HN

The training gives you a very lossy version of the original data (the smaller the model, the lossier it is; very small models will ultimately output gibberish and word salad that only loosely makes some sort of sense) but it's the right format for generalization. So you actually want both, they're highly complementary.


Replies

spockzyesterday at 11:15 AM

[dead]