The training gives you a very lossy version of the original data (the smaller the model, the lossier...

zozbot234 • yesterday at 10:55 AM • 1 reply • view on HN

The training gives you a very lossy version of the original data (the smaller the model, the lossier it is; very small models will ultimately output gibberish and word salad that only loosely makes some sort of sense) but it's the right format for generalization. So you actually want both, they're highly complementary.

Replies

spockz • yesterday at 11:15 AM

[dead]

alt Hacker News

Replies