Could you give us an idea of what you’re hoping for that is not possible to derive from training dat...

zingar • yesterday at 8:07 AM • 1 reply • view on HN

Could you give us an idea of what you’re hoping for that is not possible to derive from training data of the entire internet and many (most?) published books?

Replies

techpression • yesterday at 8:42 AM

This is the problem, the entire internet is a really bad set of training data because it’s extremely polluted.

Also the derived argument doesn’t really hold, just because you know about two things doesn’t mean you’d be able to come up with the third, it’s actually very hard most of the time and requires you to not do next token prediction.

alt Hacker News

Replies