logoalt Hacker News

tsunamifuryyesterday at 11:34 PM1 replyview on HN

The literally churn out random garbage and are trained over time for that garbage to look more and more like an acceptable outcome to humans.

It’s training monkeys at typewriters through reinforcement.


Replies

dparktoday at 12:00 AM

> trained over time

So not random.

> acceptable outcome to humans

And not garbage.

It’s real weird to see people argue that LLM output is no different than random gibberish and then handwave over the fact that it’s clearly not with terms like “training”, as if a steam of random garbage is trainable.