lovelearning • today at 5:40 AM • 4 replies

I thought the title meant the training data used was ethics content and ethical reasoning. Turns out "ethically trained" means the training data used doesn't violate copyright laws.

Replies

RobotToaster • today at 10:04 AM

I thought it was trained using Victorian ethics at first... Like it was only trained on computers powered by coal mined by children.

DonHopkins • today at 7:09 AM

As if copyright laws were ethical.

verdverm • today at 6:21 AM

Wouldn't that training data be past the copyright protection term, making that a no-op?

ImHereToVote • today at 11:06 AM

I believe the works are no longer under copyright. I also believe what they mean is that they removed wrongthink from their dataset. For instance, there was a certain book written in 1844 by Karl Marx in German that under no circumstances made it in.

This ofc means that the LLM is completely pointless.

https://www.marxists.org/archive/marx/works/date/index.htm
