logoalt Hacker News

irthomasthomasyesterday at 9:24 PM1 replyview on HN

Why frame it as rigging? I assume they would teach the models to improve on tasks the public find interesting. Then we just have to come up with more challenges for it.


Replies

krackersyesterday at 10:09 PM

It's not rigging—it's just RL.