yorwba • today at 12:23 PM

Every interactive system is a potential RL environment. Every CLI, every TUI, every GUI, every API. If you can programmatically take actions to get a result, and the actions are cheap, and the quality of the result can be measured automatically, you can set up an RL training loop and see whether the results get better over time.
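A minimal sketch of such a loop, to make the three conditions concrete. Everything here is an assumption for illustration: `run_action` is a hypothetical stand-in for "invoke the system and score the result automatically", the three actions and their success rates are made up, and the learner is a plain epsilon-greedy bandit rather than any particular RL library.

```python
import random


def run_action(action: int, rng: random.Random) -> float:
    """Hypothetical environment: take an action, return an automatically measured reward.

    In a real setup this would call a CLI, API, etc. and score its output.
    The success probabilities below are invented for the example.
    """
    success_prob = [0.2, 0.5, 0.8]
    return 1.0 if rng.random() < success_prob[action] else 0.0


def train(steps: int = 2000, epsilon: float = 0.1, seed: int = 0):
    """Epsilon-greedy loop: act, measure, update the value estimate, repeat."""
    rng = random.Random(seed)
    n_actions = 3
    counts = [0] * n_actions    # how often each action was tried
    values = [0.0] * n_actions  # running mean reward per action
    rewards = []                # reward history, to check for improvement

    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)  # explore
        else:
            action = max(range(n_actions), key=lambda a: values[a])  # exploit

        reward = run_action(action, rng)  # cheap action, automatic score
        counts[action] += 1
        # Incremental mean update: new_mean = old_mean + (x - old_mean) / n
        values[action] += (reward - values[action]) / counts[action]
        rewards.append(reward)

    return values, rewards


if __name__ == "__main__":
    values, rewards = train()
    half = len(rewards) // 2
    print("mean reward, first half: ", sum(rewards[:half]) / half)
    print("mean reward, second half:", sum(rewards[half:]) / half)
```

Comparing the mean reward over the first and second halves of the run is the "see whether the results get better over time" part. The whole loop only works because `run_action` returns a number without a human in it.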

Replies

radarsat1 • today at 3:43 PM

> and the quality of the result can be measured automatically

this part is nontrivial though
