Are there platforms that make such training more streamlined? Say I have some definition of success ...

smattiso • 05/14/2025 • 1 reply • view on HN

Are there platforms that make such training more streamlined? Say I have some definition of success for a given problem and it’s data how do I go about generating said RL model as fast and easily as possible?

Replies

vrm • 05/14/2025

We're working on an OSS industrial-grade version of this at TensorZero but there's a long way to go. I think the easiest out of the box solution today is probably OpenAI RFT but that's a partial solve with substantial vendor lock-in.

alt Hacker News

Replies