Data sharing agreements permitting, today's inference runs can be tomorrow's training data...

lxgr • today at 2:35 PM • 3 replies • view on HN

Data sharing agreements permitting, today's inference runs can be tomorrow's training data. Presumably the models are good enough at labeling promising chains of thought already.

I could totally imagine "free" inference for researchers under the condition that the reasoning traces get to be used as future training data.

Replies

nhecker • today at 8:32 PM

The site arena.ai does exactly this already, as far as I can tell. (In addition to the whole ranking thing.)

mccoyb • today at 2:48 PM

Agreed, there's no doubt this will happen. It's likely already happening (it feels safe to assume that Anthropic is curating data from the data they record from Claude Code?)

As far as I understand RL scaling (we've already maxxed out RLVR), these machines only get better as long as they have expert reasoner traces available.

Having an expert work with an LLM and successfully solve a problem is high signal data, it may be the only path forward?

My prior is that these companies will take this data without asking you as much as they can.

➕ show 1 reply

the_af • today at 4:38 PM

> Data sharing agreements permitting, today's inference runs can be tomorrow's training data. Presumably the models are good enough at labeling promising chains of thought already.

Wouldn't this lead to model collapse?

➕ show 1 reply

alt Hacker News

Replies