logoalt Hacker News

macleginnyesterday at 8:28 PM1 replyview on HN

Not the worst way to make money, but if internet-scale data were not enough to reduce errors to a somewhat tolerable margin, how much data do they hope to collect in this manner?


Replies

jmalickiyesterday at 8:45 PM

Right now, this is a 10-figure run rate industry.

They are generating a lot of this. Also remember it's not just quantity, it's roughly active learning - they're paying for training data that's at the classification boundary, which is way more valuable.

I have gotten offers for contracts for full time jobs at high rates with AI labs to do this.

Meta has reallocated a lot of their full time SWE staff to do this.

All of this has rapidly accelerated within the last 6 months, who knows far it will go, if someone showed me a Kalshi bet that 10% of the college educated population of the US would be doing this as their primary job by the end of 2027, I wouldn't have the guts to bet against it.

10% of physicians' earnings doing this? Yeah that would totally track.

It doesn't seem like there's a limit. There's a shortage of GPUs and TSMC can only scale up so fast, so the AI labs found something else to spend money on.

show 3 replies