About a year ago OpenAI crawled the site of the company I work for at DDoS-level volume. This was despite the robots.txt disallowing it, and despite the reCAPTCHA we managed to put together in time.
We found our data in the outputs of their models but who can do anything about it...
Why hasn't your company sued OpenAI and tried to argue they're violating the Computer Fraud and Abuse Act? Would it really be impossible to argue this?
Unauthorized access, system damage, and maybe even extortion all apply here.
Lawyers can. As long as that data is actually yours in a strictly legal sense, I mean.
I mean, did you check the IPs and make sure they’re from OpenAI? Obviously a fly-by-night AI company is going to set its User-Agent to impersonate a big player.
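For anyone wanting to do that check, here is a rough sketch of one way it could look: compare the source IP of each request that claims to be a given crawler against the CIDR ranges the operator publishes. The ranges below are placeholder documentation blocks, not real crawler ranges, and the function names and log format are made up for illustration. You would substitute whatever list the crawler operator actually publishes.

```python
import ipaddress

# ASSUMPTION: placeholder ranges (RFC 5737 documentation blocks), NOT real
# crawler ranges. Replace with the list the crawler operator publishes.
PUBLISHED_RANGES = ["192.0.2.0/24", "198.51.100.0/24"]


def ip_in_published_ranges(ip, ranges=PUBLISHED_RANGES):
    """Return True if `ip` falls inside any of the given CIDR ranges."""
    addr = ipaddress.ip_address(ip)
    return any(addr in ipaddress.ip_network(cidr) for cidr in ranges)


def flag_spoofed(log_entries, claimed_token="GPTBot"):
    """Return (ip, user_agent) pairs that claim the crawler's User-Agent
    token but do not originate from the published ranges.

    `log_entries` is a hypothetical iterable of (ip, user_agent) tuples,
    e.g. parsed out of an access log.
    """
    return [
        (ip, ua)
        for ip, ua in log_entries
        if claimed_token.lower() in ua.lower()
        and not ip_in_published_ranges(ip)
    ]
```

Anything `flag_spoofed` returns is traffic using the big player's name from an address outside the list you supplied, which is a hint (not proof) that someone else is borrowing the User-Agent.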
> We found our data in the outputs of their models but who can do anything about it...
If the crawlers refuse to voluntarily respect your robots.txt, then you are well within your rights to poison their data.