DeepSeek is well known to have ripped off OpenAI API outputs extensively in post-training, to the embarrassing extent that it sometimes describes itself as “a model made by OpenAI”.
At the very least, don’t use the hosted version unless you want your data to go to China.
Just like OAI and copyrighted content. And I would rather my data go to China than the US, personally.
Why do you care how they trained the model? If OAI can train on copyrighted material, then morally, I see no problem with others training on their outputs too.
For what it's worth, even xAI's chatbot has referred to itself as being trained by OAI, simply due to the amount of ChatGPT content available on the web.