> However, I think the battle is already lost - the nature of copyright and open source code philosophy (currently) means that there isn't any real way of preventing your code being used to train AI.
Laws should make it a double-edged sword, make distillation explicitly legal.
Not much else they can do.
> Laws should make it a double-edged sword, make distillation explicitly legal.
Knowledge distillation is already legal. Current case law says none of the outputs of any model are protected by copyright, so anyone can use model outputs for whatever they want, including distillation. That is why AI companies resort to ToS clauses to prohibit distillation and the training of competing models.