Honestly the biggest problem was/is copyright law. Make everything older than 10-14 years public domain and streaming services would have endless amounts of content always available. Independently operated streaming sites would be all over the internet.
That would also solve the problem of AI training data. Build a data set, wait 14 years, and it's guaranteed to be legal.