And that is how it should be - the knowledge that the LLM was trained on should be free, and cannot (and should never) be gatekept behind money.
It's merely the hardware that should be charged for - which ought to drop in price if/when the demand for it rises. However, hardware is a bottleneck at the moment, and it's hard to see how that gets resolved amid the current US practice of sanctioning anyone who would try.
No, a lot of the data they were trained on was pirated.
Is there no value in how the training was done, such that the knowledge is accessible via inference in a particularly useful way?