You can still make money on open weight models.
The compute required to run these models is still very far out of reach for the average consumer, yet known enthusiast, therefore they still sell inference, whilst also getting consumer goodwill for providing open weights.
And the efficiency! Big accelerator cards are ~100x the throughput per watt in terms of raw processing power.