Devil's advocate here: pro and max tier customers for all the major inference providers are loss leaders, judging from the data we have been able to figure out and reverse engineer. They are effectively a marketing exercise.
The real profitability is in selling tokens to enterprise, and enterprise demand is growing so fast that the providers are short on the total number of tokens they can generate per minute. They are prioritising rationally, giving enterprise a better experience instead of optimising for their lowest-paying (and most loss-leading) customers.
We are in a hardware crunch right now, but that won't last forever, and eventually (likely 2028) we will again get the kind of experience from prosumer accounts that we got in January.