20 tok/s is an average. It can be more, it can be less. If you are running off-peak I'm sure you'd get some crazy number.
That doesn’t matter when you have the average. Even if you are somehow able to get 10000tok/s during off peak times, by virtue of how averages work, you’re still only getting 52M tokens per month (as calculated above).
Why wouldn't developers just do llm arbitrage against openrouter if it is a better deal?
That doesn’t matter when you have the average. Even if you are somehow able to get 10000tok/s during off peak times, by virtue of how averages work, you’re still only getting 52M tokens per month (as calculated above).