Isn't the context window the same for all plans, 200k? You would hit usage limits?
If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?
If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?