Isn't the context window the same for all plans, 200k? You would hit usage limits?

abhijat • last Friday at 10:57 AM • 1 reply • view on HN

Replies

If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?

alt Hacker News

Replies