logoalt Hacker News

abhijatlast Friday at 10:57 AM1 replyview on HN

Isn't the context window the same for all plans, 200k? You would hit usage limits?


Replies

billyjoboblast Friday at 3:24 PM

If you send the full 200k tokens on every request you will get very few requests before you hit the token limit. Caching reduces the number sent but I don't know how much they can cache?