Hacker News

theanonymousone, today at 2:39 PM

Isn't caching a server-side thing? How does the agent affect it, significantly at least?


Replies

embedding-shape, today at 2:52 PM

Say you put the current time, down to the second, in the system prompt, which is the message that goes in front of the entire conversation. Then basically nothing will be cached, and every agent turn needs to ingest the entire session over and over, because the cache matches on the start of the request and that start now changes every time. Contrast that with not doing it: the backend can leverage caching all the way up to the latest message, as nothing before it has changed.
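To make that concrete, here is a toy sketch of the effect. It assumes the backend reuses whatever leading tokens a request shares with the previous one; real backends may cache in blocks and use a real tokenizer, so whitespace splitting and the names `build_request` and `shared_prefix_len` are just stand-ins I made up for illustration.

```python
from typing import List


def build_request(system_prompt: str, turns: List[str]) -> List[str]:
    """Flatten system prompt + conversation into tokens.

    Whitespace splitting stands in for a real tokenizer (assumption).
    """
    return (system_prompt + " " + " ".join(turns)).split()


def shared_prefix_len(prev: List[str], curr: List[str]) -> int:
    """Count leading tokens that are identical in both requests.

    In this toy model, that is the part a prefix cache could reuse.
    """
    n = 0
    for a, b in zip(prev, curr):
        if a != b:
            break
        n += 1
    return n


# Two consecutive agent turns: the second just appends one message.
turns_before = ["user: list the files", "agent: ran ls"]
turns_after = turns_before + ["user: now open main.py"]

# Case 1: stable system prompt.
stable_prev = build_request("You are a coding agent.", turns_before)
stable_curr = build_request("You are a coding agent.", turns_after)

# Case 2: system prompt carries the current time down to the second.
stamped_prev = build_request("You are a coding agent. Time: 14:52:07", turns_before)
stamped_curr = build_request("You are a coding agent. Time: 14:52:31", turns_after)

stable_hit = shared_prefix_len(stable_prev, stable_curr)
stamped_hit = shared_prefix_len(stamped_prev, stamped_curr)

print(f"stable prompt:  {stable_hit} of {len(stable_curr)} tokens reusable")
print(f"stamped prompt: {stamped_hit} of {len(stamped_curr)} tokens reusable")
```

With the stable prompt, everything from the previous request is reusable and only the new message has to be processed. With the timestamp, the match stops at the timestamp token, so the whole conversation after it is reprocessed on every turn even though it did not change.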
