logoalt Hacker News

datherytoday at 2:16 AM1 replyview on HN

That's correct. Input caching helps, but even then at e.g. 800k tokens with all of them cached, the API price is $0.50 * 0.8 = $0.40 per request, which adds up really fast. A "request" can be e.g. a single tool call response, so you can easily end up making many $0.40 requests per minute.


Replies

acjohnson55today at 3:23 AM

Interesting, so a prompt that causes a couple dozen tool calls will end up costing in the tens of dollars?

show 1 reply