It’s crazy that people don’t understand cached tokens despite them being priced separately on the cost pages of every single provider.
> It’s crazy that people don’t understand cached tokens despite them being priced separately on the cost pages of every single provider.
Depends on your subscription type. Some are just a flat monthly fee.
Its crazy that people think caching is such a silver bullet, despite the cost of long context windows still being ridiculously high even with caching. https://blog.exe.dev/expensively-quadratic https://news.ycombinator.com/item?id=47000034