No, it was designed on paper by someone with no understanding of prompt caching and no consideration of latency or token costs
You mean Google doesn't understand prompt caching, latency or token costs?
Or Google teams fail to communicate for such things?
You mean Google doesn't understand prompt caching, latency or token costs?
Or Google teams fail to communicate for such things?