$1.50/M input tokens, $9/M output tokens.
6x the price of 3.1 Flash-Lite.
I haven't used 3.5 at all yet, but previous Gemini (and Gemma) models are by far the most token-light per task of any models I've seen.
Cost per task is a more productive measure, but obviously a more difficult one to benchmark.
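To make the cost-per-task point concrete, here's a rough sketch. The $1.50/M and $9/M rates are the ones quoted in this thread; the Flash-Lite rates are just those divided by 6 (assuming the "6x" applies to both input and output), and the token counts are made-up numbers for illustration, not measurements.

```python
def cost_per_task(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Dollar cost of one task, given token counts and $/1M-token prices."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000

# Rates quoted in the thread.
FLASH = {"in_price_per_m": 1.50, "out_price_per_m": 9.00}
# Assumption: "6x the price" applies uniformly to input and output rates.
FLASH_LITE = {"in_price_per_m": 1.50 / 6, "out_price_per_m": 9.00 / 6}

# Hypothetical token counts for the same task on each model (not measured).
flash_cost = cost_per_task(20_000, 2_000, **FLASH)
lite_cost = cost_per_task(60_000, 10_000, **FLASH_LITE)

print(f"Flash:      ${flash_cost:.4f} per task")
print(f"Flash-Lite: ${lite_cost:.4f} per task")
```

With a 6x rate gap, the pricier model only wins on cost per task if it gets the job done in less than a sixth of the tokens, which is why the per-token price alone doesn't settle it.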
I wonder why they didn't discuss price in the post?
Compare to the GPT-5.5 announcement: https://openai.com/index/introducing-gpt-5-5/
I don't think input/output pricing matters much; 90% of the cost is cache. $0.15 is pretty good, but still very expensive.
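A back-of-the-envelope way to see how cache reads can dominate a long session. This is only a sketch: it reads the $0.15 as a per-million cached-input rate (an assumption), reuses the $1.50/M and $9/M rates from upthread, and the session shape (turn count, tokens per turn) is hypothetical. The share it reports depends entirely on those inputs, so it illustrates the mechanism rather than confirming the 90% figure.

```python
def session_cost_breakdown(turns, new_input_per_turn, output_per_turn,
                           cached_price_per_m=0.15,   # assumed cached-input rate
                           input_price_per_m=1.50,
                           output_price_per_m=9.00):
    """Return (total_dollars, cache_share) for a multi-turn session.

    Assumes every turn re-reads all earlier context at the cached rate
    and pays the full input rate only for that turn's new tokens.
    """
    context = 0                      # tokens accumulated before the current turn
    cached = fresh = output = 0
    for _ in range(turns):
        cached += context            # prior context is re-read from cache
        fresh += new_input_per_turn  # new tokens billed at the uncached rate
        output += output_per_turn
        context += new_input_per_turn + output_per_turn

    cache_cost = cached * cached_price_per_m / 1_000_000
    total = (cache_cost
             + fresh * input_price_per_m / 1_000_000
             + output * output_price_per_m / 1_000_000)
    return total, (cache_cost / total if total else 0.0)

# Hypothetical agent-style session: 100 turns, 1,000 new input and 200 output tokens each.
total, cache_share = session_cost_breakdown(turns=100, new_input_per_turn=1_000, output_per_turn=200)
print(f"total ${total:.3f}, cache share {cache_share:.0%}")
```

Because cached tokens grow roughly with the square of the turn count while fresh input and output grow linearly, longer sessions push the cache share higher.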
"Flash-Lite" is a different product from "Flash", which is more expensive. Their naming couldn't be more confusing though, especially since there's a 3.1 Pro but no non-Lite 3.1 Flash.