$1.5/m input tokens $9/m output tokens 6x the price of 3.1 flash lite

asar • yesterday at 6:04 PM • 5 replies

$1.5/M input tokens, $9/M output tokens

6x the price of 3.1 Flash-Lite

Replies

"Flash-Lite" is a different product from "Flash", which is more expensive. They couldn't be more confusing with their naming though, especially since they have 3.1 Pro and not 3.1 Flash non-lite.

WarmWash • yesterday at 6:40 PM

I haven't used 3.5 at all yet, but previous Gemini (and Gemma) models use by far the fewest tokens per task of any model.

Cost per task is a more productive measure, but obviously a more difficult one to benchmark.
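To make the cost-per-task point concrete, here's a rough sketch. The $1.5/M and $9/M prices are the ones quoted at the top of the thread; the second model's prices are just those divided by six (taken from the "6x" claim, not from any price list), and every token count is made up for illustration.

```python
def cost_per_task(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Dollar cost of one task, given token counts and per-million-token prices."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Prices quoted in the thread: $1.5/M input, $9/M output.
# Token counts are hypothetical: a model that finishes the task with few tokens.
pricey_but_terse = cost_per_task(20_000, 2_000, 1.5, 9.0)

# Hypothetical model at one sixth of those prices (assumed from the "6x" claim)
# that burns far more tokens on the same task. Token counts are again made up.
cheap_but_verbose = cost_per_task(120_000, 15_000, 0.25, 1.5)

print(f"pricey but terse:  ${pricey_but_terse:.4f} per task")
print(f"cheap but verbose: ${cheap_but_verbose:.4f} per task")
```

With these invented numbers the model that is 6x cheaper per token still ends up slightly more expensive per task, which is why the per-token price alone doesn't settle the comparison.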

iwhalen • yesterday at 6:09 PM

I wonder why they didn't discuss price in the post?

Compare to the GPT-5.5 announcement: https://openai.com/index/introducing-gpt-5-5/

himata4113 • yesterday at 6:07 PM

I don't think input/output pricing matters; 90% of the cost is cache. $0.15 is pretty good, but still very expensive.

John7878781 • yesterday at 6:07 PM

[deleted]
