logoalt Hacker News

pants201/20/20251 replyview on HN

Amazing progress by open-source. However, the 64K input tokens and especially the 8K output token limit can be frustrating vs o1's 200K / 100K limit. Still, at 1/30th the API cost this is huge.


Replies

dtquad01/20/2025

I don't know why people are ignoring this and posting hyperbolic statements like "it's all over for OpenAI and Google".

One of the cheaper Gemini models is actually only 8B and a perfect candidate for a release as a FOSS Gemma model but the Gemini 8B model contains hints of the tricks they used to achieve long context so as business strategy they haven't released it as Gemma FOSS model yet.

show 2 replies