Did you try setting thinkingLevel to minimal?
thinkingConfig: { thinkingLevel: "minimal" }
More about it here https://ai.google.dev/gemini-api/docs/gemini-3#new_api_featu...
Yes, I tried it with minimal, and it takes roughly 3 seconds on prompts that Flash 2.5 answers in about 1 second.
On that note, it would be nice to get these benchmark numbers broken down by reasoning setting.
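For anyone who wants to collect per-setting numbers themselves, here is a rough sketch of a timing harness. It is not tied to any SDK: `send` is a hypothetical callback you would implement with your own client call, passing the level through as thinkingLevel, and `timeLevels` and `median` are names made up for this example.

```typescript
// Hypothetical callback: sends one prompt at the given thinking level.
// Implement it with whatever client you use; nothing here is a real SDK call.
type SendPrompt = (prompt: string, level: string) => Promise<unknown>;

// Median of a non-empty list of numbers (less noisy than the mean for latencies).
function median(samples: number[]): number {
  const sorted = [...samples].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1
    ? sorted[mid]
    : (sorted[mid - 1] + sorted[mid]) / 2;
}

// Runs the same prompt `runs` times per level and returns the median
// wall-clock latency in milliseconds for each level.
async function timeLevels(
  send: SendPrompt,
  prompt: string,
  levels: string[],
  runs: number = 5
): Promise<Map<string, number>> {
  const results = new Map<string, number>();
  for (const level of levels) {
    const samples: number[] = [];
    for (let i = 0; i < runs; i++) {
      const start = Date.now();
      await send(prompt, level);
      samples.push(Date.now() - start);
    }
    results.set(level, median(samples));
  }
  return results;
}
```

Calling something like `timeLevels(send, prompt, ["minimal", "low"])` would give one median per setting. Requests run sequentially so they don't compete with each other, and this measures full response time, not time to first token.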