logoalt Hacker News

alasanotoday at 12:54 PM0 repliesview on HN

My favorite Google LLM benchmark is asking Gemini models to create a script that fetches API usage (just request counts) for a project from GCP.

100% failure rate.