logoalt Hacker News

ionwakeyesterday at 6:59 PM1 replyview on HN

Can you explain what you mean by its bad at agentic stuff?


Replies

karmasimidayesterday at 7:12 PM

Accomplish the task I give to it without fighting me with it.

I think this is classic precision/recall issue: the model needs to stay on task, but also infer what user might want but not explicitly stated. Gemini seems particularly bad that recall, where it goes out of bounds

show 1 reply