> If Claude gives me poor or incorrect advice while I’m working on an AI component, I have no way of knowing whether the model was confused, whether my problem is unsolvable, or if some invisible policy restriction quietly kicked in.
Yeah I think there are ways to know, ways involving less dependence on a LLM.
> Yeah I think there are ways to know, ways involving less dependence on a LLM.
This kills the entire value prop of using LLMs as research accelerators, though.