logoalt Hacker News

jeyyesterday at 7:54 PM4 repliesview on HN

This seems to be incorporated into current LLM generations already -- when code execution is enabled both GPT-5.x and Claude 4.x automatically seem to execute Python code to help with reasoning steps.


Replies

fsfodyesterday at 9:28 PM

I remember seeing that GPT-5 had two python tools defined in its leaked prompt one them would hide the output from user visible chain of thought UI.

Esophagus4yesterday at 9:14 PM

Same with CoT prompting.

If you compare the outputs of a CoT input vs a control input, the outputs will have the reasoning step either way for the current generation of models.

logicprogyesterday at 8:11 PM

Yeah, this is honestly one of the coolest developments of new models.