Hacker News

s3p yesterday at 6:17 PM

Don't get me started on the thinking tokens. Since 2.5P the thinking has been insane: "I'm diving in to the problem", "I'm fully immersed", or "I'm meticulously crafting the answer".


Replies

ceroxylon yesterday at 11:26 PM

I once saw "now that I've slept on it" in Gemini's CoT... baffling.

raducu yesterday at 10:31 PM

> Don't get me started on the thinking tokens.

Claude provides nicer explanations, but when it comes to CoT tokens or just prompting the LLM to explain itself -- I'm very skeptical of how truthful the explanation is.

Not because the LLM lies, but because humans do the same thing -- when asked how they figured something out, they'll provide a reasonable-sounding chain of thought, but it's not how they actually figured it out.

foz yesterday at 7:14 PM

This is part of the reason I don't like to use it. I feel it's hiding things from me, compared to other models that very clearly share what they are thinking.

dist-epoch yesterday at 7:04 PM

That's not the real thinking, it's a heavily summarized view of it.