I think it's hard to take any LLM criticism seriously if the critic doesn't even specify which model they used. Saying "an LLM model" is totally useless for deriving any kind of conclusion.
Yes, I’d be curious about his experience with the GPT-5 Thinking model. So far I haven’t seen any blunders from it.
When talking about the capabilities of a class of tools long term, it makes sense to be general. I think deriving conclusions at all is pretty difficult given how fast everything is moving, but there are some realities we do actually know about how LLMs work, and we can talk about those.
Knowing that ChatGPT output good tokens last Tuesday but Sonnet didn't does not tell us much about the future of these tools in general.