When it comes to LLMs you really cannot draw conclusions from first principles like this. Yes, it sounds reasonable. But things in reality aren't always reasonable.
Benchmark or nothing.
There have been papers on introducing thinking tokens at intermediate positions that get stripped from the output.
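To give a sense of the mechanics (a toy sketch, not any specific paper's method; the <think> token and helper names here are made up):

    # Toy sketch: pad the prompt with hypothetical <think> tokens the model
    # can use as scratch space, then strip them before returning output.
    THINK = "<think>"  # hypothetical special token, not a real vocab entry

    def insert_thinking_tokens(tokens, n=4):
        # Give the model n extra positions to "think" in before answering.
        return tokens + [THINK] * n

    def strip_thinking_tokens(tokens):
        # The user never sees the thinking tokens.
        return [t for t in tokens if t != THINK]

    prompt = ["What", "is", "2", "+", "2", "?"]
    generated = insert_thinking_tokens(prompt) + ["4"]
    print(strip_thinking_tokens(generated))  # prompt tokens plus "4"

The point being: whether extra hidden computation like this actually helps is exactly the kind of thing you can only settle empirically.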