logoalt Hacker News

zardoyesterday at 12:50 PM1 replyview on HN

I'm wondering how much the output quality of a small model could be boosted by taking multiple goes at it. Generate 20 answers and feed them back through with a "rank these responses" prompt. Or doing something like MCTS.


Replies

freakynityesterday at 1:26 PM

Isn't this what thinking models do internally? Chain of thoughts?

show 1 reply