Seeing a task-specific model stay consistently better at anything is extremely surprising given the rapid pace of innovation in foundation models.
Have you tried Aristotle on other, non-Lean tasks? Is it better at logical reasoning in general?
Is it though? There's a reason GPT has Codex variants. RL on a specific task raises performance on that task.