logoalt Hacker News

FuckButtons01/21/20250 repliesview on HN

I tried llama70b too with the same task, the reasoning seemed more coherent, but it still wound up coming to very invalid conclusions using that reasoning and the output was even further from correct than qwen.