As far as I can tell they don’t say which LLM they used, which is kind of a shame, as there is a huge range of capabilities even among newly released LLMs (e.g. reasoning vs. not).
It doesn't matter. These limitations are fundamental to LLMs, so every LLM that will ever be made will suffer from these problems.
ChatGPT, Claude, Grok and Google AI Overviews (whatever powers the latter) were all used in one or more of these examples, in various configurations. I think they can perform differently, and I often try more than one when the first try doesn't work well. But I don't think there's any fundamental difference in the principle of their operation, and I don't think there ever will be; a real change would take another major breakthrough.