Hacker News

imenani last Sunday at 8:17 PM

As far as I can tell, they don’t say which LLM they used, which is kind of a shame, as there is a huge range of capabilities even among newly released LLMs (e.g. reasoning vs. non-reasoning models).



yosefk last Sunday at 8:22 PM

ChatGPT, Claude, Grok and Google AI Overviews (whatever powers the latter) were all used in one or more of these examples, in various configurations. I think they can perform differently, and I often try more than one when the first try doesn't work well. I don't think there's any fundamental difference in the principle of their operation, and I think there never will be; there will be another major breakthrough.

lowsong last Sunday at 8:30 PM

It doesn't matter. These limitations are fundamental to LLMs, so every LLM that will ever be made will suffer from these problems.