logoalt Hacker News

godelskitoday at 7:48 AM0 repliesview on HN

Sure, but they also update the models, especially when things like this go viral. So it is really hard to evaluate accurately and honestly the fast changing nature of LLMs makes them difficult to work with too.