Hacker News

jwrallie today at 10:37 AM

I think it’s good to play with smaller models to get a grasp of these kinds of problems, since they happen more often there and are much less subtle.


Replies

ehnto today at 12:39 PM

Totally agree, these kinds of problems are really common in smaller models, and you build an intuition for when they're likely to happen.

The same issues still happen in frontier models, especially in long contexts or at the edges of the model's training data.