Hacker News

BarryMilo, yesterday at 1:03 PM (1 reply)

I seem to remember that's one of the first things they tried, but the general models tended to win out. Turns out there's more to learn from all code/discussions than from just JS.


Replies

justinlivi, yesterday at 9:16 PM

From my own empirical research, the generalized models acting as specialists outperform both the tiny models acting as specialists and the generalist models acting as generalists. It seems that if peak performance is what you're after, then having a broad model act as several specialized models is the most impactful.
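A rough sketch of what "a broad model acting as several specialized models" could look like in practice, assuming the specialization is done purely through role prompts over one shared general model. The role names, the prompts, and the `echo_model` stand-in are all hypothetical and only there so the example runs offline; the comment above does not say how the specialists were actually set up.

```python
from typing import Callable, Dict

# Hypothetical specialist roles; names and prompts are illustrative only.
SPECIALIST_PROMPTS: Dict[str, str] = {
    "javascript": "You are a JavaScript expert. Answer only as a JS specialist.",
    "sql": "You are a SQL expert. Answer only as a database specialist.",
    "review": "You are a code reviewer. Point out bugs and risky patterns.",
}

# A model is anything that maps (system_prompt, user_prompt) to a completion.
Model = Callable[[str, str], str]


def make_specialists(model: Model) -> Dict[str, Callable[[str], str]]:
    """Wrap one general model into several role-specific callables."""

    def bind(system_prompt: str) -> Callable[[str], str]:
        # Each specialist shares the same underlying model and differs
        # only in the system prompt it is called with.
        return lambda user_prompt: model(system_prompt, user_prompt)

    return {role: bind(prompt) for role, prompt in SPECIALIST_PROMPTS.items()}


def echo_model(system_prompt: str, user_prompt: str) -> str:
    """Stand-in for a real model call (assumption: swap in your own client)."""
    role_sentence = system_prompt.split(".")[0]
    return f"[{role_sentence}] {user_prompt}"


specialists = make_specialists(echo_model)
print(specialists["javascript"]("Why is `this` undefined here?"))
```

The point of the design is that every specialist keeps the general model's broad training and only the framing changes, which is the setup being compared against small single-purpose models.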