logoalt Hacker News

kridsdale101/21/20250 repliesview on HN

That would be a true Mixture of Experts.

I sometimes put the 4 biggest models like this to converge on an optimal solution