Or maybe models that are much more task-focused? Like models that are trained on just math & coding?
isn't that what the mixture of experts trick that all the big players do is? Bunch of smaller, tightly focused models
isn't that what the mixture of experts trick that all the big players do is? Bunch of smaller, tightly focused models