>People still talk about fine tuning dedicated models being effective
>it's still always better to use a larger generalist model than a smaller fine tuned one
Smaller fine-tuned models are still a good fit if they need to run on-premises cheaply and are already good enough. Isn't it their main use case?
Latency and size. Otherwise pretty much useless.