> control over the model used
but you lose access to the most capable models, you can run only the small ones
And they run slower and quantized.
And they run slower and quantized.