why not an inbetween scenario like using a managed inference provider to host your own models?
what would be the advantage?
what would be the advantage?