Use local OSS models then? They aren’t as good, and you need beefy hardware (either Apple silicon or NVIDIA GPUs). But they're totally workable, and you avoid the things you dislike directly.
"Not as good and costs a lot in hardware" still sounds like I'm at a disadvantage.
"Not as good and costs a lot in hardware" still sounds like I'm at a disadvantage.