1 & 2 are already happening; these startups take on brand liability and trust to do so
3 depends on how companies want to measure it, but users not submitting a satisfaction score is not a good thing
you can use a model w/o reasoning, plus various tricks to simulate low latency