On top of that, on-device models reduce response times, since requests avoid a network round trip, and they can be fully private if the developer chooses to keep all data on the device.