Curious about how much latency this adds (per input token)? Obviously depends on your computer, but it's it ~10s or ~1s?
Also, how does this deal with inquiries when piece of PII is important to the task itself? I assume you just have to turn it off?