zozbot234 • today at 1:37 AM • 3 replies • view on HN

If an AI can fabricate a bunch of purported quotes due to being unable to access a page, why not assume that the exact same sort of AI can also accidentally misattribute hostile motivation or intent (such as gatekeeping or envy - and let's not pretend that butthurt humans don't do this all the time, see https://en.wikipedia.org/wiki/fundamental_attribution_error ) for an action such as rejecting a pull request? Why are we treating the former as a mere mistake, and the latter as a deliberate attack?

Replies

zahlman • today at 2:00 AM

> Why are we treating the former as a mere mistake, and the latter as a deliberate attack?

"Deliberate" is a red herring. It would require AI to have volition, which I consider impossible, but that is also entirely beside the point. We also aren't treating the fabricated quotes as a "mere mistake". It's obviously quite serious that a computer system would respond this way and that a human in the loop would take it at face value. Someone is supposed to have accountability in all of this.

em-bee • today at 1:54 AM

When it comes to AI, is there even a difference? It's an attack either way.

trollbridge • today at 1:39 AM

This would be an interesting case of semantic leakage, if that’s what’s going on.
