logoalt Hacker News

2001zhaozhaoyesterday at 10:36 PM10 repliesview on HN

AKA, if a malicious skill got into your AI agent, you're cooked.

I think this isn't surprising, nor do I think it should be considered a prompt injection at all. An AI skill is akin to a plugin for traditional software - if you install a malicious IDE extension or Outlook plugin, the attacker can also do whatever they want to the PC and exfiltrate whatever data they want to. So this article is a big nothingburger.


Replies

mdavidnyesterday at 11:00 PM

If this can be exploited via a skill, then it can be exploited via untrusted input inserted into context. Does Cowork help with reading email?

show 1 reply
0gsyesterday at 10:47 PM

i think people are probably already doing it. i made a skill scanner but it's also just easy to download a zip and inspect the contents... but people are loading these things remotely. i agree that it is easy to not install a pentester's magic skill, but the attack capabilities a skill can have are pretty insane. people should just make their own is my pov.

show 1 reply
bberenbergyesterday at 11:05 PM

Only if it has access to exfiltrate data. We deny by default and the company has to allowlist each individual destination.

show 1 reply
nicoyesterday at 10:40 PM

I wonder if via-skill could become a software distribution channel. A bit like what has happened with LLM wiki

aabhayyesterday at 10:40 PM

Its actually even worse — its advertising for their product

Yokohiiitoday at 12:21 AM

> nor do I think it should be considered a prompt injection at all

Can we stop the apologetic framing? It's increasingly common to create exploits from multiple vulnerabilities. Each one is bad. Downloading corporate malware is stupid. Adding random prompt injection is reckless. Insane to run autonomous agents on top of it.

Prompt injection is more serious in this regard, because there is no known solid protection. All the other problems are failure in process, prompt injection is failure at the first thought.

SpicyLemonZestyesterday at 10:53 PM

Unlike plugins in traditional software, skills do not represent a carveout from any security boundary nor run with elevated trust. They're just selectively loaded context. Anything you can convince an agent to do with a skill you can convince it to do without one.

show 1 reply
ares623yesterday at 11:14 PM

Thankfully inserting malicious skills is not something that can easily be done, you need to a lot of things wrong and the attacker to do a lot of things right in order for it to be exploited.

show 1 reply
cyanydeezyesterday at 10:43 PM

ai skill is not just a plugin. given the right model, supposedly, it can do much more. since everyones harness tends to be tied to the model, it has a whole tool set to use.

Jabrovyesterday at 10:39 PM

It's yet another surface for dependency attacks