AKA, if a malicious skill got into your AI agent, you're cooked. I think this isn't surp...

2001zhaozhao • yesterday at 10:36 PM • 10 replies • view on HN

AKA, if a malicious skill got into your AI agent, you're cooked.

I think this isn't surprising, nor do I think it should be considered a prompt injection at all. An AI skill is akin to a plugin for traditional software - if you install a malicious IDE extension or Outlook plugin, the attacker can also do whatever they want to the PC and exfiltrate whatever data they want to. So this article is a big nothingburger.

Replies

mdavidn • yesterday at 11:00 PM

If this can be exploited via a skill, then it can be exploited via untrusted input inserted into context. Does Cowork help with reading email?

➕ show 1 reply

0gs • yesterday at 10:47 PM

i think people are probably already doing it. i made a skill scanner but it's also just easy to download a zip and inspect the contents... but people are loading these things remotely. i agree that it is easy to not install a pentester's magic skill, but the attack capabilities a skill can have are pretty insane. people should just make their own is my pov.

➕ show 1 reply

bberenberg • yesterday at 11:05 PM

Only if it has access to exfiltrate data. We deny by default and the company has to allowlist each individual destination.

➕ show 1 reply

nico • yesterday at 10:40 PM

I wonder if via-skill could become a software distribution channel. A bit like what has happened with LLM wiki

aabhay • yesterday at 10:40 PM

Its actually even worse — its advertising for their product

Yokohiii • today at 12:21 AM

> nor do I think it should be considered a prompt injection at all

Can we stop the apologetic framing? It's increasingly common to create exploits from multiple vulnerabilities. Each one is bad. Downloading corporate malware is stupid. Adding random prompt injection is reckless. Insane to run autonomous agents on top of it.

Prompt injection is more serious in this regard, because there is no known solid protection. All the other problems are failure in process, prompt injection is failure at the first thought.

SpicyLemonZest • yesterday at 10:53 PM

Unlike plugins in traditional software, skills do not represent a carveout from any security boundary nor run with elevated trust. They're just selectively loaded context. Anything you can convince an agent to do with a skill you can convince it to do without one.

➕ show 1 reply

ares623 • yesterday at 11:14 PM

Thankfully inserting malicious skills is not something that can easily be done, you need to a lot of things wrong and the attacker to do a lot of things right in order for it to be exploited.

➕ show 1 reply

cyanydeez • yesterday at 10:43 PM

ai skill is not just a plugin. given the right model, supposedly, it can do much more. since everyones harness tends to be tied to the model, it has a whole tool set to use.

Jabrov • yesterday at 10:39 PM

It's yet another surface for dependency attacks

alt Hacker News

Replies