We're using small language models to detect prompt injection. Not too cool, but at least we can...

eb0la • 01/21/2025 • 1 reply • view on HN

We're using small language models to detect prompt injection. Not too cool, but at least we can publish some AI-related stuff on the internet without a huge bill.

Replies

sitkack • 01/21/2025

What kind of prompt injection attacks do you filter out? Have you tested with a prompt tuning framework?

alt Hacker News

Replies