This does surprise me, because you'd think that even if they crank up the filter's sensiti...

mhl47 • yesterday at 6:39 PM • 2 replies • view on HN

This does surprise me, because you'd think that even if they crank up the filter's sensitivity at the expense of specificity, an LLM company wouldn't simply design a filter that triggers on keywords in a completely unrelated context.

Replies

orbital-decay • today at 8:10 AM

Smart classifiers are slow and susceptible to jailbreaking themselves, dumb classifiers are fast but dumb so they need to be either overzealous or useless. Same story as with Gemini's guardrails.

alt Hacker News

Replies