logoalt Hacker News

BoppreHtoday at 11:10 AM2 repliesview on HN

It's fast, but it'll miss a ton of cases. This feels like it would be better served by a prompt instruction, or an additional tiny neural network.

And some of the entries are too short and will create false positives. It'll match the word "offset" ("ffs"), for example. EDIT: no it won't, I missed the \b. Still sounds weird to me.


Replies

hk__2today at 11:16 AM

It’s fast and it matches 80% of the cases. There’s no point in overengineering it.

show 1 reply
vharucktoday at 11:18 AM

The pattern only matches if both ends are word boundaries. So "diffs" won't match, but "Oh, ffs!" will. It's also why they had to use the pattern "shit(ty|tiest)" instead of just "shit".

show 1 reply