logoalt Hacker News

mycalltoday at 12:08 PM4 repliesview on HN

The problem with regex is multi-language support and how big the regex will bloat if you to support even 10 languages.


Replies

doublesockettoday at 12:51 PM

Supporting 10 different languages in regex is a drop in the ocean. The regex can be generated programmatically and you can compress regexes easily. We used to have a compressed regex that could match any placename or street name in the UK in a few MB of RAM. It was silly quick.

show 2 replies
TeMPOraLtoday at 12:19 PM

We're talking about Claude Code. If you're coding and not writing or thinking in English, the agents and people reading that code will have bigger problems than a regexp missing a swear word :).

show 3 replies
crimsonnoodle58today at 1:06 PM

They only need to look at one language to get a statistically meaningful picture into common flaws with their model(s) or application.

If they want to drill down to flaws that only affect a particular language, then they could add a regex for that as well/instead.

b112today at 12:15 PM

Did you just complain about bloat, in anything using npm?