logoalt Hacker News

bigiaintoday at 6:12 AM5 repliesview on HN

And next month you'll need to add on "Claude Database Pro" or you'll just get a working (for demo purposes with dozens of db rows) but completely un indexed database schema and a refusal to optimise SQL requests.

And the month after you'll need "Claude DataScience Pro" to get any Python Pandas or NumPy code generated.

And and and...


Replies

ben_wtoday at 8:45 AM

While this is a perfectly reasonable thing to expect when the models are competent enough, half the conversation on places like Hacker News are about all the times an LLM has produced garbage that was harmful to a business either by hallucinations, by deleting something critical during the work, or by hitting some endpoint way too often and denial-of-servicing it.

Right now, the software guardrails in LLMs are useful for the same kinds of reasons factories have hardware guardrails: to reduce the rate at which errors become "incidents".

Just because they sometimes delete the production database rather than sometimes spilling a thousand tons of incandescent molten metal over a factory floor, doesn't mean LLMs are safe enough to be used the way they're actually being used.

https://simonwillison.net/2025/Dec/10/normalization-of-devia...

animuchantoday at 7:55 AM

This is why I'm thankful for Chinese LLM research. They'll keep us honest.

bandramitoday at 10:06 AM

Same thing with the weird push towards humanoid robots.

"They can do anything!"

Sure, once you subscribe to the $15/mo laundry package, the $25/mo lawn care package (with the $10/mo hedge trimmer upgrade), and the $10/mo dog-walking package.

show 1 reply
patatestoday at 7:06 AM

Isn't this inline with trying to leave no money on the table?

I'd hate it, sure, but it wouldn't surprise me.

goosejuicetoday at 9:02 AM

This is an incredibly unlikely scenario