logoalt Hacker News

bayindirhlast Wednesday at 12:44 PM5 repliesview on HN

Well, the first 90% is easy, the hard part is the second 90%.

Case in point: Self driving cars.

Also, consider that we need to pirate the whole internet to be able to do this, so these models are not creative. They are just directed blenders.


Replies

throwthrowuknowlast Wednesday at 1:14 PM

Even if Opus 4.5 is the limit it’s still a massively useful tool. I don’t believe it’s the limit though for the simple fact that a lot could be done by creating more specialized models for each subdomain i.e. they’ve focused mostly on web based development but could do the same for any other paradigm.

show 2 replies
literalAardvarklast Wednesday at 1:26 PM

They're not blenders.

This is clear from the fact that you can distill the logic ability from a 700b parameter model into a 14b model and maintain almost all of it.

You just lose knowledge, which can be provided externally, and which is the actual "pirated" part.

The logic is _learned_

show 2 replies
mcfedrlast Wednesday at 2:22 PM

i like to think of LLMs as random number generators with a filter

rat9988last Wednesday at 12:53 PM

> Well, the first 90% is easy, the hard part is the second 90%.

You'd need to prove that this assertion applies here. I understand that you can't deduce the future gains rate from the past, but you also can't state this as universal truth.

show 3 replies