literalAardvark • last Wednesday at 1:26 PM

They're not blenders.

This is clear from the fact that you can distill the logic ability of a 700B-parameter model into a 14B model and retain almost all of it.

You just lose knowledge, which can be provided externally, and which is the actual "pirated" part.

The logic is _learned_.

encyclopedism • last Wednesday at 3:17 PM

It hasn't learned any LOGIC. It has 'learned' patterns from the input.

bayindirh • last Wednesday at 1:31 PM

Are there any recent publications about it so I can refresh myself on the matter?
