literalAardvark, last Wednesday at 1:26 PM

They're not blenders.

This is clear from the fact that you can distill the logic ability of a 700B-parameter model into a 14B-parameter model and retain almost all of it.

You just lose knowledge, which can be provided externally, and which is the actual "pirated" part.

The logic is _learned_.


Replies

encyclopedism, last Wednesday at 3:17 PM

It hasn't learned any LOGIC. It has 'learned' patterns from the input.

bayindirh, last Wednesday at 1:31 PM

Are there any recent publications on this, so I can refresh my understanding of the matter?
