Hacker News

sumitkumar, today at 8:04 AM

Yes, this is more like compression for remembering than for learning or understanding.


Replies

Lplololopo, today at 8:38 AM

Compression is the reason these models are able to learn and understand.

My brain is doing the exact same thing.

I learned enough to compress concepts like a bike: what a bike is, what it does, and what I can use it for.

Ask an LLM and it will answer much as a human would.

Blind people learn the concept of a bike too, and in a similar way: by description.

LLMs simply have so much text data available, and can ingest all of it, that the LLM's compression algorithm doesn't have to be as good or as finely tuned as ours.

But I would assume that Yann LeCun's JEPA or other breakthroughs in the next few years will get us there.
