logoalt Hacker News

nee1rlast Tuesday at 5:46 PM1 replyview on HN

real


Replies

sheepscreekyesterday at 11:48 PM

Are you guys affiliated with Meta’s ex-CTO in any way? I remember he famously implied that LLMs hyped. The demos are very impressive. Does this use an attention based mechanism too? Just trying to understand (as a layman) how these models handle context and if long contexts lead to weaker results. Could be catastrophic in the real world!

show 1 reply