logoalt Hacker News

Context Is Software, Weights Are Hardware

17 pointsby maxaravindlast Sunday at 12:29 PM5 commentsview on HN

Comments

maxaravindlast Sunday at 12:30 PM

Author here.

I spent the last weekend thinking about continual learning. A lot of people think that we can solve long term memory and learning in LLMs by simply extending the context length to infinity. I analyse a different perspective that challenges this assumption.

Let me know how you think about this.

show 2 replies