Titans: Learning to Memorize at Test Time

115 points • by birriel • 01/13/2025 • 15 comments

Comments

gwern • 01/17/2025

Duplicate: https://news.ycombinator.com/item?id=42718166


cs702 • 01/13/2025

Interesting. I like the idea of a meta-mechanism that learns to update an associative memory based on how surprising the data is. The other stuff, reading memory via keys and values and selectively erasing it with gating, looks pretty conventional at first glance. Thank you for sharing this on HN. I've added it to my reading list.
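
To make the "update based on surprise" idea concrete, here is a minimal sketch, not the paper's implementation. It assumes a plain linear memory matrix `M` that maps a key to a value, treats the gradient of the squared read error as the "surprise", and uses fixed scalars where a learned meta-mechanism would produce data-dependent ones. The names `memory_step`, `read`, `lr`, `momentum`, and `forget` are hypothetical, chosen only for illustration.

```python
import numpy as np

def memory_step(M, S, k, v, lr=0.1, momentum=0.9, forget=0.01):
    """One update of a linear associative memory M (key -> value).

    Assumption: lr, momentum, and forget are fixed scalars here; in a
    learned variant they would be predicted from the input.
    """
    err = M @ k - v                # read error for this (key, value) pair
    grad = 2.0 * np.outer(err, k)  # gradient of ||M k - v||^2 w.r.t. M ("surprise")
    S = momentum * S - lr * grad   # running surprise, smoothed with momentum
    M = (1.0 - forget) * M + S     # gate erases a little old memory, then writes
    return M, S

def read(M, q):
    """Read the memory with a query vector."""
    return M @ q

rng = np.random.default_rng(0)
d = 8
k = rng.normal(size=d)
k /= np.linalg.norm(k)             # unit-norm key keeps the step size well behaved
v = rng.normal(size=d)

M = np.zeros((d, d))
S = np.zeros((d, d))
err_before = np.linalg.norm(read(M, k) - v)
for _ in range(200):
    M, S = memory_step(M, S, k, v)
err_after = np.linalg.norm(read(M, k) - v)
```

After repeated exposure to the same pair, `err_after` should be much smaller than `err_before`. The forget gate keeps it from reaching exactly zero, which is the price of being able to erase stale entries.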

EDIT: I'm reminded of this other type of associative memory: https://github.com/glassroom/heinsen_routing. The idea there is to compute a mixture of memories that best predicts the given input sequence. Quite frankly, I don't remember how the whole thing works, but I do remember that it works. It's been a while since I used it, so YMMV. In any case, it may be of interest to you.

