Well it's cool that they released a paper, but at this point it's been 11 months and you c...

mapmeld • last Sunday at 3:02 PM • 7 replies • view on HN

Well it's cool that they released a paper, but at this point it's been 11 months and you can't download a Titans-architecture model code or weights anywhere. That would put a lot of companies up ahead of them (Meta's Llama, Qwen, DeepSeek). Closest you can get is an unofficial implementation of the paper https://github.com/lucidrains/titans-pytorch

Replies

alyxya • last Sunday at 4:34 PM

The hardest part about making a new architecture is that even if it is just better than transformers in every way, it’s very difficult to both prove a significant improvement at scale and gain traction. Until google puts in a lot of resources into training a scaled up version of this architecture, I believe there’s plenty of low hanging fruit with improving existing architectures such that it’ll always take the back seat.

➕ show 5 replies

root_axis • last Sunday at 7:07 PM

I don't think the comparison is valid. Releasing code and weights for an architecture that is widely known is a lot different than releasing research about an architecture that could mitigate fundamental problems that are common to all LLM products.

innagadadavida • last Sunday at 8:49 PM

Just keep in mind it is performance review time for all the tech companies. Their promotion of these seems to be directly correlated with that event.

SilverSlash • last Monday at 12:29 AM

The newer one is from late May: https://arxiv.org/abs/2505.23735

mupuff1234 • last Monday at 4:39 AM

> it's been 11 months

Is that supposed to be a long time? Seems fair that companies don't rush to open up their models.

informal007 • last Sunday at 3:25 PM

I don't think model code is a big deal compared to the idea. If public can recognize the value of idea 11 months ago, they could implement the code quickly because there are so much smart engineers in AI field.

➕ show 2 replies

AugSun • last Monday at 1:27 AM

Gemini 3 _is_ that architecture.

➕ show 1 reply

alt Hacker News

Replies