logoalt Hacker News

in-silicotoday at 4:22 AM0 repliesview on HN

> nobody has tried to generalize it for example by combining the recurrence concept with next token prediction

Here you go: https://arxiv.org/abs/2502.05171