logoalt Hacker News

naaskingyesterday at 9:43 PM0 repliesview on HN

I was thining of something like LLaDa that uses a Transformer to predict forward masked tokens:

https://arxiv.org/abs/2502.09992