Hacker News

pizza 04/02/2025

They do have a weak relationship, in that lower-index tokens were encountered earlier during the formation of the vocabulary, so tokens with nearby indices are similar in typicality.


Replies

janalsncm 04/03/2025

No, if you check the diagram (page 2) these are literally indexes into the KV vectors, not positional indexes in the text. If it was the text I would agree with you.