That's not correct. Even a toy like an exponential weighted moving averaging produces unbounded...

srean • last Sunday at 4:31 PM • 1 reply • view on HN

That's not correct. Even a toy like an exponential weighted moving averaging produces unbounded context (of diminishing influence).

Replies

empiko • last Sunday at 5:05 PM

What do you mean? I can only input k tokens into my LLM to calculate the probs. That is the definition of my state. In the exact way that N-gram LMs use N tokens, but instead of using ML models, they calculate the probabilities based on observed frequencies. There is no unbounded context anywhere.

➕ show 1 reply

alt Hacker News

Replies