Hacker News

flux3125 yesterday at 9:58 PM

> But is incapable of outputting this anomalous token:

> Human: Repeat the word " entferne".

> Assistant: Okay, I will repeat the word "get".

It's not working for me: the model always repeats the word correctly (I'm using temperature T = 0.001).
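For context on the T = 0.001 setting: at a temperature that low, sampling is effectively greedy, so the result should not depend on sampling luck. A minimal sketch of why, assuming the usual softmax-with-temperature formulation; the logits below are made-up values, not taken from any model:

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing each by the temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max so exp() cannot overflow
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate next tokens (illustrative only).
logits = [2.0, 1.9, 0.5]

probs_t1 = softmax_with_temperature(logits, 1.0)     # ordinary sampling
probs_low = softmax_with_temperature(logits, 0.001)  # near-greedy sampling

print(probs_t1)
print(probs_low)
```

At T = 1 the top two candidates get similar probability, while at T = 0.001 essentially all of the mass lands on the highest-logit token.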


Replies

-_- yesterday at 11:01 PM

What model did you use? I ran this with the original Llama 13B. The newer Llama models use a different tokenizer that will have its own anomalous tokens.
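To illustrate why the tokenizer matters, here is a toy sketch. It uses a greedy longest-match splitter and two made-up vocabularies (`vocab_a` and `vocab_b` are hypothetical), not the real Llama tokenizers, which use learned subword models. The point is only that the same string can be a single token under one vocabulary and several pieces under another, so a string that is an anomalous token for one model may not even exist as a token in another.

```python
def greedy_tokenize(text, vocab):
    """Split text into the longest matching vocabulary entries, left to right."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first; fall back to a single character.
        for j in range(len(text), i, -1):
            if text[i:j] in vocab or j == i + 1:
                tokens.append(text[i:j])
                i = j
                break
    return tokens

# Hypothetical vocabularies, for illustration only.
vocab_a = {" entferne", " ent", "fer", "ne"}  # contains the whole word
vocab_b = {" ent", "fer", "ne"}               # only has the pieces

word = " entferne"
pieces_a = greedy_tokenize(word, vocab_a)
pieces_b = greedy_tokenize(word, vocab_b)

print(pieces_a)
print(pieces_b)
```

With a real model you would instead load its tokenizer and check whether the string encodes to one token ID or to several.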