If you're going to claim the tokenizer is a dictionary, then it doesn't really matter which paper you wrote code for.