The question remains: is the tokenizer going to be a fundamental limit for my task? How do I know ahead of time?
Would it limit a person getting your instructions in Chinese? Tokenisation pretty much means that the LLM is reading symbols instead of phonemes.
This makes me wonder if LLMs work better in Chinese.
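One quick way to get a feel for this is just to run both languages through a tokenizer and look at the pieces. A minimal sketch, assuming you have the tiktoken library installed and using its cl100k_base encoding as an example (not necessarily the tokenizer of whatever model you're targeting):

```python
# Compare how one BPE tokenizer splits an English instruction vs a Chinese one.
# Assumes tiktoken is installed; cl100k_base is one of its standard encodings.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

english = "Please summarise this document."
chinese = "请总结这份文件。"  # roughly the same instruction in Chinese

for label, text in [("English", english), ("Chinese", chinese)]:
    tokens = enc.encode(text)
    # decode each token id individually to see how the text was split
    pieces = [enc.decode([t]) for t in tokens]
    print(f"{label}: {len(tokens)} tokens -> {pieces}")
```

The printed pieces make it obvious that the model sees neither phonemes nor whole words, just whatever chunks the tokenizer's vocabulary happens to contain for each script.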