Hacker News

fooker, yesterday at 10:58 PM

‘Bytes’ is itself a tokenization.

There’s no reason to assume it’s the best solution. A better tokenization scheme may well be needed for models that do math, reasoning, video, and so on.
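
To make the point concrete, here's a rough sketch of what "bytes as tokens" means: encode the text as UTF-8 and treat each byte as a token ID from a 256-entry vocabulary. The function names `byte_tokenize` and `byte_detokenize` are made up for illustration and don't come from any particular library.

```python
def byte_tokenize(text: str) -> list[int]:
    """Treat each UTF-8 byte of the text as one token ID in the range 0..255."""
    return list(text.encode("utf-8"))


def byte_detokenize(tokens: list[int]) -> str:
    """Invert byte_tokenize by reassembling the bytes and decoding them as UTF-8."""
    return bytes(tokens).decode("utf-8")


print(byte_tokenize("hi"))       # two ASCII characters, two tokens
print(len(byte_tokenize("é")))   # one character, but more than one token
```

Even this scheme bakes in choices: the vocabulary is fixed at 256 entries, and a single non-ASCII character gets split across several tokens. That's the sense in which it's one tokenization among many rather than "no tokenization".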