To echo the other replies, the tokenizer is definitely not the bottleneck. It just happens to be the first step in inference, so it's what I implemented first.