Der_Einzige, last Thursday at 7:22 AM

If they solve tokenization, you'll be SHOCKED at how much it was holding back model capabilities. There's tons of work at NeurIPS on tokenizer hacks and alternatives to BPE that massively improve the kinds of math models are bad at (e.g., arithmetic performance).
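
To make the arithmetic point concrete, here is a rough sketch of how a subword vocabulary can chop numbers into inconsistent pieces. Everything in it is an assumption for illustration: `TOY_VOCAB` is a made-up vocabulary, and `greedy_tokenize` is a simple longest-match splitter standing in for BPE, not a real BPE implementation or any model's actual tokenizer.

```python
# Hypothetical multi-digit tokens, as if learned from frequency statistics.
TOY_VOCAB = {"12", "123", "34", "99", "100"}


def greedy_tokenize(text, vocab):
    """Split text left to right, always taking the longest vocab match.

    Single characters are always allowed as a fallback. This is a
    simplified stand-in for BPE, not the real merge-based algorithm.
    """
    max_len = max(len(entry) for entry in vocab)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink down to one character.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if length == 1 or piece in vocab:
                tokens.append(piece)
                i += length
                break
    return tokens


def digit_tokenize(text):
    """One token per character, so every digit keeps a fixed position."""
    return list(text)


if __name__ == "__main__":
    for number in ["1234", "2234", "999", "1000"]:
        print(number, greedy_tokenize(number, TOY_VOCAB), digit_tokenize(number))
    # 1234 ['123', '4'] ['1', '2', '3', '4']
    # 2234 ['2', '2', '34'] ['2', '2', '3', '4']
    # 999 ['99', '9'] ['9', '9', '9']
    # 1000 ['100', '0'] ['1', '0', '0', '0']
```

Note how "1234" and "2234" share the same last three digits but get grouped differently by the subword splitter, so place value does not line up across tokens. Per-digit splitting is one of the simpler alternatives and keeps the alignment fixed.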