If you remove the auxiliary tools and just leave the core LLM then strawberry still has an undefined...

Marazan • today at 8:24 AM • 1 reply • view on HN

If you remove the auxiliary tools and just leave the core LLM then strawberry still has an undefined number of `r`s in it.

Replies

p-e-w • today at 8:35 AM

That’s false. Larger LLMs learn token decompositions through their training, and in fact modern training pipelines are designed to occasionally produce uncommon tokenizations (including splitting words into individual characters) for this reason. Frontier models have no trouble spelling words even without tools. Even many mid-sized models can do that.

➕ show 1 reply

alt Hacker News

Replies