logoalt Hacker News

b0a04glyesterday at 2:44 PM0 repliesview on HN

if dagger builds a byte level DFA for special tokens and resolves overlaps via longest match, how does it handle inputs with partial matches at chunk boundaries, say a stream ends mid token like <|endo , does it buffer forward or require lookahead