Hacker News

jryio yesterday at 6:25 PM

1 million tokens is great until you notice the long-context scores fall off a cliff past 256K, and the rest is basically vibes and auto-compacting.


Replies

olliepro yesterday at 11:25 PM

I bet they lack good long-context training data and need to start a flywheel of collecting it via their API (from willing customers).