Hacker News

jballanc, yesterday at 10:09 PM

We need benchmarks that can distinguish between continuous learning and long-context extrapolation.