logoalt Hacker News

heyethantoday at 6:35 AM0 repliesview on HN

Looks like a model size issue, but the behavior already seems largely shaped by the data distribution.