Hacker News

throwaway2027, yesterday at 9:15 PM

I wonder if at this point they read up on what people use for benchmarking and specifically train it to do well on those tasks.