As an outsider can anyone enlighten me how this squares with the news that models that adapt similar LLM architecture can obtain silver medal in mathematical olympiad?
careful statistical massaging, maybe.
would you pick only winning results and only present favorable, massaged results if it got you 150+B USD of worth?
careful statistical massaging, maybe.
would you pick only winning results and only present favorable, massaged results if it got you 150+B USD of worth?