I doubt anyone I know who is using llms outside of work knows that there are benchmark tests for these models.