A year ago it felt like SoTA model developers were not improving so much as moving the dirt around. Maybe we’re in another such rut.