> Do they outperform?
For the size/performance yes.
> In any case, they wouldn't exist if not for superior models they were distilled from.
So? Those models wouldn't exist without the sum total of human knowledge. As long as a work is transformative why does it matter?