> visual similarity
> SigLIP 2
Maybe visual-semantic similarity is more appropriate? Nonetheless the design is fantastic