They're using perceptual hashing, not cryptographic hashing of raw pixels, so the hash is robust to (not strictly invariant under) variable bitrate, compression, etc.: similar-looking frames give identical or nearby hashes.
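To make the difference concrete, here's a rough sketch of one of the simplest perceptual hashes, an "average hash". This is my own toy illustration, not necessarily what they actually run: the 8x8 grayscale grid, the `average_hash` and `hamming` helpers, and the fake "compression noise" are all assumptions made up for the example.

```python
import hashlib

def average_hash(pixels):
    """Toy average hash: one bit per pixel, set if the pixel is brighter than the mean.

    `pixels` is a 2D list of grayscale values, assumed already downscaled (e.g. 8x8).
    """
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits

def hamming(a, b):
    """Number of differing bits between two integer hashes."""
    return bin(a ^ b).count("1")

# Hypothetical 8x8 "frame": a horizontal brightness gradient.
original = [[x * 32 for x in range(8)] for y in range(8)]

# Simulated re-encoding noise: nudge every pixel by +/-3, clamped to 0..255.
noisy = [
    [min(255, max(0, v + (3 if (x + y) % 2 else -3))) for x, v in enumerate(row)]
    for y, row in enumerate(original)
]

# Perceptual hashes: compared by Hamming distance, small distance = similar image.
distance = hamming(average_hash(original), average_hash(noisy))

# Cryptographic hashes of the raw pixel bytes: any change gives an unrelated digest.
sha_original = hashlib.sha256(bytes(p for row in original for p in row)).hexdigest()
sha_noisy = hashlib.sha256(bytes(p for row in noisy for p in row)).hexdigest()

print("perceptual Hamming distance:", distance)
print("SHA-256 digests equal:", sha_original == sha_noisy)
```

In practice you'd compare perceptual hashes with a distance threshold rather than exact equality, and real systems tend to use something sturdier than a plain average (DCT-based hashes, for instance).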
How does perceptual hashing work?
Have you got any recommendations for further reading on this topic?
Wouldn't LSH (locality-sensitive hashing) make more sense here?