I don't know about the photo-management aspects. However, I've had very good experiences running gemma3 (4b and 12b) locally via ollama.
I've used gemma to process pictures and get descriptions, and also to answer questions about the pictures (e.g. is there a bicycle in the picture?). I haven't tried it for face recognition, but if you have already identified someone in one photo, it can probably tell you whether that person also appears in another photo.
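For what it's worth, here is a minimal sketch of asking a question about one picture through ollama's local HTTP API. It assumes ollama is serving on its default port (11434) and that you have pulled the `gemma3:4b` tag; the helper names `build_request` and `ask_about_image` are made up for illustration, so adapt as needed.

```python
import base64
import json
import urllib.request

# Assumption: ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(image_bytes: bytes, question: str, model: str = "gemma3:4b") -> dict:
    """Build the JSON body for /api/generate with one attached image."""
    return {
        "model": model,  # assumption: this model tag has been pulled
        "prompt": question,
        # Images are sent as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for a single JSON reply instead of a stream
    }


def ask_about_image(path: str, question: str, model: str = "gemma3:4b") -> str:
    """Send one picture plus a question to the local model and return its answer."""
    with open(path, "rb") as f:
        payload = build_request(f.read(), question, model)
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Something like `ask_about_image("photo.jpg", "Is there a bicycle in the picture?")` should then return the model's answer as plain text (the file name here is just a placeholder).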
Just one caveat: if you are processing thousands of pictures, it will take a while to get through them all (depending on your hardware and the picture size). You could also try building a processing pipeline: first extract the faces, or their bounding boxes, with something like opencv, and then pass those crops to gemma3.
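A rough sketch of the face-extraction step, assuming the `opencv-python` package and its bundled Haar cascade for frontal faces. The function names and the 20% margin are my own choices, not anything standard, and a Haar cascade is only one of several detectors you could use here.

```python
def expand_box(x, y, w, h, img_w, img_h, margin=0.2):
    """Pad a face box by `margin` on every side, clamped to the image bounds."""
    pad_x, pad_y = int(w * margin), int(h * margin)
    left = max(x - pad_x, 0)
    top = max(y - pad_y, 0)
    right = min(x + w + pad_x, img_w)
    bottom = min(y + h + pad_y, img_h)
    return left, top, right, bottom


def crop_faces(path, margin=0.2):
    """Return a list of JPEG-encoded face crops found in the image at `path`."""
    import cv2  # imported here so expand_box stays usable without OpenCV

    img = cv2.imread(path)
    if img is None:
        raise FileNotFoundError(f"could not read image: {path}")
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

    # Haar cascade shipped with opencv-python; swap in another detector if preferred.
    detector = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    boxes = detector.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)

    img_h, img_w = img.shape[:2]
    crops = []
    for x, y, w, h in boxes:
        left, top, right, bottom = expand_box(
            int(x), int(y), int(w), int(h), img_w, img_h, margin
        )
        ok, buf = cv2.imencode(".jpg", img[top:bottom, left:right])
        if ok:
            crops.append(buf.tobytes())
    return crops
```

Each entry in the returned list is raw JPEG bytes, so it can be base64-encoded and sent to gemma3 in place of the full picture, which should also cut down the processing time per image.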
Please post the repo link if you ever decide to open source it.
Thanks, nico, for sharing your experience! That's really helpful. The idea of using OpenCV to build a face-detection pipeline before passing the results to Gemma is brilliant; I hadn't thought of that. I'll definitely look into using gemma with ollama.
And for sure, if I get this to a point where it's open-source, I'll post the link here!