Does it work for porn collections too?
Why it’s always the same question? Hahah. I posted my project over Reddit and I got the same one hahah
Last time I tried whisper, it hallucinated an elaborate conversation from sounds of slapping and moaning and it took minutes to spit every single line of it.
Not sure if you’re being sarcastic but I think this is an interesting question. Would deep seek be useful here since it is local?
You'll need a lora for this, porn content rejection is heavy. Or you'll need a abliterated model, not sure if vision also works.
You might want to add something like yolo finetune to detect scenes + face recognition too.