Hacker News

Escapade5160, yesterday at 12:52 AM (1 reply)

From my brief testing in the playground, it is not very good. Maybe it needs better prompting than the one-word examples.


Replies

gpm, yesterday at 1:11 AM

For me it either worked great or not at all. Extracting footsteps, the air conditioner noise, voices, and one particular person's voice (identified by gender) all worked great (across multiple clips for most of those).

A few prompts failed almost entirely though: "train noises", "background noise", and "clatter". So it's definitely sensitive to either the prompting or the kind of noise being extracted.