Hacker News

duped, today at 4:22 AM

What, in theory, makes those "super easy" to isolate? Humans are terrible at this to begin with; it takes years to train one of them to do it even mildly well. Computers are even worse: blind source separation and the cocktail party problem have been the white whale of audio DSP for decades, and only very recently have tools become passable.


Replies

yunwal, today at 4:55 AM

The fact that you can do it with spectral analysis libraries, no LLM required.

This is much easier than source separation. It would be different if I were asking to isolate a violin from a viola or another violin; you'd have to get much more specific about the timbre of each instrument and potentially understand what each instrument's part was.

But a vibrating string produces a very distinctive waveform that is easy to pick out in a file.
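A minimal sketch of the kind of spectral check this could mean, under loud assumptions: the "string" is a synthetic tone with five harmonics at 1/h amplitude, the interfering source is a single sine, and every frequency is picked to land exactly on a DFT bin so no windowing is needed. The names `string_like_tone` and `harmonic_sum_f0` are made up for illustration, and the harmonic-sum scoring is one common pitch-detection heuristic, not necessarily what any particular library does. It only locates the harmonic series in a mix; it does not separate the sources.

```python
import cmath
import math

SAMPLE_RATE = 8192  # Hz; with N = 1024 each DFT bin is exactly 8 Hz wide
N = 1024            # number of samples analysed


def string_like_tone(f0, n_harmonics=5):
    """Synthetic stand-in for a string: harmonics at integer multiples of f0."""
    return [
        sum(math.sin(2 * math.pi * f0 * h * t / SAMPLE_RATE) / h
            for h in range(1, n_harmonics + 1))
        for t in range(N)
    ]


def magnitude_spectrum(signal):
    """Naive DFT magnitudes for bins 0..n/2-1 (slow, but standard library only)."""
    n = len(signal)
    # Precompute the n complex roots of unity once and index into them.
    twiddle = [cmath.exp(-2j * math.pi * i / n) for i in range(n)]
    mags = []
    for k in range(n // 2):
        acc = sum(x * twiddle[(k * t) % n] for t, x in enumerate(signal))
        mags.append(abs(acc) * 2 / n)  # scale so a unit sine reads as 1.0
    return mags


def harmonic_sum_f0(mags, bin_hz, f_min, f_max, n_harmonics=5):
    """Return the candidate fundamental whose harmonic bins hold the most energy."""
    best_bin, best_score = None, -1.0
    for b in range(max(1, int(f_min / bin_hz)), int(f_max / bin_hz) + 1):
        score = sum(mags[b * h]
                    for h in range(1, n_harmonics + 1)
                    if b * h < len(mags))
        if score > best_score:
            best_bin, best_score = b, score
    return best_bin * bin_hz


# Mix the string-like tone (256 Hz) with an unrelated 408 Hz sine.
tone = string_like_tone(256)
interferer = [0.8 * math.sin(2 * math.pi * 408 * t / SAMPLE_RATE) for t in range(N)]
mix = [a + b for a, b in zip(tone, interferer)]

mags = magnitude_spectrum(mix)
estimated_f0 = harmonic_sum_f0(mags, SAMPLE_RATE / N, f_min=100, f_max=1000)
print(estimated_f0)
```

On a real recording the partials would not sit on exact bins, so you would want a window function, a longer FFT, and some tolerance around each harmonic before trusting the result.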
