logoalt Hacker News

janalsncmtoday at 4:51 AM0 repliesview on HN

If it’s anything like the original SAM, thousands of hours of annotator time.

If I had to do it synthetically, take single subjects with a single sound and combine them together. Then train a model to separate them again.