logoalt Hacker News

geuistoday at 9:28 AM0 repliesview on HN

Doesn't work for my use-case. GroundingDINO is a text to bounding box model. SAM2 supports coordinate based masks (user taps or clicks somewhere in an image), which is what my research app needs.