I wish these things were described more clearly. Is this single object tracking or multi object tracking? Just a week ago SAMURAI was posted here, which is kind of the same thing, promising SOTA tracking performance using SAM2. But it only allows single object tracking, which makes it useless for many medical imaging tasks.
If it uses SAM2, it is always most likely single object tracking. What prevents you from running multiple single object tracker in parallel? This would emulate a multi object tracker. If you want to be fancy, you add some logic to handle id switches etc.