Give them real world problems you're encountering and see which can solve them the best, if at all
A full week of that should give you a pretty good idea
Maybe some models just suit particular styles of prompting that do or don't match what you're doing