Or an image detection model. Fraction of the compute and can run even on edge embedded. And easy to train with your own data
Or something like this https://www.youtube.com/watch?v=YZkLQsv3huo
Or something like this https://www.youtube.com/watch?v=YZkLQsv3huo