It's easy to solve at the offline level, where you have time to filter. In fact, this is already done in pre-training by OpenAI and other companies.
You think it's hard?