This is super interesting, thanks for sharing! I did a similar thing a few years ago which I'd been meaning to properly finish and share, and your post was the inspiration needed to make mine public (albeit still in a state much too messy for my liking, hopefully having it public will force me to improve it).
We took fairly different approaches, but I really enjoy the visual explanation element of yours! Well done.
My investigation stemmed from wondering if the seemingly useless 1st, 10th, and 1000th nearest word similarity scores were enough to uniquely ID the word. Turns out—yes, pretty much! It's effectively just a kind of reverse engineering, similar to how you also made your own version of the game. Can definitely improve on a lot.
Tried today's puzzle and got it in two (first was 999/1000).
Here's my code & write up: https://github.com/OisinMoran/Solving-Semantle/blob/main/Sol...