logoalt Hacker News

trevoragilberttoday at 3:03 PM0 repliesview on HN

This is very cool! For the name extraction, how are you handling false positives across such a large dataset? I’m assuming there are mentions that could be a name but are actually just a noun. For example, Agricola being the word for farmer but also a name.