logoalt Hacker News

jmoletoday at 3:23 PM1 replyview on HN

Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.

I don't know what the political undertones are here, but at some level you need to have actual ground truth, including "this person/household declined".

Publishing raw data though? That seems like shooting yourself in the foot from a national security perspective, not to mention all the other reasons not to do it.


Replies

glitchctoday at 3:42 PM

> Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.

It is introduced in the public data, not the secret data.