logoalt Hacker News

dpkirchneryesterday at 7:14 PM1 replyview on HN

They can speak for themselves, you and I don't really know what they want, or what they think counts as "raw" data.


Replies

tonymettoday at 12:00 AM

Regardless, ascii encoding isn’t raw data. You’re making software engineer assumptions. Statistical noise is introduced 4-5 steps before the data is recorded digitally.

Even after it’s digitized, more noise is introduced through recording errors and normalization.

To understand the original distribution, the entire workflow needs to have been recorded