logoalt Hacker News

saberienceyesterday at 12:19 PM2 repliesview on HN

Well, at my old company we had some datasets in the 6-8 PB range, so tell me how we would run analytics on that dataset on an Intel NUC.

Just because you don't have experience of these situations, it doesn't mean they don't exist. There's a reason Hadoop and Spark became synonymous with "big data."


Replies

dapperdrakeyesterday at 2:28 PM

These situations are rare not difficult.

The solutions are well known even to many non-programmers who actually have that problem:

There are also sensor arrays that write 100,000 data points per millisecond. But again, that is a hardware problem not a software problem.