Great article. Hadoop (and similar tools) is for datasets so huge that they don't fit on one machine.
And we can have pretty fucking big single machines right now:
https://www.scylladb.com/2019/12/12/how-scylla-scaled-to-one...
I like this one, where they put a dataset on 80 machines, only for someone to put the same dataset on a single Intel NUC and beat them on query time.
https://altinity.com/blog/2020-1-1-clickhouse-cost-efficienc...
Datasets never become big enough…