But when you index your JSON or CSV into a columnar layout, then if you have say 10 columns, each column is stored contiguously on disk instead of all the fields of a row sitting together. So a scan over one column only needs to read about a tenth of the disk space used for the data. Obviously this depends on the columns' content.
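Here's a toy sketch in Python of what I mean (the file names and the three-field example are made up for illustration, not any particular format):

    import json, os

    # Toy dataset: three records with three fields each.
    rows = [
        {"id": 1, "name": "alice", "score": 9.5},
        {"id": 2, "name": "bob",   "score": 7.2},
        {"id": 3, "name": "carol", "score": 8.8},
    ]

    # Row-oriented: every record is written whole, so reading just "score"
    # still drags the other fields off disk.
    with open("rows.jsonl", "w") as f:
        for r in rows:
            f.write(json.dumps(r) + "\n")

    # Column-oriented: each column lives in its own file, so a scan of
    # "score" touches only that file.
    os.makedirs("cols", exist_ok=True)
    for col in ("id", "name", "score"):
        with open(f"cols/{col}.json", "w") as f:
            json.dump([r[col] for r in rows], f)

    # Compare how many bytes a single-column scan has to read in each layout.
    row_bytes = os.path.getsize("rows.jsonl")
    col_bytes = os.path.getsize("cols/score.json")
    print(f"row-wise scan: {row_bytes} bytes, column-wise scan: {col_bytes} bytes")

Running it shows the score scan touching only a small fraction of the bytes the row-wise scan does, and the gap grows with the number and width of the other columns.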
And you can have a surprisingly large amount of data before the inefficiency you're talking about becomes untenable.