Hacker News

hoerzu 11/07/2024

Can you give an example? Say I have 5 GB of data (2 million rows).

How would it be stored differently for columnar access?


Replies

exAspArk 11/07/2024

We ran some benchmarks (TPC-H, which is designed for OLAP) with ~10M records: https://github.com/BemiHQ/BemiDB#benchmark

The BemiDB storage layer produced ~300MB columnar Parquet files (with ZSTD compression) vs 1.6GB of data in Postgres.
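As a rough illustration of the layout difference (not BemiDB's actual code), here is a minimal stdlib-only Python sketch. The table, its column names, and the `to_columns` / `to_rows` helpers are all made up for this example, JSON stands in for the Parquet encoding, and `zlib` stands in for ZSTD, so the printed sizes say nothing about real Parquet files.

```python
import json
import zlib

# Hypothetical sample table: 1,000 rows with an id, a low-cardinality status, and an amount.
rows = [
    {"id": i, "status": "shipped" if i % 3 else "pending", "amount": (i * 7) % 100}
    for i in range(1000)
]


def to_columns(rows):
    """Pivot a list of row dicts into a dict of column lists (columnar layout)."""
    names = list(rows[0].keys())
    return {name: [row[name] for row in rows] for name in names}


def to_rows(columns):
    """Inverse of to_columns: rebuild the row dicts from the column lists."""
    names = list(columns.keys())
    return [dict(zip(names, values)) for values in zip(*columns.values())]


columns = to_columns(rows)

# An aggregate over one column only needs that column's values,
# instead of scanning every field of every row.
total_amount = sum(columns["amount"])

# Serialize both layouts and compress each one.
# zlib is a stand-in for ZSTD here; JSON is a stand-in for Parquet encoding.
row_bytes = "\n".join(json.dumps(r) for r in rows).encode()
col_bytes = json.dumps(columns).encode()
row_compressed = zlib.compress(row_bytes)
col_compressed = zlib.compress(col_bytes)

print("row layout:     ", len(row_bytes), "->", len(row_compressed), "bytes")
print("columnar layout:", len(col_bytes), "->", len(col_compressed), "bytes")
```

The general idea is that values of the same column sit next to each other, so similar values tend to compress well and an analytical query can skip the columns it doesn't need.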
