Hacker News

Big Data on the Cheapest MacBook

216 points by bcye today at 11:41 AM | 189 comments

Comments

Robdel12 today at 1:01 PM

I’ve been tempted to buy one and do “real dev work” on it just to show people it’s not this handicapped little machine.

I built multiple iOS apps and went through two startup acquisitions with my M1 MBA as my primary computer, as a developer. And the Neo is better than the M1 MBA. I edited my 30-45 minute 4K race videos in FCP on that Air just fine.

montroser today at 12:43 PM

This is as much an indictment of AWS compute as it is anything else.

__mharrison__ today at 1:37 PM

When I teach, I use "big data" for data that won't fit in a single machine. "Small data" fits on a single machine in memory and medium data on disk.

Having said that, DuckDB is awesome. I recently ported a 20-year-old Python app to modern Python. I made the backend swappable, Polars or DuckDB. Got a 40-80x speed improvement. Took 2 days.
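
The small/medium/big distinction in this comment can be written down as a tiny classifier. This is only a sketch: `classify_dataset` is a hypothetical helper, and the 8 GB of RAM and 256 GB of disk are assumed values chosen for illustration, not figures from the thread.

```python
def classify_dataset(size_bytes: int, ram_bytes: int, disk_bytes: int) -> str:
    """Label a dataset by where it fits on one machine (hypothetical helper)."""
    if size_bytes <= ram_bytes:
        return "small"   # fits in memory on a single machine
    if size_bytes <= disk_bytes:
        return "medium"  # too large for memory, but fits on local disk
    return "big"         # does not fit on a single machine


GB = 1024 ** 3
# Illustrative machine: 8 GB of RAM and 256 GB of disk (assumed values).
labels = [classify_dataset(n * GB, 8 * GB, 256 * GB) for n in (4, 100, 1000)]
print(labels)
```

Under these assumptions, a 100 GB dataset counts as "medium" on such a machine, which is the case where an on-disk engine is most useful.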

scottlamb today at 2:53 PM

> The cloud instances have network-attached disks

Props for identifying the issue immediately, but armed with that knowledge, why not redo the benchmark on a different instance type that has local storage? E.g. why not try a `c8id.2xlarge` or `c8id.4xlarge` (which bracket the `c6a.4xlarge`'s cost)?

clamlady today at 1:09 PM

As a broke ecologist, this little computer can do everything I need in R and Word and is a phenomenal build for the price. I'm really enjoying it thus far.

hirako2000 today at 3:06 PM

Would it not also work on a Raspberry Pi?

With I/O streaming and efficient transformation I do big data on my consumer PC and good old cheap HDDs just fine.
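
One way to read "I/O streaming and efficient transformation" is row-at-a-time aggregation, where memory use depends on the number of distinct keys rather than on file size. The sketch below is an assumption about what that could look like, not the commenter's setup: `stream_totals` and the `key`/`amount` column names are hypothetical, and an in-memory string stands in for a large file on disk.

```python
import csv
import io
from collections import defaultdict
from typing import Dict, Iterable


def stream_totals(lines: Iterable[str]) -> Dict[str, float]:
    """Sum 'amount' per 'key', reading one CSV row at a time."""
    totals: Dict[str, float] = defaultdict(float)
    for row in csv.DictReader(lines):
        # Only the running totals are kept; each row is discarded after use.
        totals[row["key"]] += float(row["amount"])
    return dict(totals)


# Stand-in for a large file; open("data.csv", newline="") would work the same way.
sample = io.StringIO("key,amount\na,1.5\nb,2.0\na,0.5\n")
totals = stream_totals(sample)
print(totals)
```

Because the reader pulls lines lazily, the same function works on a file far larger than RAM, at the cost of a single sequential pass over the disk.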

refactor_master today at 1:08 PM

I think it’s relevant to first read [1] to see why they’re doing this. It’s basically done as a meme.

[1] https://motherduck.com/blog/big-data-is-dead/

1a527dd5 today at 1:59 PM

I adore DuckDB.

Did a PoC on an AWS Lambda for data that was gzipped in an S3 bucket.

It was able to replace about 400 C# LoC with about 10 lines.

Amazing little bit of kit.

ody4242 today at 12:49 PM

I would have benchmarked with an instance that has local NVMe, like c8gd.4xlarge.

devnotes77 today at 2:06 PM

The DuckDB team benchmarked with an r7i.16xlarge which uses EBS - that's the expected bottleneck. A fairer comparison would be an i4i or c8gd with local NVMe, where you'd likely see the laptop and cloud instance much closer in practice.

onlyrealcuzzo today at 1:29 PM

This is awesome.

I wish more companies would do showcases like this of what kind of load you can expect from commodity-ish hardware.

tasuki today at 1:30 PM

That's not Big Data. If you "need to process Big Data on the move" - what you need is a network.

butILoveLife today at 2:16 PM

You could get a laptop with an Nvidia GPU, 16 GB RAM, 512 GB SSD... or a 'cheap' MacBook.

I totally understand if you need to compile for iPhones. We need to make apps for the lower- and middle-class people who think a $40/mo cellphone is a status symbol. I get it.

But if you are not... why? I hate Windows, but we have Fedora... and you get an Nvidia GPU. Is it just a status symbol? And I have a hard time believing people who tell me stories about low power consumption, because no one cared about that until Apple pretended people cared about it.

alex_creates today at 1:58 PM

Funny, just yesterday I almost bought one but got cold feet and opted for a low-range MacBook with an M5 chip. The Apple sales rep was not convinced it would be enough when I described using it for vibecoding and deploying, so kind of talked me out of getting the Neo. I normally use a mix of LLMs, then connect to GitHub and do a one-click deploy on CreateOS. Do you think I overreacted? The price of the Neo is SO attractive, a clean half price compared to what I got.

nicoritschel today at 1:12 PM

> compared to 3–5 GB/s

Their numbers are a bit outdated. M5 MacBook Pro SSDs are literally 5x this speed. It's wild.

tosh today at 1:01 PM

For the TPC-DS results it would also have been nice to show how the MacBook Neo compares to the AWS instances.

Or am I missing something?

zipping1549 today at 12:57 PM

> TL;DR: How does the latest entry-level MacBook perform on database workloads? We benchmarked it to find out.

That's not a TL;DR, that's just a subheader.

aaronharnly today at 1:51 PM

That c8g.metal-48xl instance costs $7.63008 per hour on demand [1], so for the price of the laptop, you could run queries on it for about 90 hours.

:shrug: as to whether that makes the laptop or the giant instance the better place to do one's work…

[1] https://aws.amazon.com/ec2/pricing/on-demand/
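
The arithmetic behind the "about 90 hours" figure can be sketched as follows, assuming the quoted $7.63008 is an hourly on-demand rate. The function names are hypothetical, and no laptop price is assumed: the snippet only converts between hours and dollars at that rate.

```python
HOURLY_RATE_USD = 7.63008  # on-demand rate quoted in the comment (assumed per hour)


def on_demand_cost(hours: float, rate: float = HOURLY_RATE_USD) -> float:
    """Dollars spent running the instance for the given number of hours."""
    return hours * rate


def hours_for_budget(budget_usd: float, rate: float = HOURLY_RATE_USD) -> float:
    """Hours of instance time that a fixed budget buys at the given rate."""
    return budget_usd / rate


cost_90h = on_demand_cost(90)
print(f"90 hours costs ${cost_90h:.2f}")
```

At this rate, 90 hours comes to roughly $687 of instance time, so the comparison turns on whether the work needs more or fewer than about 90 hours of compute.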

varispeed today at 1:12 PM

If you can fit it on a thumb drive, it's not Big Data.

TacticalCoder today at 1:04 PM

I'm interested in one (not for big data) but only 8 GB of RAM is kinda really sad.

My good old LG Gram (from 2017? 2015? don't even remember) already had 24 GB of RAM. That was 10 years ago.

A decade later I cannot see myself buying a laptop with 1/3rd the memory.

fnord77 today at 2:25 PM

this has a phone CPU/memory

pella today at 2:29 PM

other test:

2025-09-08 : "Big Data on the Move: DuckDB on the Framework Laptop 13"

"TL;DR: We put DuckDB through its paces on a 12-core ultrabook with 128 GB RAM, running TPC-H queries up to SF10,000."

https://duckdb.org/2025/09/08/duckdb-on-the-framework-laptop...

BoredPositron today at 12:32 PM

Cue the endless blog posts about running tech on the potato MacBook and being stunned it’s functional with massive trade-offs. Groundbreaking stuff.

opentokix today at 12:34 PM

Mind blown: if you need to handle "big" data on the move, the MacBook Neo is not the right choice. Who would have guessed that outcome?

hermanzegerman today at 12:20 PM

That's an awesome idea to get a bricked MacBook Neo really fast because those idiots soldered the SSD inside

ramgale today at 1:03 PM

Seems completely unnecessary; there is probably zero overlap between people who buy a cheap MacBook and people running DuckDB locally.

TutleCpt today at 12:14 PM

Oh great, the term "big data" is back.

evanjrowley today at 2:04 PM

>Can I expect good performance from the MacBook Neo with Slack, Microsoft Office, and Google Chrome signed into Atlassian and a CRM, all running simultaneously?

No.

>Do I reject a world where all of the above is necessary to realize value from an entry-level MacBook?

In theory, yes.