logoalt Hacker News

ImHereToVoteyesterday at 8:15 AM1 replyview on HN

I wonder how much GPU compute you would need to create a public domain version of this. This would be a really valuable for the general public.


Replies

wongarsuyesterday at 10:43 AM

To get a single knowledge-cutoff they spent 16.5h wall-clock hours on a cluster of 128 NVIDIA GH200 GPUs (or 2100 GPU-hours), plus some minor amount of time for finetuning. The prerelease_notes.md in the repo is a great description on how one would achieve that

show 1 reply