logoalt Hacker News

wielandbr01/20/20250 repliesview on HN

I am curious about the rough compute budget they used for training DeepSeek-R1. I couldn't find anything in their report. Anyone having more information on this?