logoalt Hacker News

mmaundertoday at 2:53 PM0 repliesview on HN

That's between 1 and 10 training runs on a large foundational model, depending on pricing discounts and how much they manage to optimize it. I priced this out last night on AWS, which is admittedly expensive, but models have also gotten larger.