I would honestly guess that this is just a small amount of tweaking on top of the Sonnet 4.x models. It seems like providers are rarely training new 'base' models anymore. We're at a point where the gains come more from tweaking the model's architecture and doing post-training refinement. That's what we've been seeing for the past 12-18 months, iirc.