Hacker News

interestpiqued, yesterday at 8:46 PM

Release date seems like a terrible x-axis given how much more compute they are using. Not to mention, while I like what METR is trying to measure, it is an uber-specific metric. And frankly, this is just me complaining, but I feel their prompts do most of the work for the AI. I've never gotten instructions as detailed as the ones they give the AI for the task.


Replies

HDBaseT, today at 2:52 AM

Whilst true, even if you had had unlimited compute 5 years ago, we wouldn't have been anywhere near Mythos level, purely because the technology behind the models wasn't refined enough.