Previewing GPT‑5.6 Sol: a next-generation model

729 points • by minimaxir • today at 5:06 PM • 454 comments

System card: https://deploymentsafety.openai.com/gpt-5-6-preview

Comments

Would love to see benchmarks on cognition's FrontierCode

GodelNumbering • today at 6:31 PM

I do not like the fact that this forces people to remember one more hierarchy of "Sol vs Terra vs Luna". OpenAI was supposed to simplify their naming since at least 2025.

osti • today at 5:14 PM

Sol? Looks like OpenAI is jealous of Anthropic's good model-naming ability and wants to emulate it.

josefrichter • today at 9:21 PM

Sol, Terra, Luna – crypto disaster vibes

arendtio • today at 5:25 PM

I didn't know that I was color blind, but thanks to those charts, I think I need to see a doctor...

I mean, you can read them even without the colors, but who on earth thought that those are a good set of colors? Oh, I forgot it was probably someone on 'Sol'.

nopakos • today at 6:09 PM

People were mocking the EU for regulations and now this is happening in the US. I know that Europe is behind in AI but still...

micimize • today at 6:31 PM

Haven't we established defensive and offensive security usage are intractably entangled? I.e. "patch all [security] bugs, make no mistakes" gives one a list of potential exploits to hand off to less capable models.

Doesn't that undermine all good-faith discourse on cybersecurity safeguards, controlled usage etc? Or is that overstating the case (I'm not a security researcher myself so kinda parroting).

duggan • today at 5:30 PM

> As part of our ongoing engagement with the U.S. government, we previewed our plans and the models’ capabilities ahead of today’s launch. At their request, we are starting with a limited preview for a small group of trusted partners whose participation has been shared with the government, before releasing more broadly.

The clowns in the US administration can barely remain coherent from one sentence to the next.

Having them be the gatekeepers of technological progress in 2026 is fucking lame.

ddp26 • today at 5:14 PM

I'm going to pre-register my prediction that GPT-5.6 Sol is significantly behind Claude Fable 5, as evaluated by general consensus once time has passed for people to get familiar with both.

phplovesong • today at 7:58 PM

Is there any model that rivals Opus or Fable? I would like to try something else, as Anthropic is pretty suss.

kissgyorgy • today at 7:57 PM

    we expect substantial benefit for legitimate defensive work, while meaningfully constraining prohibited offensive use.

That's literally impossible. Writing an exploit against a known vulnerability needs the exact same knowledge as defending against the exploit of the same vulnerability.

Also, just making the model better at code is just making it better at writing offensive code.

hereme888 • today at 6:26 PM

Seems like OpenAI's strategy to release models after Anthropic has been paying off.

Is it just me, or does it seem like Anthropic has been more of a pioneer the past few years, and OpenAI tries to copy features they like?

simianwords • today at 6:15 PM

No comments on the cerebras version that might finally enable intelligent voice mode instead of being stuck with 4o-mini class

h4x0rr • today at 8:30 PM

FUCK the US government. That's it, I am rooting for China now

rvz • today at 5:11 PM

Other than the worst naming I have ever seen (Sol / Terra / Luna), the pricing is still expensive:

> GPT‑5.6 is priced per 1M tokens across three model sizes:

> Sol is $5 input / $30 output;

> Terra is $2.50 input / $15 output

> Luna is $1 input / $6 output.

The OpenAI casino has never been more ready to take your money on gambling even more tokens.
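For anyone who wants to sanity-check what those rates mean for a given workload, here is a minimal sketch of the arithmetic. It uses only the per-1M-token prices quoted above; the `PRICES` table and the `request_cost` helper are illustrative names, not part of any OpenAI SDK, and the sketch ignores caching and any other billing adjustments.

```python
# Prices in USD per 1M tokens, as quoted in the comment above: (input, output).
PRICES = {
    "Sol": (5.00, 30.00),
    "Terra": (2.50, 15.00),
    "Luna": (1.00, 6.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of one request at the quoted list prices.

    Hypothetical helper: assumes flat per-token billing with no caching
    or volume discounts.
    """
    input_rate, output_rate = PRICES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    # Compare the three tiers on the same workload.
    for name in PRICES:
        print(name, request_cost(name, input_tokens=200_000, output_tokens=50_000))
```

At these rates, one million input tokens plus one million output tokens on Sol comes to $35, against $7 on Luna.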

ChrisArchitect • today at 5:09 PM

Pre-official discussions:

https://news.ycombinator.com/item?id=48678789

https://news.ycombinator.com/item?id=48683021

throwitaway222 • today at 7:16 PM

Sun Earth Moon

thesurlydev • today at 5:21 PM

Not really news until it's widely available.

Anyone know the latest around Fable being re-released after gov smackdown?

moomin • today at 6:53 PM

The language used in this press release is borderline hilarious. It’s simultaneously trying to tell you how great it is while also telling you it’s not THAT great. Nothing to worry about, move along.

simianwords • today at 5:37 PM

Thoughts

1. Naming convention is copied from Anthropic and honestly is more catchy than a number (amongst normal people)

2. How in the world did Anthropic have to do all the theatrics about Mythos just to have OpenAI release an equivalent or stronger model a month later without any drama???

3. Cheaper models just don’t fit any use case imo and OpenAI knows it so they keep increasing the floor - I’m still convinced task per capability is reduced with each release

4. How in the world would open source models keep up with the multi layer security? Either this security is all theater or we will finally see a ceiling in open source models because by definition they can’t have those protections

5. Cybersecurity things are boring to me because it’s all zero sum cat and mouse games

submeta • today at 5:30 PM

Are GPT 5.5 and Opus 4.8 the last models we're going to be allowed to use in Europe? Is there going to be a cut, and will we only be allowed to use less capable models outside of the US?

I mean, if they deem Fable 5 too powerful to share with the rest of the world, what's left for us?

oofbey • today at 10:32 PM

Another year, and OpenAI comes up with yet another naming scheme for their models. First it was integers (GPT2, GPT3). Then they added friendly names (remember Ada, Babbage, Curie, Davinci?), but decided against it. Instead we got dot integers (GPT3.5), then letter-number modifiers (o1), plus word modifiers like o1-pro, o3-mini, or -mini-high, or codex, codex-max, Pro, etc.

Now they've got friendly cosmic names. And this time they want us to believe that this time they're gonna stick to a naming convention? I'll believe it when they do 3 releases in a row without inventing a new naming scheme.

nubg • today at 9:33 PM

A question I always have is, how do the AI labs safeguard against leaks of their models? Training a cutting-edge model basically costs a minimum of hundreds of millions of dollars. And it's all contained within a file. Okay, that file might be 500GB large, but it's still just one blob that is worth almost a billion dollars. And they need to train new models every few weeks, have lots of people with access to it to debug it, run inference etc. I wonder when we will see the first leaks? Imagine if e.g. Opus 4.8 got leaked. Wouldn't that bankrupt Anthropic?

urig • today at 8:31 PM

It's only next generation? Anthropic has frontier models! lol

meetpateltech • today at 5:44 PM

Another model family, another naming scheme to get used to.

Sol Ultra ≈ Pro

Sol ≈ Standard

Terra ≈ Mini

Luna ≈ Nano

BoorishBears • today at 7:12 PM

> For GPT‑5.6 and later models, cache writes are billed at 1.25x the model’s uncached input rate, while cache reads continue to receive the 90% cached-input discount.

Not them joining Anthropic with this bullshit. *

Caching infrastructure is already a leaky abstraction over a feature that is not as reliable or debuggable to the end user as it should be, charging for the 'privilege' of interacting with it is really annoying.

(* for reference on 'this bullshit': ChatGPT previously didn't require anything special for a basic level of caching. Unless you wanted extended cache times, it'd just "do the right thing" and try to use nodes that had your prefix already cached in memory)
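To put numbers on the quoted billing change, here is a rough sketch of when paying the cache-write premium pays off. The 1.25x write multiplier and the 90% read discount come from the quote above; everything else is an assumption made for illustration: that the first request carrying a prefix is billed entirely as a cache write, that every later request is a cache hit, and that the multiplier replaces the normal input rate rather than being added to it. `prefix_cost` is a hypothetical helper, not an API.

```python
CACHE_WRITE_MULT = 1.25  # from the quote: writes billed at 1.25x the uncached input rate
CACHE_READ_MULT = 0.10   # from the quote: reads get a 90% discount


def prefix_cost(rate_per_million: float, prefix_tokens: int,
                requests: int, cached: bool) -> float:
    """USD cost of sending the same prompt prefix `requests` times.

    Assumption (not confirmed by the source): the first request is billed
    as a cache write and every subsequent request is a cache hit.
    """
    if requests <= 0:
        return 0.0
    base = prefix_tokens * rate_per_million / 1_000_000
    if not cached:
        return base * requests
    return base * (CACHE_WRITE_MULT + CACHE_READ_MULT * (requests - 1))


if __name__ == "__main__":
    # Example rate of $5 per 1M input tokens, 100k-token shared prefix.
    for n in (1, 2, 5):
        print(n,
              prefix_cost(5.0, 100_000, n, cached=False),
              prefix_cost(5.0, 100_000, n, cached=True))
```

Under those assumptions a prefix that is never reused costs 25% more than before, and the premium is recovered as soon as the prefix is hit once: 1.25 + 0.1 = 1.35x versus 2x for two uncached requests. The complaint still stands for workloads where cache hits are unreliable, since the write premium is paid either way.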

ALittleLight • today at 5:45 PM

I hate not being able to use the latest models. There needs to be a much faster resolution to whatever is happening with the federal government.

da_grift_shift • today at 5:27 PM

    Flagged activity can also trigger account-level review across relevant conversations and risk signals, consistent with our terms and policies around content retention and review. Looking beyond a single conversation helps our systems distinguish persistent malicious behavior from legitimate dual-use security work, where similar technical concepts may appear in very different contexts.

Fascinating!

Every conversation you have with these "more capable" models will be monitored and joined up and then your entire account might one day be tagged as Distiller or Cyber Threat Actor or whatnot. When combined with identity verification (which isn't discussed in this press release), expect people to be falsely flagged and banned from ever using OpenAI models again.

Wish I could find the thread from last week where discussions of exactly this kind of thing were dismissed as daft and outlandish.

masonwan • today at 6:34 PM

Guess it's just another price bump hidden behind output token speed.

wonkyfruit • today at 7:07 PM

TLDR - It's not quite Mythos but it uses about 5 times less tokens, and those tokens are also cheaper?

https://pbs.twimg.com/media/HLwuJLvbwAAOfQZ?format=jpg&name=...

nakedrobot2 • today at 5:14 PM

This is disgusting groveling to the Orange Shit Stain.

Beam me up Scotty. No intelligent life forms on this planet.

CurbStomper • today at 7:04 PM

Could not care less.

andrewlin247 • today at 5:40 PM

they're trying to be anthropic with these model names

ericyd • today at 7:06 PM

whoa, a new model that surpasses benchmarks of other models? wild.

johnnyApplePRNG • today at 5:53 PM

Doesn't it strike anyone as strange that SOL, TERRA, and LUNA are all quasi-scam crypto tickers?

throwitaway222 • today at 7:22 PM

Time to create more LLM based startups.

  * House design plans from prompts
  * Government surveillance of public communication
  * Extracting world/spatial concepts from language models (do we really need a world/spatial models now?)
  * Driverless City planning startups
  * Election vote rigging/harvesting startups
  * Video game NPC backstory startups (all NPCs in GTA 6 go to work, go home, shower, go to sleep now?)

Keep moving don't doom.

JohnRoseDev • today at 7:14 PM

I can’t help but think that these benchmarks are completely fake. Sam even posted a benchmark on X a couple days ago of how the ‘complete version’ of 5.5 cyber was already ahead of Mythos apparently. This just feels like absolutely fake nonsense. The impact of Mythos on the industry was clear and in front of everyone’s eyes. The number of vulnerabilities Mozilla fixed. The vulnerabilities and exploits Anthropic showcased in that blog post about the Chrome sandbox escape etc. And now we’re supposed to believe this 5.5 cyber is already ahead of Mythos, ok. And yeah, GPT 5.6 is even further ahead, alright.

