Hacker News

simonw yesterday at 7:24 PM (14 replies)

This doesn't have API access yet, but OpenAI seem to approve of the Codex API backdoor used by OpenClaw these days... https://twitter.com/steipete/status/2046775849769148838 and https://twitter.com/romainhuet/status/2038699202834841962

And that backdoor API has GPT-5.5.

So here's a pelican: https://simonwillison.net/2026/Apr/23/gpt-5-5/#and-some-peli...

I used this new plugin for LLM: https://github.com/simonw/llm-openai-via-codex

UPDATE: I got a much better pelican by setting the reasoning effort to xhigh: https://gist.github.com/simonw/a6168e4165a258e4d664aeae8e602...


Replies

DrProtic yesterday at 7:28 PM

That pelican you posted yesterday from a local model looks nicer than this one.

Edit: this one has crossed legs lol

GistNoesis yesterday at 8:34 PM

Isn't it awful? After 5.5 versions it still can't draw a basic bike frame. How is the front wheel supposed to turn sideways?

Schlagbohrer yesterday at 9:52 PM

It's amazing that the default did that much in just 39 "reasoning tokens" (no idea what a reasoning token is, but that's still shockingly few tokens).

XCSme yesterday at 7:38 PM

Is this direct API usage allowed by their terms? I remember Anthropic really not liking such usage.

deflator yesterday at 7:42 PM

Hmm. Any idea why it's so much worse than the other ones you have posted lately? Even the open-weight local models were much better, like the Qwen one you posted yesterday.

noonething yesterday at 9:48 PM

Thank you for doing all this. It's appreciated.

postalcoder yesterday at 7:42 PM

I made pelicans at different thinking efforts:

https://hcker.news/pelican-low.svg

https://hcker.news/pelican-medium.svg

https://hcker.news/pelican-high.svg

https://hcker.news/pelican-xhigh.svg

Someone needs to make a pelican arena; I have no idea if these are considered good or not.

andriy_koval yesterday at 7:41 PM

What is your setup for drawing the pelican? Do you ask the model to check the generated image, find issues, and iterate on it, which would demonstrate the model's real abilities?

droidjj yesterday at 7:31 PM

It's... like no pelican I've ever seen before.

gpm yesterday at 8:27 PM

I for one delight in bicycles where neither wheel can turn!

It continues to amaze me that these models, which definitely know what bicycle geometry actually looks like somewhere in their weights, produce such implausibly bad geometry.

Also mildly interesting, and generally consistent with my experience with LLMs, that it produced the same obvious geometry issue both times.

SkyBelow yesterday at 8:15 PM

Wait, I thought we had moved on to raccoons on e-scooters to avoid (some of) the issues with Goodhart's Law coming into play.

rolymath yesterday at 8:17 PM

Exciting. Another Pelican post.

dakolli yesterday at 8:23 PM

You know they are 1000% training these models to draw pelicans; this hasn't been a valid benchmark for 6+ months.

sjdv1982 yesterday at 8:15 PM

At some point, OpenAI is going to cheat and hardcode a pelican on a bicycle into the model. 3D modelling has Suzanne and the teapot; LLMs will have the pelican.