logoalt Hacker News

willahmadyesterday at 5:20 PM2 repliesview on HN

I think this benchmark could be slightly misleading to assess coding model. But still very good result.

Yes, SVG is code, but not in a sense of executable with verifiable inputs and outputs.


Replies

jstummbilligyesterday at 6:35 PM

I love that we are earnestly contemplating the merits of the pelican benchmark. What a timeline.

show 1 reply
hdjrudnitoday at 2:56 AM

But it does have a verifiable output, no more or less than HTML+CSS. Not sure what you mean by "input" -- it's not a function that takes in parameters if that's what you're getting at, but not every app does.