The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

109 points • by freediver • yesterday at 12:09 AM • 94 comments

Comments

First model I've tried that gave me back HTML with a "Change Pelican Color" button: https://static.simonwillison.net/static/2026/hy3-preview-pel...

(Transcript: https://gist.github.com/simonw/c2a0d8ecd3056a2681319eae8fc3f...)

zone411 • yesterday at 5:10 AM

I’ve tested this model on four of my benchmarks:

https://github.com/lechmazur/buyout_game 10th out of 36.

https://github.com/lechmazur/pact/ 14th out of 25.

https://github.com/lechmazur/nyt-connections/ 60th out of 81.

https://github.com/lechmazur/debate 16th out of 29.

Aurornis • yesterday at 2:07 AM

> Two new models are now beating LLM darling Claude in terms of token usage and by more than 50%?

Time for a reminder that OpenRouter leaderboards only show tokens sent through OpenRouter, which most Anthropic API users don’t use.

simonw • yesterday at 4:21 AM

OpenRouter rankings frustrate me, because they show the total number of tokens but they provide no indication of how many unique users a model has.

Which means if a surprise model tops the leaderboard one week we can never be sure if it was because a single whale user pushing billions of tokens a day switched to it, or if it represents a genuine community trend towards that model.
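
To make the distinction concrete, here is a minimal sketch with entirely made-up numbers. The usage log, model names, and user IDs are hypothetical, and OpenRouter does not publish per-user data like this. It only shows how a ranking by total tokens and a ranking by unique users can disagree when one heavy user dominates.

```python
from collections import defaultdict

# Hypothetical usage log of (model, user_id, tokens). All values are invented.
usage = [("model-a", "whale", 9_000_000_000), ("model-a", "u0", 1_000_000)]
# 500 ordinary users of model-b, 2 million tokens each.
usage += [("model-b", f"u{i}", 2_000_000) for i in range(500)]


def summarize(records):
    """Aggregate total tokens and unique-user counts per model."""
    totals = defaultdict(int)
    users = defaultdict(set)
    for model, user, tokens in records:
        totals[model] += tokens
        users[model].add(user)
    return {
        m: {"tokens": totals[m], "unique_users": len(users[m])}
        for m in totals
    }


def top_by(summary, key):
    """Return the model with the highest value for the given metric."""
    return max(summary, key=lambda m: summary[m][key])


summary = summarize(usage)
print("Top by tokens:", top_by(summary, "tokens"))
print("Top by unique users:", top_by(summary, "unique_users"))
```

With these invented inputs the token leaderboard is topped by the model with two users, while the other model has 500.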

andai • yesterday at 1:22 AM

So basically, Hy3 is the cheapest decent model on OpenRouter, unless you use DeepSeek as the provider for DeepSeek V4 Flash, in which case DeepSeek's insane caching wins out. (And Hy3 is close-ish on the benchmarks.)

cicko • yesterday at 6:15 AM

How is it a "mysterious" model? It's Tencent's Hy3?

0xbadcafebee • yesterday at 2:35 AM

> it makes sense that a cheaper model would prevail, but only if it offered similar quality

You're trying to think logically, which has no place in an AI discussion. :) People just jump to whatever the latest model is. Plenty of people also prefer price to "quality" (which is very subjective). It's new, it's cheap, so people use it. It's likely people will stop using it when something else is cheaper and/or newer.

sheepscreek • today at 1:23 AM

FYI - DeepSeek has NOT announced its own coding platform. That app is an independent project. It says so in the footer as well:

“Independent open-source project · not affiliated with DeepSeek”

alecco • yesterday at 6:40 AM

PSA: Don't use OpenRouter for DeepSeek V4, as it messes up your caching. Use the DeepSeek API directly and you'll get 2x to 3x more cached tokens.
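
A rough sketch of why the cache hit rate matters for cost. The prices and hit rates below are placeholders chosen for illustration, not DeepSeek's or OpenRouter's actual numbers; the function name is hypothetical too. It computes a blended input price as a weighted average of the cached and uncached rates.

```python
def blended_input_price(miss_price, hit_price, hit_rate):
    """Average input price per million tokens for a given cache hit rate.

    miss_price: price per million uncached input tokens
    hit_price:  price per million cached input tokens
    hit_rate:   fraction of input tokens served from cache (0.0 to 1.0)
    """
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    return hit_rate * hit_price + (1.0 - hit_rate) * miss_price


# Placeholder prices (assumptions, not real pricing).
MISS_PRICE = 1.00  # per million uncached input tokens
HIT_PRICE = 0.10   # per million cached input tokens

# Assumed hit rates: 30% through a router, 2.5x that when calling directly.
via_router = blended_input_price(MISS_PRICE, HIT_PRICE, 0.30)
direct = blended_input_price(MISS_PRICE, HIT_PRICE, 0.75)

print(f"via router: {via_router:.3f} per million input tokens")
print(f"direct:     {direct:.3f} per million input tokens")
```

Under these assumed numbers, going from a 30% to a 75% hit rate cuts the blended input price by more than half.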

vessenes • yesterday at 3:18 AM

Since there’s only one inference provider it could be a recycling/ad experiment. The similar usage between trial and paid periods would be explained by this as well.

gmerc • yesterday at 9:59 AM

Very mysterious: https://huggingface.co/tencent/Hy3-preview

lithiumii • yesterday at 6:51 AM

What's so mysterious? Isn't it from Tencent?

thot_experiment • yesterday at 5:47 AM

Tried this extensively in OpenCode and haven't used it once since Gemma 4 came out. It got into thought loops and made stupid edits I didn't ask for more often than the local 31b model. One of the worst "frontier" models I've ever tried.

freakynit • yesterday at 4:50 AM

This was originally a 400+B-parameter model that was later reduced to 295B, which was considered the "optimal zone".

https://www.mdshare.online/s/uend0pj3og_A_rgcxzINf

segmondy • yesterday at 12:40 PM

High token usage cuz it's free doesn't count

bandrami • yesterday at 4:38 AM

For the life of me I will never understand the thought process that leads you to say "we don't really know who developed this LLM but I'm going to feed all of my business's data to it"
