GPT-4 at $24.7 per million tokens vs Mixtral at $0.24 - that's a 100x cost difference! Even if ...

pbd • yesterday at 5:49 PM • 5 replies • view on HN

GPT-4 at $24.7 per million tokens vs Mixtral at $0.24 - that's a 100x cost difference! Even if routing gets it wrong 20% of the time, the economics still work. But the real question is how you measure 'performance' - user satisfaction doesn't always correlate with technical metrics.

Replies

FINDarkside • yesterday at 6:26 PM

It's trivial to get better score than GPT-4 with 1% of the cost by using my propertiary routing algorithm that routes all requests to Gemini 2.5 Flash. It's called GASP (Gemini Always, Save Pennies)

➕ show 1 reply

simpaticoder • yesterday at 7:27 PM

PPT (price-per-token) is insufficient to compute cost. You will also need to know an average tokens-per-interaction (TPI). They multiply to give you a cost estimate. A .01x PPT is wiped out by 100x TPI.

➕ show 1 reply

Keyframe • yesterday at 6:04 PM

number of complaints / million tokens?

mkoubaa • yesterday at 7:28 PM

> How you measure 'performance'

I heard the best way is through valuations

pqtyw • yesterday at 6:20 PM

> GPT-4 at $24.7 per million tokens

While technically true why would you want to use it when OpenAI itself provides a bunch of many times cheaper and better models?

➕ show 1 reply

alt Hacker News

Replies