logoalt Hacker News

pbdyesterday at 5:49 PM5 repliesview on HN

GPT-4 at $24.7 per million tokens vs Mixtral at $0.24 - that's a 100x cost difference! Even if routing gets it wrong 20% of the time, the economics still work. But the real question is how you measure 'performance' - user satisfaction doesn't always correlate with technical metrics.


Replies

FINDarksideyesterday at 6:26 PM

It's trivial to get better score than GPT-4 with 1% of the cost by using my propertiary routing algorithm that routes all requests to Gemini 2.5 Flash. It's called GASP (Gemini Always, Save Pennies)

show 1 reply
simpaticoderyesterday at 7:27 PM

PPT (price-per-token) is insufficient to compute cost. You will also need to know an average tokens-per-interaction (TPI). They multiply to give you a cost estimate. A .01x PPT is wiped out by 100x TPI.

show 1 reply
Keyframeyesterday at 6:04 PM

number of complaints / million tokens?

mkoubaayesterday at 7:28 PM

> How you measure 'performance'

I heard the best way is through valuations

pqtywyesterday at 6:20 PM

> GPT-4 at $24.7 per million tokens

While technically true why would you want to use it when OpenAI itself provides a bunch of many times cheaper and better models?

show 1 reply