> I wonder if we are at a point where the focus can shift to improving the cost of inference. T...

vlovich123 • yesterday at 11:50 PM • 1 reply • view on HN

> I wonder if we are at a point where the focus can shift to improving the cost of inference.

There's always working on improving the cost of inference, but I don't think this is an area of R&D that will slow down. The reason is:

1. A better competitor model risks eating away at how much they can charge for inference (i.e. revenue) 2. Whoever unlocks AGI will unlock even more growth 3. Even when you unlock AGI, you'll want to throw gobs of money at it to improve itself and all sorts of things.

> If Fable comes out and demands 50x the price of DeepSeek in order for Anthropic to make a profit on it, how much more productive would I be compared to my personal experience + DeepSeek? 3x? 50x?

You're pricing it wrong and looking at it wrong. First, the per token price doesn't consider that a smarter model can end up using fewer tokens overall to achieve a result. Secondly, if the difference is between failing to accomplish the task and accomplishing the task, suddenly that 50x can seem like a bargain.

> Is it cost effective for a business to hire someone without SWE experience + Fable verses hiring someone with SWE experience and DeepSeek? When does R&D hit diminishing returns?

At this time, someone without SWE experience + <name AI model> vs someone good with SWE experience and <name another AI model> is a no-brainer. The AI model is an accelerant but the "no SWE experience" will be accelerated into a wall. Now maybe that doesn't matter for prototyping and certain other things, but anything in production the lack of experience will hurt them with things they won't even know about or even know how to look for it (e.g. slow, insecure, etc).

alt Hacker News

Replies