I wouldn't be surprised if the implementation is - Turn down the thinking token budget to one...

digiown • yesterday at 11:47 PM • 0 replies • view on HN

I wouldn't be surprised if the implementation is

- Turn down the thinking token budget to one half

- Multiply the thinking tokens by 2 on the usage stats returned

- Phew! Twice the speed

IMO charging for the thinking tokens that you can't see is scam.

alt Hacker News