Fair point but it remains to answer - why isn’t this speed up available in ChatGPT and only in the api?