I just need their API to be faster. 15-30 seconds per request using 4o-mini isn't good enough for responsive applications.
The new Realtime Websocket API appears to send back responses within less than a second. It might be just what you want.
That is odd. Longest I’ve experienced in my use of it is a few seconds.
That doesn’t match my experience using it a lot at all
You should try Azure: it comes with dedicated capacity which is typically a very expensive "call our sales team" feature with OpenAI