Is it 0.003 per minute of audio uploaded, or "compute minute"?
For example fal.ai has a Whisper API endpoint priced at "$0.00125 per compute second" which (at 10-25x realtime) is EXTREMELY cheaper than all the competitors.
I think the point is having it for real-time; this is for conversations rather than transcribing audio files.
I think the point is having it for real-time; this is for conversations rather than transcribing audio files.