
ldenoue, today at 3:54 AM

I developed a stack on Cloudflare Workers where latency is super low, and it is cheap to run at scale thanks to Cloudflare's pricing.

It runs at around 50 cents per hour using AssemblyAI or Deepgram for STT, Gemini Flash as the LLM, and InWorld.ai for TTS (for me it's on par with ElevenLabs, and super fast).
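For readers unfamiliar with this kind of setup, here is a rough sketch of the general STT → LLM → TTS shape the comment describes, plus the cost arithmetic implied by the 50 cents per hour figure. It is not the commenter's implementation: `VoicePipeline`, `handle_turn`, `session_cost_usd`, and the stub stages are all hypothetical names made up for illustration, and no real provider API is called. It is also not streaming, which a low-latency stack would likely need.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures. A real STT, LLM, or TTS provider
# would sit behind each of these; none is called here.
SpeechToText = Callable[[bytes], str]
LanguageModel = Callable[[str], str]
TextToSpeech = Callable[[str], bytes]


@dataclass
class VoicePipeline:
    """One conversational turn: audio in, audio out (assumed shape)."""
    stt: SpeechToText
    llm: LanguageModel
    tts: TextToSpeech

    def handle_turn(self, audio_in: bytes) -> bytes:
        transcript = self.stt(audio_in)    # speech -> text
        reply_text = self.llm(transcript)  # text -> reply text
        return self.tts(reply_text)        # reply text -> speech


def session_cost_usd(minutes: float, usd_per_hour: float = 0.50) -> float:
    """Estimate session cost from the quoted hourly rate."""
    return usd_per_hour * minutes / 60.0


# Stub stages so the sketch runs without any network access.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(stt=fake_stt, llm=fake_llm, tts=fake_tts)
reply_audio = pipeline.handle_turn(b"hello")
half_hour_cost = session_cost_usd(30)
```

Keeping the stages as plain callables means swapping one provider for another (say, one STT vendor for a second) only changes what is passed to the constructor. At the quoted rate, a 30-minute session works out to about 25 cents.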


Replies

pugio, today at 4:43 AM

Do you have anything written up about how you're doing this? Curious to learn more...