ldenoue • today at 3:54 AM • 1 reply

I developed a stack on Cloudflare Workers where latency is super low, and it is cheap to run at scale thanks to Cloudflare's pricing.

It runs at around 50 cents per hour using AssemblyAI or Deepgram for STT, Gemini Flash as the LLM, and InWorld.ai for TTS (for me it's on par with ElevenLabs and super fast).
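
For readers unfamiliar with this kind of setup, here is a rough sketch of the STT → LLM → TTS turn described above, plus the arithmetic behind the quoted ~50 cents per hour. It is not the poster's code: the `VoiceStack` shape, `runTurn`, `estimateCostUsd`, and the stub stages are all hypothetical names made up for illustration, and no real provider API is called.

```typescript
// Hypothetical stage signatures. In a real deployment each stage would call
// a provider (e.g. an STT, LLM, or TTS service) over its own API.
type Stt = (audio: Uint8Array) => Promise<string>;
type Llm = (prompt: string) => Promise<string>;
type Tts = (text: string) => Promise<Uint8Array>;

interface VoiceStack {
  stt: Stt;
  llm: Llm;
  tts: Tts;
}

// One conversational turn: caller audio in, synthesized reply audio out.
async function runTurn(stack: VoiceStack, audio: Uint8Array): Promise<Uint8Array> {
  const transcript = await stack.stt(audio); // speech to text
  const reply = await stack.llm(transcript); // text to text
  return stack.tts(reply);                   // text to speech
}

// Assumption: the ~$0.50/hour figure from the comment, treated as a flat rate.
const USD_PER_HOUR = 0.5;

function estimateCostUsd(callSeconds: number): number {
  return (callSeconds / 3600) * USD_PER_HOUR;
}

// Stub stages so the sketch runs without any network access.
const stubStack: VoiceStack = {
  stt: async (audio) => `heard ${audio.length} bytes`,
  llm: async (prompt) => `reply to: ${prompt}`,
  tts: async (text) => new TextEncoder().encode(text),
};
```

At that flat rate, a 6-minute call (`estimateCostUsd(360)`) works out to about 5 cents. Swapping a provider only means replacing one of the three functions in the stack object.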

Replies

pugio • today at 4:43 AM

Do you have anything written up about how you're doing this? Curious to learn more...
