> Does elevenlabs have a real-time conversational voice model?
Yes.
> It seems like like their focus is largely on text to speech and speech to text.
They have two main broad offerings (“Platforms”); you seem to be looking at what they call the “Creative Platform”. The real-time conversational piece is the centerpiece of the “Agents Platform”.
It specifically says in the architecture docs for the agents platform that it's STT (ASR) -> LLM -> TTS
https://elevenlabs.io/docs/agents-platform/overview#architec...