Is RAG even necessary here? Minimal information like a couple of price lists with job times and opening hours should easily fit into any context window, right? It's not like he's dumping entire service manuals into the vector database here...
Every time I read or hear "the brain", my brain instantly shuts off.
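For what it's worth, here is a minimal sketch of the no-RAG version: inline the whole knowledge base into the system prompt and skip retrieval entirely. Everything in it is an assumption for illustration (the services, prices, job times, hours, and the `build_system_prompt` helper are all made up, not taken from the post), and the token estimate is only a rough rule of thumb.

```python
# Hypothetical data: every service, price, job time, and opening hour below
# is a placeholder for illustration, not taken from the original post.
PRICE_LIST = [
    {"service": "Brake service", "price_usd": 450, "job_hours": 2.0},
    {"service": "Oil change", "price_usd": 80, "job_hours": 0.5},
]
OPENING_HOURS = {"Mon-Fri": "08:00-18:00", "Sat": "09:00-13:00", "Sun": "closed"}


def build_system_prompt(price_list, opening_hours):
    """Inline the whole knowledge base into one system prompt (no retrieval step)."""
    lines = [
        "Answer only from the facts below. If the answer is not listed, "
        "say so and offer to take a message.",
        "",
        "Services:",
    ]
    for item in price_list:
        lines.append(
            f"- {item['service']}: ${item['price_usd']}, about {item['job_hours']} h"
        )
    lines.append("")
    lines.append("Opening hours:")
    for days, hours in opening_hours.items():
        lines.append(f"- {days}: {hours}")
    return "\n".join(lines)


prompt = build_system_prompt(PRICE_LIST, OPENING_HOURS)

# Rough size check: about 4 characters per token is a common rule of thumb,
# not an exact tokenizer count.
approx_tokens = len(prompt) // 4
```

Even a price list a hundred times longer than this would probably stay well within a typical context window, which is the point of the question.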
>and he’s losing thousands of dollars per month because he misses hundreds of calls per week. He’s under the hood all day. The phone rings, he can’t answer, the customer hangs up and calls someone else. That’s a lost job — sometimes a $450 brake service, sometimes a $2,000 engine repair — just gone because no one picked up.
How much does it cost to have an outsourced receptionist? Even if it is 500 a month, if we are really talking about thousands of dollars per month lost, your ROI is still crazy.
The number of negative comments here to someone building something is incredible.
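A back-of-the-envelope sketch of that ROI, under loud assumptions: 500 a month is just the figure floated above, 2000 is a placeholder lower bound for "thousands", and it pretends a receptionist recovers all of the lost revenue (and treats revenue as gain, which overstates it).

```python
def monthly_roi(recovered_revenue, receptionist_cost):
    """Return ROI as a multiple of cost: (gain - cost) / cost."""
    return (recovered_revenue - receptionist_cost) / receptionist_cost


# Assumed inputs, not real data: 2000 recovered per month, 500 per month cost.
roi = monthly_roi(2000, 500)
print(f"ROI: {roi:.0%}")
```

With those placeholder numbers the receptionist pays for itself several times over, and the ratio only improves if the real losses are higher.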
I appreciated your post and have some takeaways around text formatting for TTS in my own projects. Thanks!
At the moment I'm pretty inclined to hang up if I feel I'm wasting my time with a robot.
But maybe soon we will not even realise we are speaking to a robot, given the current speed of AI development.
I wonder how that will erode trust in calls. I moved from cold emailing and cold LinkedIn to cold calling because of the massive amounts of AI spam I have to compete with. But maybe cold calling will die soon as well if the robots emerge.
I admire what you have done, but for a luxury experience, I do not want to talk to an AI that just tells me what is already on the website. If I have gotten to the point where I am calling you, it's because I couldn’t find an answer to my question on the website in the first place.
I understand the other comments in this post; I too would be allergic to this sort of experience, luxury or not.
However, does the regular "joe/jane" feel the same way? I imagine my mom or dad would most likely not notice or care if they did.
Ignore the expected negativity; many here have not used the latest generation of voice agents in development. Even if it's only used as a router, I'd prefer that to waiting to get through.
Why not GPT voice directly instead of ElevenLabs for voice and Sonnet for intelligence?
clanker != luxury, quite the opposite
Honestly great work, but this is very much something where the results matter more than the product. It ends without a single comment about whether it worked in production.
> Wired up Claude for response generation — The retrieved documents get passed as context to Anthropic Claude (claude-sonnet-4-6) along with a strict system prompt: answer only from the knowledge base, keep responses short and conversational, and if you don’t know — say so and offer to take a message. No hallucinations allowed.
Claude will hallucinate anyway, sometimes.
I don't think there's any way around this other than a CLI or MCP tool that says "press the 'play prerecorded .WAV file' button that gives the brake repair service info and prices."
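Roughly what I mean, as a sketch only: the model's sole job is to pick an intent name from a fixed list, and playback is a plain lookup, so nothing it generates is ever spoken to the caller. The intent names, file paths, and `select_clip` helper are all hypothetical.

```python
# Hypothetical intent names and file paths, for illustration only.
CLIPS = {
    "brake_repair_info": "clips/brake_repair_info.wav",
    "opening_hours": "clips/opening_hours.wav",
}
FALLBACK_CLIP = "clips/take_a_message.wav"


def select_clip(intent: str) -> str:
    """Map a model-chosen intent to a prerecorded clip.

    Anything outside the fixed list falls back to the take-a-message clip,
    so free-form model output never reaches the caller.
    """
    normalized = intent.strip().lower()
    return CLIPS.get(normalized, FALLBACK_CLIP)
```

The model can still pick the wrong clip, but the worst case is a wrong recording or the fallback, not an invented price.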
This is an LLM generated slop post.
"No hallucinations allowed" :')
No idea what `luxury` is doing here, but if I get an LLM receptionist, that ain't it.
This isn't to disparage the project - I think this sort of usage will become very common and a decent standard that produces good consumer surplus in terms of reduced costs etc. Especially impressive is that it's a DIY family-first implementation that seems to be working. It's great hacker work.
But be warned it will erode - in general - the luxury previously associated with your brand, and also turn some customers away entirely.