> I'd skip this for now - it does not allow any kind of interactive conversation - as I lear...

Lapel2742 • yesterday at 9:12 AM • 1 reply • view on HN

> I'd skip this for now - it does not allow any kind of interactive conversation - as I learned after downloading 5G of models - it's a proof of concept that takes a wav file in.

I haven't looked into it that much but to my understanding a) You just need an audio buffer and b) Thye seem to support streaming (or at least it's planed)

> Looking at the library’s trajectory — ASR, streaming TTS, multilingual synthesis, and now speech-to-speech — the clear direction was always streaming voice processing. With this release, PersonaPlex supports it.

Replies

isodev • yesterday at 10:38 AM

> You just need an audio buffer

That alone to do right on macOS using Swift is an exercise in pain that even coding bots aren't able to solve first time right :)

➕ show 1 reply

alt Hacker News

Replies