Hacker News

ks2048 yesterday at 8:57 PM

Why do you say they "stopped focusing on AI"? I see a pretty consistent stream of good products, particularly in speech and OCR.



SyneRyder yesterday at 9:33 PM

I used to use Mistral OCR, but found it was better just to write a program that sent the documents to Claude Sonnet to OCR instead. Claude's output is far better quality, with better formatting and fewer errors.

I'm also using Voxtral TTS to try to replace OpenAI. It "works", but I've had problems with volume levels being radically different from one audio chunk to the next. It doesn't seem to "understand the full text" the way OpenAI's voice models do, which can be more expressive. Voxtral sometimes sounds robotic in the reading. Some Voxtral TTS output also occasionally has music in the background, which suggests their training corpus isn't that clean. Try generating a personalized news podcast, and the intro may occasionally sound like it has the BBC News music underneath...

As for not focusing on AI, there's this interview on the Big Technology Podcast from 2 months ago, where the Mistral CEO says their main focus is on helping companies fine-tune models for internal use, rather than on being a general model builder.

https://www.youtube.com/watch?v=xxUTdyEDpbU&t=1357s

dwedge yesterday at 9:01 PM

I ran their OCR on a few-hundred-page PDF of printed text that had no OCR layer. It cost me $5 and was useless; it did worse than Tesseract. That's how all my experience with Mistral has been.