logoalt Hacker News

rohan_joshitoday at 6:16 PM0 repliesview on HN

80M model is the highest quality while also being quite efficient. it is superior in terms of pronunciation accuracy for less common words, and also is more stable in terms of speed. its my fav model. i think the 40M is quite similar to 80M for most usecases. 15M is for resource cpus, loading onto a browser etc.

The new 15M is way better than the previous 80M model(v0.1). So we're able to predictably improve the quality which is very encouraging.