logoalt Hacker News

antireztoday at 6:16 PM1 replyview on HN

Different representations at the same bitrate may have features that make one a lot more resilient to errors. This thing about Italian, you fill find in any benchmark of vastly different AI transcribing models. You can find similar results also on the way LLMs mostly trained on English generalize usually very well with Italian. All this despite Italian accounting for marginal percentage of the training set. How do you explain that? I always cringe when people refute evidence.


Replies

testdelacc1today at 6:42 PM

Where is this evidence you’ve cited for your claims?