logoalt Hacker News

mattnewtontoday at 7:10 PM1 replyview on HN

My theory with no insider information: it’s a little of all of the above, but mostly money. To some extent, you can dig yourself out of a data hole with RL and a lot of compute. And you can buy a lot of compute and some data with a lot of money. Big labs have been operating in this regime for a while and it’s one of the drivers behind their costs beyond just scaling the weights and doing the actual training. Mistral just doesn’t have access to this level of compute or the money to try and muscle their way in.


Replies

MichaelZuotoday at 8:38 PM

Don’t they supposedly have a huge amount of EU support?

Or at least there’s been a lot of noise about that.

show 2 replies