Right, and my point is that if you use 80% Brazilian Portuguese during base model training plus 20% European Portuguese in post-training, you get pretty much exactly that, except with far more training data available.
And if the first 80% doesn't bias the language after post-training (which I think is what you're claiming), why not go with English or a mixture of languages? That's essentially what they did by starting from EuroLLM.
What's your evidence for that?