Hacker News

miohtama, yesterday at 11:06 PM

For the next model training version, would it make sense to incorporate all of these in the base model?


Replies

Bolwin, yesterday at 11:41 PM

Not all. In fact, a small model that has none of them built in but loads them on demand might be the most efficient approach.