getting tricks embedded into the weights is expensive, it doesn't happen in a single pass
they's why we teach them new tricks on the fly (in-context learning) with instruction files
Right, it sounds like an artificial limitation.
Right, it sounds like an artificial limitation.