logoalt Hacker News

valleyeryesterday at 7:16 PM1 replyview on HN

Why is this? Do labs reinforce the model name during training? I was under the impression that this sort of "self-knowledge" always came from the system prompt, but I guess not...


Replies

jdiffyesterday at 8:11 PM

Yes. In this case, during fine tuning. Other blurbs are also baked in during fine tuning that are perfectly reproducible from the Nex model. The details inside the linked issue are quite accessible.