logoalt Hacker News

magicalhippoyesterday at 10:12 AM0 repliesview on HN

> I think it should be acceptable for AI to learn and produce, but not to learn and copy.

Ok but that's just a training issue then. Have model A be trained on human input. Have model A generate synthetic training data for model B. Ensure the prompts used to train B are not part of A's training data. Voila, model B has learned to produce rather than copy.

Many state of the art LLMs are trained in such a two-step way since they are very sensitive to low-quality training data.