Hacker News

SeanLang today at 2:37 PM

Couldn't you create synthetic data based on your entries using local models? Or would that defeat the purpose of fine-tuning it?


Replies

embedding-shape today at 3:01 PM

Yeah, I suppose, but how do I get sufficiently high-quality synthetic data without sending the original data to OpenAI/Anthropic? The alternative is local models, and none of them seem strong enough to generate that "sufficiently high-quality synthetic data" in the first place.
