logoalt Hacker News

stymaaryesterday at 10:16 PM0 repliesview on HN

That could be. Just use pre-training for language understanding and let the post-training on synthetic data do the heavy lifting.