Nor that the training from scratch will even work.

amelius • yesterday at 10:51 PM

Exactly, that is the current objective: to prove that generation for a specific domain is on par with causal attention models.
