It's possible that the training data (and research data) is already out there, just not (yet) combined into a single open source CAD kernel.
Then again, the success of such a project might depend on other factors. Given the complexity of the task, I can imagine that just "lucking into" the right design decisions early on could have a major impact.