Security issues aside, I really like the idea of a common open database with this kind of agent docs. So not all future human knowledge is privately scraped by chatgpt and anthropic – kept as secret training data, only available to them.
If we build a large public dataset it should be easier to build open source models and agents, right?