There are some papers (e.g. [0]) showing that skill and agent files reduce reasoning effectiveness in some use cases (e.g. when the files are autogenerated).
[0] https://arxiv.org/abs/2602.11988
reference: https://news.ycombinator.com/item?id=47034087