llm works great in closed loop so they can self correct but we don't have a reliable way to lint and test specs we need a new language for that