The tip of the spear in agentic code harnesses today is to RL-train them as dedicated conductor/orchestrator models.
Not 200 lines of Python.
Can you elaborate on this?