I've found RTK CLI proxy [1] quite useful for reducing token usage
[1]: https://github.com/rtk-ai/rtk/