https://platform.claude.com/docs/en/build-with-claude/prompt...
suggests the can cache outside the gpu.